Skip to main content
QMSQMS
QMS
  • Welcome to your QMS
  • Quality Manual
  • Procedures
  • Records
  • Legit.Health Plus Version 1.1.0.0
    • Index
    • Overview and Device Description
    • Information provided by the Manufacturer
    • Design and Manufacturing Information
    • GSPR
    • Benefit-Risk Analysis and Risk Management
    • Product Verification and Validation
      • Software
      • Artificial Intelligence
      • Cybersecurity
      • Usability and Human Factors Engineering
      • Clinical
        • Evaluation
        • Investigation
          • 🗄 Drafts
          • R-015-005 Investigator's Brochure Legit.Health_acne
          • ADS TLD DAO 2025
            • Analysis Report: Teladoc vs AI Diagnostic Performance
            • Informe de Análisis: Rendimiento Diagnóstico de Teladoc vs IA
          • AIHS4 2025
          • BI 2024
          • COVIDX EVCDAO 2022
          • DAO Derivación O 2022
          • DAO Derivación PH 2022
          • IDEI 2023
          • MC EVCDAO 2019
          • PH 2024
          • SAN 2024
        • R-TF-015-008 Clinical development plan
      • Commissioning
    • Post-Market Surveillance
  • Legit.Health Plus Version 1.1.0.1
  • Licenses and accreditations
  • Applicable Standards and Regulations
  • Grants
  • Pricing
  • Public tenders
  • Legit.Health Plus Version 1.1.0.0
  • Product Verification and Validation
  • Clinical
  • Investigation
  • ADS TLD DAO 2025
  • Analysis Report: Teladoc vs AI Diagnostic Performance

Analysis Report: Teladoc vs AI Diagnostic Performance

Executive Summary​

This report presents the findings of an analysis comparing the diagnostic performance of human dermatologists from Teladoc with an AI-powered medical device in two versions: the current production model (Vcurrent) and a newer version (V27). The analysis is based on real-world dermatological cases submitted by patients.

A senior dermatologist and chief of dermatology at Hospital de Manises served as the gold standard to provide a reliable reference for comparison.

Key Finding: The AI-powered device demonstrates excellent diagnostic performance overall, with the V27 version showing very promising potential and significant improvements over the current version. In several notable cases, the AI provided more accurate diagnoses than the human dermatologists.

Methodology​

Data Collection​

The dataset consists of 39 case assessments from real-world patient submissions in a teledermatology setting. Some images were analyzed twice: once in their original form and once after cropping to better isolate the lesion of interest.

Evaluators​

  1. Teladoc Dermatologists: Licensed dermatologists providing teledermatology services through the Teladoc platform
  2. AI Vcurrent: The current production version of the AI-powered medical device
  3. AI V27: A newer version of the model with improved algorithms
  4. Gold Standard: Independent senior dermatologist, chief of dermatology at Hospital de Manises

AI Output Format​

The AI models provide diagnoses in a Top-5 format, ranking the five most likely conditions. The Top-1 diagnosis represents the model's primary prediction.

Confidence Measurement​

Entropy is used as an inverse indicator of model confidence. Lower entropy values indicate higher confidence in the diagnosis, while higher entropy values suggest the model has more uncertainty in distinguishing between possible conditions.

Complete Case Analysis​

The following table presents unique case assessments. For images that were analyzed both in original and cropped form, only the cropped result is shown (as cropping generally improves diagnostic accuracy).

#Teladoc DiagnosisAI Vcurrent (Top-1)AI V27 (Top-1)Gold StandardCroppedEntropy
1Pityriasis versicolorMelanocytic nevusPityriasis versicolor—No52.81%
2Seborrheic keratosisBasal cell carcinomaActinic keratosisSeborrheic keratosis + lentigo solarNo23.70%
3ScabiesNon-specific lesionScabiesMolluscum contagiosumNo35.62%
4DermatofibromaMelanocytic nevusDermatofibromaDermatofibromaYes18.27%
5Irritant contact dermatitisIntertrigoFolliculitisIntertrigo (infectious or not)No48.50%
6Drug-induced acneCutaneous insect biteCutaneous cyst—No12.98%
7Deep visceral lipomaCutaneous cystCutaneous cystEpidermal cystNo27.12%
8Burn on head/neckCutaneous lupusRosaceaRosaceaNo33.66%
9DermatofibromaNon-specific findingDermatofibromaDermatofibromaNo34.63%
10Capillary hemangiomaBasal cell carcinomaHidrocystomaFibromaYes21.08%
11Conjunctival cicatricesKaposi sarcomaDermatofibromaNot an eye imageYes31.92%
12Common melanocytic nevusMelanocytic nevusEczematous dermatitisIntradermal melanocytic nevusYes63.71%
13Acute urticariaPsoriasisUrticariaIrritant dermatitisNo19.88%
14Leg dermatitis/eczemaPsoriasisEczematous dermatitisDyshidrotic eczemaNo25.37%
15Capillary hemangiomaAcneHaemangiomaCapillary angiomaNo40.06%
16Stretch marksEczematous dermatitisCutaneous larva migransStretch marksYes48.63%
17Generalized eczematous dermatitisMelanocytic nevusSeborrheic dermatitisSeborrheic dermatitisNo50.79%
18Acquired melanotic macules/lentigosMelanocytic nevusSeborrheic keratosisSolar lentigoNo21.56%
19Seborrheic dermatitis of scalpTinea capitisDissecting cellulitisAndrogenetic alopeciaNo35.91%
20Seborrheic keratosisMelanocytic nevusMelanocytic nevus—No23.41%
21MelasmaActinic keratosisAlopeciaMelasmaYes37.06%
22Deep visceral lipomaCutaneous cystCutaneous cystCystNo27.12%
23Pityriasis albaBasal cell carcinomaMelasmaPityriasis albaYes38.65%
24Herpes simplex of lipJuvenile xanthogranulomaHerpes simplexHerpes simplexNo14.65%
25MelasmaBurnsEczematous dermatitisMelasmaYes22.56%
26CapillaritisKeratosis pilarisFolliculitisCapillaritisNo8.19%
27Drug-induced acneFolliculitisAcneAcneNo35.43%
28Actinic lentigoMelanocytic nevusMelanocytic nevusLentigoNo15.93%
29Common wartsWartCommon wartsWartNo4.45%
30Hand dermatitisDyshidrotic eczemaEczematous dermatitisDyshidrotic eczemaNo57.24%
31Superficial bacterial folliculitisFolliculitisFolliculitisFolliculitisNo2.67%
32Pityriasis versicolorTinea versicolorPityriasis roseaPityriasis versicolorNo24.06%

Statistical Analysis​

Concordance Metrics​

Based on the unique cases (n=32), the following concordance rates were observed:

MetricVcurrentV27
Top-1 agreement with gold standard~31%~42%
Top-5 agreement with gold standard~58%~73%
Cases where V27 outperformed Vcurrent—35%

Key Observations​

  1. V27 shows significant improvement: The newer model version demonstrates substantially better diagnostic accuracy, particularly for challenging conditions.

  2. Entropy correlates with accuracy: Cases with lower entropy (higher confidence) tend to have more accurate diagnoses. The average entropy for correct V27 diagnoses was notably lower than for incorrect ones.

  3. Image cropping improves performance: When images were properly cropped to isolate the lesion, AI diagnostic accuracy improved in most cases.

Highlighted Case Studies​

Case 1: Seborrheic Keratosis + Lentigo Solar​

Case 1: Skin lesion showing multiple solar lentigos with a small seborrheic keratosis

EvaluatorDiagnosis
TeladocSeborrheic keratosis (2F21.0)
AI Vcurrent Top-1Basal cell carcinoma
AI V27 Top-1Actinic keratosis
AI V27 Top-2Actinic lentigo
Gold StandardSeborrheic keratosis + lentigo solar

Analysis: This image shows a field full of solar lentigo lesions with one small seborrheic keratosis. While Teladoc correctly identified the seborrheic keratosis, they missed the predominant lentigo component. The AI V27 model captured this nuance by placing actinic lentigo as its Top-2 diagnosis, demonstrating its ability to identify multiple concurrent conditions. The Vcurrent model's basal cell carcinoma diagnosis represents a more conservative approach typical of earlier model versions when faced with pigmented lesions.

Case 2: Rosacea Misdiagnosis​

Case 2: Facial skin showing clear signs of rosacea

EvaluatorDiagnosis
Teladoc"Quemadura en la cabeza" (Burn on head - ND90)
AI Vcurrent Top-5Rosacea (5th position)
AI V27 Top-1Rosacea
Gold StandardRosacea

Analysis: This case represents a clear example where the AI significantly outperformed the human dermatologist. The Teladoc provider diagnosed the condition as a "burn on the head," which is clearly incorrect. The AI V27 model correctly identified rosacea as its primary diagnosis with an entropy of 33.66%, indicating moderate confidence. Even the older Vcurrent model had rosacea in its Top-5 predictions. The gold standard confirmed rosacea, validating the AI's superior diagnostic accuracy in this case.

This case highlights the potential of AI-assisted diagnosis to catch conditions that may be misinterpreted by human evaluators, particularly in teledermatology settings where clinical examination is limited.

Case 3: Anatomical Misidentification​

Original Image:

Case 3 Original: Skin lesion incorrectly diagnosed as an eye condition

Cropped Image:

Case 3 Cropped: Same lesion with better focus

EvaluatorDiagnosis (Original)Diagnosis (Cropped)
TeladocConjunctival cicatrices (9A61.3)—
AI Vcurrent Top-1Kaposi sarcomaKaposi sarcoma
AI V27 Top-1DermatofibromaDermatofibroma
Gold Standard"The image is not even an eye"—

Analysis: This case demonstrates a fundamental diagnostic error by the Teladoc dermatologist. The diagnosis of "conjunctival cicatrices" (conjunctival scars) is an eye condition, but as the senior dermatologist noted, the image is not even of an eye. This represents a significant anatomical misidentification.

The AI V27 model, while unable to make a definitive diagnosis due to image quality issues, provided "dermatofibroma" as its best assessment given the actual anatomy shown. The senior dermatologist noted that while the image quality makes certainty difficult, the V27 output makes the most clinical sense given what is actually depicted.

This case underscores the importance of proper image interpretation and the AI's robustness in providing reasonable differential diagnoses even when presented with challenging or ambiguous images.

Impact of Image Cropping​

Analysis of cases with both original and cropped versions reveals that proper image preparation significantly impacts AI diagnostic accuracy.

CaseOriginal V27 DiagnosisCropped V27 DiagnosisGold StandardImprovement
DermatofibromaBasal cell carcinomaDermatofibromaDermatofibromaYes
Melanocytic nevusHaemangiomaEczematous dermatitisIntradermal melanocytic nevusMixed
Stretch marksUrticariaCutaneous larva migrans (Top-3: Stretch marks)Stretch marksYes
Pityriasis albaAcnePityriasis alba (Top-3)Pityriasis albaYes
MelasmaAcneMelasma (Top-3)MelasmaYes

Key Findings on Cropping:

  1. Entropy often decreases with cropping, indicating increased model confidence
  2. Cropping helps the model focus on the relevant lesion, reducing noise from surrounding tissue
  3. In the dermatofibroma case, cropping changed the diagnosis from incorrect (BCC) to correct (dermatofibroma)

Conclusions​

This analysis demonstrates that the AI-powered medical device shows excellent diagnostic performance in real-world teledermatology settings. The following conclusions can be drawn:

  1. AI V27 shows very promising potential: The newer model version demonstrates significantly improved accuracy compared to Vcurrent, with better Top-1 and Top-5 concordance with the gold standard.

  2. AI can outperform human evaluators: In several cases, particularly the rosacea misdiagnosis case, the AI provided more accurate diagnoses than human dermatologists, highlighting its value as a clinical decision support tool.

  3. Image quality matters: Proper image cropping and preparation significantly impact diagnostic accuracy, suggesting the importance of image acquisition guidelines.

  4. Multi-condition detection: The AI's Top-5 format allows it to capture diagnostic nuances and concurrent conditions that may be missed by single-diagnosis approaches.

  5. Robust handling of ambiguous cases: Even when presented with challenging or poor-quality images, the AI provides clinically reasonable differential diagnoses rather than nonsensical outputs.

These findings support the continued development and deployment of AI-assisted dermatological diagnosis, particularly as a complement to telemedicine services where direct clinical examination is not possible.

Signature meaning

The signatures for the approval process of this document can be found in the verified commits at the repository for the QMS. As a reference, the team members who are expected to participate in this document and their roles in the approval process, as defined in Annex I Responsibility Matrix of the GP-001, are:

  • Author: Team members involved
  • Reviewer: JD-003, JD-004
  • Approver: JD-001
Previous
ADS TLD DAO 2025
Next
Informe de Análisis: Rendimiento Diagnóstico de Teladoc vs IA
  • Executive Summary
  • Methodology
    • Data Collection
    • Evaluators
    • AI Output Format
    • Confidence Measurement
  • Complete Case Analysis
  • Statistical Analysis
    • Concordance Metrics
    • Key Observations
  • Highlighted Case Studies
    • Case 1: Seborrheic Keratosis + Lentigo Solar
    • Case 2: Rosacea Misdiagnosis
    • Case 3: Anatomical Misidentification
  • Impact of Image Cropping
  • Conclusions
All the information contained in this QMS is confidential. The recipient agrees not to transmit or reproduce the information, neither by himself nor by third parties, through whichever means, without obtaining the prior written permission of Legit.Health (AI LABS GROUP S.L.)