Skip to main content
QMSQMS
QMS
  • Welcome to your QMS
  • Quality Manual
  • Procedures
  • Records
  • Legit.Health Plus Version 1.1.0.0
  • Legit.Health Plus Version 1.1.0.1
  • Legit.Health version 2.1 (Legacy MDD)
  • Legit.Health US Version 1.1.0.0
  • Legit.Health Utilities
  • Licenses and accreditations
  • Applicable Standards and Regulations
  • BSI Non-Conformities
    • Technical Review
    • Clinical Review
      • Round 1
        • Item 0: Background & Action Plan
        • Item 1: CER Update Frequency
        • Item 2: Device Description & Claims
        • Item 3: Clinical Data
        • Item 4: Usability
        • Item 5: PMS Plan
        • Item 6: PMCF Plan
        • Item 7: Risk
        • completed-tasks
          • task-3b10-legacy-pms-document-hierarchy-refactor
          • task-3b11-sme-coverage-subspecialty-documentation
          • task-3b12-phase-1-exploratory-per-bucket-c-feature
          • task-3b13-man-2025-cep-cip-completeness
          • task-3b14-ifu-integration-requirements-verification
          • task-3b4-mrmc-dark-phototypes
          • task-3b6-surrogate-endpoint-literature-review
            • Appraisal log — CRIT1–7 rolling table
            • Do we need this task?
            • Integration map — propagation of the surrogate-endpoint validity review
            • references
              • diagnostic-accuracy
                • Conic 2018 — Impact of melanoma surgical timing on survival (NCDB)
                • Daneshjou 2022 — Disparities in dermatology AI performance on a diverse clinical image set (DDI) [BALANCING]
                • Dick 2019 — Accuracy of computer-aided diagnosis of melanoma: a meta-analysis
                • Esteva 2017 — Dermatologist-level classification of skin cancer with deep neural networks
                • Freeman 2020 — Algorithm-based smartphone apps for skin cancer risk: BMJ systematic review [BALANCING]
                • Gershenwald 2017 — AJCC 8th edition: melanoma staging and survival gradient
                • Haenssle 2018 — Man against machine: CNN vs 58 dermatologists for melanoma recognition
                • Haenssle 2020 — Man against machine reloaded: market-approved CNN (Moleanalyzer Pro) vs 96 dermatologists
                • Han 2018 — Clinical-image classification for benign and malignant tumours (cross-ethnicity) [BALANCING]
                • Liu 2020 — A deep learning system for differential diagnosis of skin diseases
                • Salinas 2024 — Systematic review and meta-analysis of AI vs. clinicians for skin cancer diagnosis
                • Tschandl 2020 — Human–computer collaboration for skin cancer recognition
                • Winkler 2023 — Dermatologists cooperating with a CNN: prospective clinical study
              • referral-optimisation
              • severity-assessment
            • Research prompts — external deep-research tools
            • Surrogate-Endpoint Validity in Dermatology AI — Structured Literature Review
          • task-3b7-icd-per-epidemiological-group-vv
          • task-3b8-safety-confirmation-column-definition
          • task-3b9-legacy-pms-conclusions-into-plus-pms-plan
        • Coverage matrix
        • resources
        • Task 3b-5: Autoimmune and Genodermatoses Triangulated-Evidence Package
      • Evidence rank & phases
      • Pre-submission review of R-TF-015-001 CEP and R-TF-015-003 CER
  • Pricing
  • Public tenders
  • Trainings
  • BSI Non-Conformities
  • Clinical Review
  • Round 1
  • completed-tasks
  • task-3b6-surrogate-endpoint-literature-review
  • references
  • diagnostic-accuracy
  • Han 2018 — Clinical-image classification for benign and malignant tumours (cross-ethnicity) [BALANCING]

Han 2018 — Clinical-image classification for benign and malignant tumours (cross-ethnicity) [BALANCING]

Citation​

Han SS, Kim MS, Lim W, Park GH, Park I, Chang SE. Classification of the clinical images for benign and malignant cutaneous tumors using a deep learning algorithm. J Invest Dermatol. 2018 Jul;138(7):1529–1538. DOI: 10.1016/j.jid.2018.01.028. PMID 29428356.

Study design and population​

Retrospective multi-dataset validation of a ResNet-152 CNN trained on Asan (Korean, FST III–V) + Atlas datasets (~19,398 images, 12 diagnostic categories). External validation on Asan, Edinburgh (Caucasian) and Hallym test sets. 16 Korean dermatologists compared on a 480-image subset.

Reported metrics​

  • Asan internal test — AUC BCC 0.96 ± 0.01; SCC 0.83; IEC 0.82; melanoma 0.96
  • Edinburgh external test — AUC BCC 0.90; SCC 0.91; IEC 0.83; melanoma 0.88
  • Hallym external test — BCC sensitivity 87.1 % ± 6.0 %
  • 95 % CIs not reported (cross-validation SDs)

Surrogate-to-outcome linkage​

Cross-ethnicity external-validation evidence: melanoma AUC drop from 0.96 (Asian training/test) to 0.88 (Caucasian test) illustrates generalisation limits. Diagnostic accuracy as a proxy for appropriate biopsy is population-dependent; training-set ethnic composition materially affects performance, with implications for the surrogate-to-outcome chain in under-represented phenotypes.

CRIT1–7 appraisal​

CriterionScoreJustification
CRIT1 Relevance3Direct — classifier performance across ethnic / phototype groups.
CRIT2 Methodology2Multi-dataset external validation; head-to-head dermatologist comparison; no prospective deployment.
CRIT3 Reporting2Per-dataset AUCs with SDs; no parametric CIs.
CRIT4 Applicability3Directly relevant to MDR Annex I §17.2 intended-population generalisability.
CRIT5 Evidence weight1Retrospective multi-dataset validation.
CRIT6 Risk of bias2Training-set ethnically homogeneous (Korean); external datasets vary; no outcome follow-up.
CRIT7 Contribution3Complementary balancing reference (with Daneshjou 2022) — explicit cross-ethnicity generalisability quantification.

Aggregate: strong (as balancing reference).

Limitations and notes​

Ethnic / skin-tone heterogeneity handled via dataset comparison rather than FST stratification; external dataset sizes modest.

Strength as anchor​

Strong as a balancing reference. Complements Daneshjou 2022 by providing cross-ethnic (not just cross-FST) external-validation evidence. Confirms the spectrum-bias concern from Dick 2019 meta-analysis with population-level granularity.

Previous
Haenssle 2020 — Man against machine reloaded: market-approved CNN (Moleanalyzer Pro) vs 96 dermatologists
Next
Liu 2020 — A deep learning system for differential diagnosis of skin diseases
  • Citation
  • Study design and population
  • Reported metrics
  • Surrogate-to-outcome linkage
  • CRIT1–7 appraisal
  • Limitations and notes
  • Strength as anchor
All the information contained in this QMS is confidential. The recipient agrees not to transmit or reproduce the information, neither by himself nor by third parties, through whichever means, without obtaining the prior written permission of Legit.Health (AI Labs Group S.L.)