Skip to main content
QMSQMS
QMS
  • Welcome to your QMS
  • Quality Manual
  • Procedures
  • Records
  • Legit.Health Plus Version 1.1.0.0
  • Legit.Health Plus Version 1.1.0.1
  • Legit.Health version 2.1 (Legacy MDD)
  • Legit.Health US Version 1.1.0.0
  • Legit.Health Utilities
  • Licenses and accreditations
  • Applicable Standards and Regulations
  • BSI Non-Conformities
    • Technical Review
    • Clinical Review
      • Round 1
        • Item 0: Background & Action Plan
        • Item 1: CER Update Frequency
        • Item 2: Device Description & Claims
        • Item 3: Clinical Data
        • Item 4: Usability
        • Item 5: PMS Plan
        • Item 6: PMCF Plan
        • Item 7: Risk
        • completed-tasks
          • task-3b10-legacy-pms-document-hierarchy-refactor
          • task-3b11-sme-coverage-subspecialty-documentation
          • task-3b12-phase-1-exploratory-per-bucket-c-feature
          • task-3b13-man-2025-cep-cip-completeness
          • task-3b14-ifu-integration-requirements-verification
          • task-3b4-mrmc-dark-phototypes
          • task-3b6-surrogate-endpoint-literature-review
            • Appraisal log — CRIT1–7 rolling table
            • Do we need this task?
            • Integration map — propagation of the surrogate-endpoint validity review
            • references
              • diagnostic-accuracy
                • Conic 2018 — Impact of melanoma surgical timing on survival (NCDB)
                • Daneshjou 2022 — Disparities in dermatology AI performance on a diverse clinical image set (DDI) [BALANCING]
                • Dick 2019 — Accuracy of computer-aided diagnosis of melanoma: a meta-analysis
                • Esteva 2017 — Dermatologist-level classification of skin cancer with deep neural networks
                • Freeman 2020 — Algorithm-based smartphone apps for skin cancer risk: BMJ systematic review [BALANCING]
                • Gershenwald 2017 — AJCC 8th edition: melanoma staging and survival gradient
                • Haenssle 2018 — Man against machine: CNN vs 58 dermatologists for melanoma recognition
                • Haenssle 2020 — Man against machine reloaded: market-approved CNN (Moleanalyzer Pro) vs 96 dermatologists
                • Han 2018 — Clinical-image classification for benign and malignant tumours (cross-ethnicity) [BALANCING]
                • Liu 2020 — A deep learning system for differential diagnosis of skin diseases
                • Salinas 2024 — Systematic review and meta-analysis of AI vs. clinicians for skin cancer diagnosis
                • Tschandl 2020 — Human–computer collaboration for skin cancer recognition
                • Winkler 2023 — Dermatologists cooperating with a CNN: prospective clinical study
              • referral-optimisation
              • severity-assessment
            • Research prompts — external deep-research tools
            • Surrogate-Endpoint Validity in Dermatology AI — Structured Literature Review
          • task-3b7-icd-per-epidemiological-group-vv
          • task-3b8-safety-confirmation-column-definition
          • task-3b9-legacy-pms-conclusions-into-plus-pms-plan
        • Coverage matrix
        • resources
        • Task 3b-5: Autoimmune and Genodermatoses Triangulated-Evidence Package
      • Evidence rank & phases
      • Pre-submission review of R-TF-015-001 CEP and R-TF-015-003 CER
  • Pricing
  • Public tenders
  • Trainings
  • BSI Non-Conformities
  • Clinical Review
  • Round 1
  • completed-tasks
  • task-3b6-surrogate-endpoint-literature-review
  • references
  • diagnostic-accuracy
  • Freeman 2020 — Algorithm-based smartphone apps for skin cancer risk: BMJ systematic review [BALANCING]

Freeman 2020 — Algorithm-based smartphone apps for skin cancer risk: BMJ systematic review [BALANCING]

Citation​

Freeman K, Dinnes J, Chuchu N, Takwoingi Y, Bayliss SE, Matin RN, et al. Algorithm based smartphone apps to assess risk of skin cancer in adults: systematic review of diagnostic accuracy studies. BMJ. 2020 Feb 10;368:m127. DOI: 10.1136/bmj.m127. PMID 32041693.

Study design and population​

Cochrane-style systematic review (PROSPERO CRD42016033595); 9 studies evaluating 6 commercial algorithm-based smartphone apps for adult skin-cancer risk triage; QUADAS-2 risk-of-bias assessment.

Reported metrics​

  • SkinVision (3 studies; n = 267 lesions; 66 (pre)malignant): pooled sensitivity 80 % (95 % CI 63–92); specificity 78 % (95 % CI 67–87)
  • Revised SkinVision (pigmented-only dataset): sensitivity 88 % (95 % CI 70–98); specificity 79 % (95 % CI 70–86)
  • SkinScan: 0 % sensitivity for melanoma (n = 5)
  • High risk of bias across primary studies; high unevaluable-image rates

Surrogate-to-outcome linkage​

BALANCING reference. Quantifies the limit of current algorithm-based triage apps: heterogeneous performance, missed melanomas in some CE-marked products, and high unevaluable-image rates. Provides direct evidence that diagnostic-accuracy claims cannot be assumed generic across AI products — each device requires its own clinical-data package. Directly relevant to the MDR benefit–risk argument.

CRIT1–7 appraisal​

CriterionScoreJustification
CRIT1 Relevance3Direct — systematic review of AI / app-based skin-cancer diagnostic accuracy.
CRIT2 Methodology3PROSPERO-registered; PRISMA; QUADAS-2 risk-of-bias assessment.
CRIT3 Reporting3Pooled sens/spec with 95 % CIs; per-app stratification.
CRIT4 Applicability3Directly addresses the intended-use concerns (community deployment, CE-marked products).
CRIT5 Evidence weight3Systematic review — highest tier.
CRIT6 Risk of bias2Constituent primary studies heterogeneous; clinician-recruited rather than user-recruited; high unevaluable rates.
CRIT7 Contribution3MANDATORY balancing reference — establishes the device-specific clinical-data requirement that the EU MDR and MDCG 2020-1 Pillar 2/3 enforce.

Aggregate: very strong (as balancing reference).

Limitations and notes​

Primary studies largely enriched-prevalence, clinician-recruited (not user-recruited); generalisation to consumer-deployment scenarios limited; not all CE-marked products evaluated.

Strength as anchor​

Mandatory balancing reference. BSI Erin has historically flagged selective citation as a risk; inclusion of Freeman 2020 demonstrates transparent acknowledgement that "AI dermatology" is not homogeneous — each device must be evaluated on its own performance data. Supports the justification for our own CIP/CIR evidence stack over reliance on surrogate literature.

Previous
Esteva 2017 — Dermatologist-level classification of skin cancer with deep neural networks
Next
Gershenwald 2017 — AJCC 8th edition: melanoma staging and survival gradient
  • Citation
  • Study design and population
  • Reported metrics
  • Surrogate-to-outcome linkage
  • CRIT1–7 appraisal
  • Limitations and notes
  • Strength as anchor
All the information contained in this QMS is confidential. The recipient agrees not to transmit or reproduce the information, neither by himself nor by third parties, through whichever means, without obtaining the prior written permission of Legit.Health (AI Labs Group S.L.)