Skip to main content
QMSQMS
QMS
  • Welcome to your QMS
  • Quality Manual
  • Procedures
  • Records
  • Legit.Health Plus Version 1.1.0.0
  • Legit.Health Plus Version 1.1.0.1
  • Legit.Health version 2.1 (Legacy MDD)
  • Legit.Health US Version 1.1.0.0
  • Legit.Health Utilities
  • Licenses and accreditations
  • Applicable Standards and Regulations
  • BSI Non-Conformities
    • Technical Review
    • Clinical Review
      • Round 1
        • Item 0: Background & Action Plan
        • Item 1: CER Update Frequency
        • Item 2: Device Description & Claims
        • Item 3: Clinical Data
        • Item 4: Usability
        • Item 5: PMS Plan
        • Item 6: PMCF Plan
        • Item 7: Risk
        • completed-tasks
          • task-3b10-legacy-pms-document-hierarchy-refactor
          • task-3b11-sme-coverage-subspecialty-documentation
          • task-3b12-phase-1-exploratory-per-bucket-c-feature
          • task-3b13-man-2025-cep-cip-completeness
          • task-3b14-ifu-integration-requirements-verification
          • task-3b4-mrmc-dark-phototypes
          • task-3b6-surrogate-endpoint-literature-review
            • Appraisal log — CRIT1–7 rolling table
            • Do we need this task?
            • Integration map — propagation of the surrogate-endpoint validity review
            • references
            • Research prompts — external deep-research tools
            • Surrogate-Endpoint Validity in Dermatology AI — Structured Literature Review
          • task-3b7-icd-per-epidemiological-group-vv
          • task-3b8-safety-confirmation-column-definition
          • task-3b9-legacy-pms-conclusions-into-plus-pms-plan
        • Coverage matrix
        • resources
        • Task 3b-5: Autoimmune and Genodermatoses Triangulated-Evidence Package
      • Evidence rank & phases
      • Pre-submission review of R-TF-015-001 CEP and R-TF-015-003 CER
  • Pricing
  • Public tenders
  • Trainings
  • BSI Non-Conformities
  • Clinical Review
  • Round 1
  • completed-tasks
  • task-3b6-surrogate-endpoint-literature-review
  • Appraisal log — CRIT1–7 rolling table

Appraisal log — CRIT1–7 rolling table

Rolling scoring table across all included references. Per-publication appraisal files live in references/<domain>/<first-author-year-keyword>.md. CRIT scores per MEDDEV 2.7/1 Rev 4 / CEP literature-review methodology: 1 (weak) – 3 (strong) per criterion.

Legend​

  • CRIT1 Relevance to device / indication / surrogate domain
  • CRIT2 Quality of study methodology (design, sample size, controls)
  • CRIT3 Quality of reporting (endpoint definitions, statistical analysis, 95 % CIs)
  • CRIT4 Applicability to intended population (image-based dermatology, clinician-supervised use)
  • CRIT5 Evidence weight (1 retrospective / validation · 2 RCT / prospective cohort / consensus · 3 meta-analysis / systematic review / regulatory guideline)
  • CRIT6 Risk of bias
  • CRIT7 Contribution to specific surrogate-validity claim

Inclusion threshold: aggregate across CRIT1–CRIT7 such that the reference materially contributes to an anchor claim. Balancing references intentionally included for completeness.


Domain 1 — Diagnostic accuracy (benefit 7GH)​

ReferenceC1C2C3C4C5C6C7Strength
Esteva 2017 (Nature)3222123Strong (landmark)
Haenssle 2018 (Ann Oncol)3222223Strong
Haenssle 2020 (Ann Oncol, CE-marked)3233223Very strong
Tschandl 2020 (Nat Med, human-AI collab)3333223Very strong
Liu 2020 (Nat Med, DLS)3233123Strong
Dick 2019 (JAMA Derm, meta-analysis)3332323Very strong
Salinas 2024 (NPJ Digit Med, SR/MA)3333323Very strong
Winkler 2023 (JAMA Derm, prospective)3333223Very strong
Gershenwald 2017 (CA Cancer J Clin, AJCC 8)3333223Very strong (outcome anchor)
Conic 2018 (JAAD, NCDB surgical delay)3223123Strong (outcome bridge)
Daneshjou 2022 (Sci Adv) — BALANCING3333123Very strong (balancing)
Han 2018 (J Invest Dermatol) — BALANCING3223123Strong (balancing)
Freeman 2020 (BMJ, SR) — BALANCING3333323Very strong (balancing)

Domain 1 summary: 13 references (10 positive + 3 balancing). Minimum ≥ 8 cleared; target 10–12 achieved. Evidence base anchored by 3 systematic reviews / meta-analyses (Dick 2019, Salinas 2024, Freeman 2020), 1 prospective clinical study (Winkler 2023), 4 landmark reader studies (Esteva, Haenssle 2018/2020, Tschandl 2020), 1 large clinical-use validation (Liu 2020), 2 outcome-anchor references (Gershenwald 2017, Conic 2018) and 2 cross-ethnic/phototype balancing references (Daneshjou 2022, Han 2018).


Domain 2 — Severity scoring (benefit 5RB)​

ReferenceC1C2C3C4C5C6C7Strength
EMA 2004 (CHMP/EWP/2454/02, psoriasis guideline)3n/a33323Very strong (regulatory)
Schmitt 2014 (J Allergy Clin Immunol, HOME IV)3333323Very strong (consensus)
Simpson 2016 (NEJM, dupilumab SOLO 1/2)3333223Very strong (pivotal RCT)
King 2022 (NEJM, baricitinib BRAVE-AA)3333223Very strong (pivotal RCT)
Olsen 2004 (JAAD, SALT definition)3n/a23123Strong (foundational)
Mattei 2014 (JEADV, PASI-DLQI SR)3323323Very strong (quantitative anchor)
Mrowietz 2011 (Arch Dermatol Res, European treat-to-target)3233223Strong (operational anchor)
Fink 2018 (JEADV, PASI variability)3223123Strong (reliability anchor)
Schaap 2022 (JEADV, CNN PASI)3223123Strong (AI analytic validity)
Huang 2023 (JMIR Derm, AI PASI SkinTeller)3223223Strong (AI outperforms mean dermatologist)

Domain 2 summary: 10 references. Minimum ≥ 6 cleared; target 8–10 achieved. Evidence base anchored by 1 EU regulatory guideline (EMA 2004), 1 international consensus statement (HOME 2014), 2 pivotal phase-3 RCT papers (Simpson 2016 dupilumab, King 2022 baricitinib), 1 foundational instrument-definition paper (Olsen 2004), 2 quantitative-linkage references (Mattei 2014, Mrowietz 2011), 1 manual-reliability reference (Fink 2018), 2 AI-PASI analytic-validity references (Schaap 2022, Huang 2023).


Domain 3 — Referral optimisation / care-pathway (benefit 3KX)​

ReferenceC1C2C3C4C5C6C7Strength
Eminović 2009 (Arch Dermatol, cluster RCT)3333223Very strong (RCT, referral reduction)
Whited 2013 (J Telemed Telecare, RCT)3322223Strong (RCT, outcome equivalence)
Armstrong 2018 (JAMA Netw Open, equivalency RCT)3333223Very strong (RCT, chronic disease)
Finnane 2017 (JAMA Derm, SR)3323323Very strong (SR)
Chuchu 2018 (Cochrane, SR)3333323Very strong (Cochrane SR)
Giavina-Bianchi 2020 (eClinicalMedicine, 30K pts)3222123Strong (real-world wait-time anchor)
Moreno-Ramirez 2007 (Arch Dermatol, Seville)3233123Strong (EU wait-time anchor)
Snoswell 2016 (JAMA Derm, cost-effectiveness SR)3323323Very strong (health-economic SR)
Jain 2021 (JAMA Netw Open, AI triage PCP / NP)3233223Strong (AI-decision-support uplift)

Domain 3 summary: 9 references. Minimum ≥ 6 cleared; target 8–10 achieved. Evidence base anchored by 3 RCTs (Eminović 2009, Whited 2013, Armstrong 2018), 3 systematic reviews (Finnane 2017, Chuchu 2018 Cochrane, Snoswell 2016), 2 large real-world wait-time references (Giavina-Bianchi 2020, Moreno-Ramirez 2007) and 1 AI-triage uplift reference (Jain 2021).


Aggregate coverage​

DomainMinimumTargetAchievedPrimary source of evidence weight
Diagnostic accuracy (7GH)≥ 810–12133 SR/MA + 1 prospective + 4 reader studies + 2 outcome anchors + 3 balancing
Severity scoring (5RB)≥ 68–10101 EMA guideline + 1 international consensus + 2 phase-3 RCTs + 2 AI-PASI validations
Referral optimisation (3KX)≥ 68–1093 RCTs + 3 SRs + 2 real-world wait-time + 1 AI-triage
Total≥ 2026–3232

Mandatory balancing references present:

  • Daneshjou 2022 (phototype bias) — confirms generalisability limit
  • Han 2018 (cross-ethnicity generalisation) — complementary phototype evidence
  • Freeman 2020 (BMJ SR of smartphone apps) — confirms AI-dermatology heterogeneity across products; underpins device-specific clinical-data requirement

Declared evidence gaps (for CER and PMCF declaration)​

  1. No AI-dermatology RCT with mortality or stage-shift as primary endpoint. The surrogate-to-outcome chain rests on reader-study accuracy equivalence plus independently established AJCC stage-survival gradient.
  2. Phototype-stratified prospective evidence is sparse. Daneshjou 2022 quantifies the gap; motivates PMCF subgroup-stratified performance monitoring.
  3. Long-term automated-severity-scoring outcome data are thin. Analytic-validity evidence (Schaap 2022, Huang 2023) strong; durable DLQI / POEM outcome data following AI-PASI deployment unavailable.
  4. AI-triage RCT evidence lags teledermatology RCT evidence. Eminović 2009, Whited 2013, Armstrong 2018 test human-teledermatologist workflows; direct RCT of AI-triage vs. teledermatology not yet published.

These gaps are declared in surrogate-validity-review.md §8 and tracked as PMCF-plan commitments in R-TF-007-002.

Previous
MRMC cross-study comparison — BI_2024 · PH_2024 · SAN_2024 · MAN_2025
Next
Do we need this task?
  • Legend
  • Domain 1 — Diagnostic accuracy (benefit 7GH)
  • Domain 2 — Severity scoring (benefit 5RB)
  • Domain 3 — Referral optimisation / care-pathway (benefit 3KX)
  • Aggregate coverage
  • Declared evidence gaps (for CER and PMCF declaration)
All the information contained in this QMS is confidential. The recipient agrees not to transmit or reproduce the information, neither by himself nor by third parties, through whichever means, without obtaining the prior written permission of Legit.Health (AI Labs Group S.L.)