R-TF-028-001 AI Description
Table of contents
- Purpose
- Scope
- Non-Clinical Model Overview
- Description and Specifications
- Integration and Environment
- References
- Traceability to QMS Records
Purpose
This document defines the specifications, performance requirements, and data needs for the non-clinical Artificial Intelligence (AI) models used in the Legit.Health Plus device.
Scope
This document details the design and performance specifications for all non-clinical AI algorithms integrated into the Legit.Health Plus device. It establishes the foundation for the development, validation, and risk management of these models.
This description covers the following key areas for each algorithm:
- Algorithm description, clinical objectives, and justification.
- Performance endpoints and acceptance criteria.
- Specifications for the data required for development and evaluation.
- Requirements related to cybersecurity, transparency, and integration.
- Links between the AI specifications and the overall risk management process.
Non-Clinical Model Overview
The Legit.Health Plus device integrates several non-clinical AI models that are essential for robust, equitable, and high-quality operation of the system. These models do not provide clinical diagnostic outputs, but instead perform technical, quality, and contextual functions that support the overall performance, safety, and fairness of the device. Non-clinical models include:
- Image quality and preprocessing models (e.g., color correction)
- Contextual attribute models (e.g., skin tone identification, body site identification)
- Technical validation models (e.g., 3D reconstruction for area quantification)
These models:
- Perform quality assurance, preprocessing, and technical validation
- Enable downstream clinical models to operate within validated domains and with standardized inputs
- Support equity, bias mitigation, and performance monitoring across diverse populations
- Provide structured, non-clinical metadata (e.g., skin tone, body site, image quality) to enhance device reliability and fairness
- Do not generate clinical diagnostic or severity outputs, nor do they provide interpretative distributions of ICD categories
Key Non-Clinical Models and Their Functions:
- Acneiform Inflammatory Pattern Identification: Translates objective lesion counts and density into standardized IGA severity scores, supporting consistent acne severity assessment.
- Skin Tone Identification: Automatically classifies images by Fitzpatrick and Monk skin tone scales to support bias mitigation, personalization, and regulatory compliance.
- Body Site Identification: Detects anatomical regions present in images, enabling context-aware processing, BSA calculations, and site-specific workflow optimization.
- 3D Surface Area Quantification: Transforms 2D image segmentations into real-world 3D measurements, supporting accurate, reproducible area and volume calculations for research and quality assurance.
- Color Correction: Standardizes color representation in images using reference markers, ensuring reliable color features for downstream models and human interpretation.
These non-clinical models are described in detail in the following section.
Description and Specifications
Acneiform Inflammatory Pattern Identification
Description
A mathematical equation ingests the tabular features derived from the Acneiform Inflammatory Lesion Quantification algorithm and outputs a score on the [0, 4] scale, aligned with the Investigator's Global Assessment (IGA).
The equation, with its fitted parameters, takes as input:
- The number of acneiform inflammatory lesions, N.
- The density of acneiform inflammatory lesions, D.
The final output is scaled by a factor of 2.5 to align with a [0, 10] scale rather than [0, 4], for a more granular output.
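The exact functional form and fitted parameters of the equation are defined elsewhere in the device documentation. Purely as an illustration, a hypothetical saturating mapping from the lesion count N and density D to an IGA-aligned score, with the x2.5 rescaling to [0, 10], might look like the sketch below. The function names, the logistic form, and the values of `alpha` and `beta` are all assumptions, not the device's validated equation.

```python
import math

def iga_score(n_lesions: float, density: float,
              alpha: float = 0.05, beta: float = 8.0) -> float:
    """Hypothetical IGA-aligned score on [0, 4].

    Combines lesion count and density through a saturating
    (logistic-style) curve; alpha and beta are illustrative
    parameters, not the device's validated values.
    """
    x = alpha * n_lesions + beta * density
    return 4.0 / (1.0 + math.exp(-(x - 2.0)))  # saturates toward 4

def aladin_score(n_lesions: float, density: float) -> float:
    """Rescale the [0, 4] IGA-aligned score to [0, 10] (x2.5)."""
    return 2.5 * iga_score(n_lesions, density)
```

The key property any such mapping must preserve is monotonicity: more lesions or higher density never lowers the severity score.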
Figure: Sample images with acneiform inflammatory lesion detections and their confidence, the number of lesions (N), the density of the lesions (D), the calculated IGA scores, and the calculated ALADIN scores.
Objectives
- Support healthcare professionals in providing standardized acne severity assessment using the validated Investigator's Global Assessment (IGA) scale.
- Reduce inter-observer variability in IGA scoring, which shows moderate agreement (κ = 0.50-0.70) between raters in clinical practice [112].
- Enable automated severity classification by translating objective lesion counts and density into clinically meaningful IGA categories.
- Ensure reproducibility by basing severity assessment on quantitative features rather than subjective visual impression.
- Facilitate treatment decision-making by providing standardized severity grades that align with evidence-based treatment guidelines (e.g., topical therapy for mild, systemic therapy for severe).
- Support clinical trial endpoints by providing consistent, reproducible IGA assessments as required by regulatory agencies.
Justification (Clinical Evidence):
- The IGA scale is a widely validated tool for acne severity assessment and is the most commonly used primary endpoint in acne clinical trials [113, 114].
- Manual IGA assessment shows substantial inter-observer variability (κ = 0.50-0.70), with particular difficulty in distinguishing between adjacent grades [112].
- Objective lesion counting combined with algorithmic severity classification has been shown to improve consistency (κ improvement to 0.75-0.85) compared to purely visual IGA assessment [115].
- Treatment guidelines are explicitly linked to IGA grades, with clear recommendations for topical monotherapy (IGA 1-2), combination therapy (IGA 2-3), and systemic therapy consideration (IGA 3-4) [116].
- Regulatory agencies require validated severity measures for acne trials, with IGA being the most accepted scale for primary efficacy endpoints [117].
- Studies show that automated severity grading reduces assessment time by 40-60% while maintaining or improving accuracy compared to manual grading [118].
Endpoints and Requirements
Performance is evaluated using the Pearson correlation coefficient between the predicted scores and the expert consensus, to ensure that the model aligns with the criteria of expert dermatologists.
| Metric | Threshold | Interpretation |
|---|---|---|
| Pearson correlation | ≥ Expert inter-observer correlation | Model performance is non-inferior to expert inter-observer variability |
Justification of the success criteria:
- IGA is the scoring system recommended by the FDA for acne severity assessment in clinical trials. Therefore, we seek a high correlation with this scale.
- IGA is inherently subjective, with documented inter-observer variability among dermatologists.
- The established success criterion ensures that the model's predictions are no less reliable than those made by expert dermatologists, making it suitable for clinical and research applications.
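A minimal sketch of how the non-inferiority criterion could be checked, assuming a bootstrap 95% confidence interval on the Pearson correlation; the helper names and the bootstrap approach are illustrative, not a prescribed method:

```python
import numpy as np

def pearson_with_ci(pred, consensus, n_boot=2000, seed=0):
    """Pearson r between model predictions and expert consensus,
    with a bootstrap 95% confidence interval."""
    pred = np.asarray(pred, dtype=float)
    consensus = np.asarray(consensus, dtype=float)
    r = float(np.corrcoef(pred, consensus)[0, 1])
    rng = np.random.default_rng(seed)
    idx = rng.integers(0, len(pred), size=(n_boot, len(pred)))
    boot = np.array([np.corrcoef(pred[i], consensus[i])[0, 1] for i in idx])
    lo, hi = np.percentile(boot, [2.5, 97.5])
    return r, (float(lo), float(hi))

def non_inferior(r_ci_low, r_inter_observer):
    """Succeed if the lower CI bound of the model-consensus correlation
    is at least the expert inter-observer correlation."""
    return r_ci_low >= r_inter_observer
```

Comparing the lower bound of the confidence interval (rather than the point estimate) against the expert baseline is one conservative way to operationalize "non-inferior with 95% confidence".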
Requirements:
- Implement a tabular model (e.g., gradient boosting, mathematical equation, random forest, neural network, or other ML model) that:
- Accepts numerical inputs derived from Acneiform Inflammatory Lesion Quantification, such as the total inflammatory lesion count, lesion density, anatomical site identifiers, and affected surface area.
- Outputs a severity score highly correlated to the IGA scale.
- Demonstrate a correlation with the ground-truth data non-inferior to the inter-observer variability among expert dermatologists.
- Report all metrics with 95% confidence intervals.
- Validate the model on an independent and diverse dataset including:
- Full range of IGA grades (0-4)
- Diverse patient populations (e.g., various Fitzpatrick skin types)
- Ensure outputs are compatible with:
- FHIR-based structured reporting for interoperability
- Clinical decision support systems for acne treatment recommendations
- Treatment guidelines that specify interventions based on IGA grade
- Clinical trial data collection systems requiring standardized IGA assessments
- Document the model optimization strategy including:
- Feature design
- Hyperparameter optimization methodology
- Rationale for model selection (if multiple architectures compared)
- Provide evidence that:
- The model generalizes across different patient populations
- Predictions align with dermatologist consensus and clinical treatment guidelines
Skin Tone Identification
Description
A deep learning multi-class classification model ingests a clinical dermatological image and outputs two probability distributions: one for the six Fitzpatrick skin tone categories and another for the ten Monk skin tone categories.
Fitzpatrick Skin Tone Classification
The model outputs a probability vector $(p_1, \dots, p_6)$, where each $p_i$ corresponds to the probability that the skin in the image belongs to Fitzpatrick skin tone $i$, and $\sum_{i=1}^{6} p_i = 1$.
The Fitzpatrick skin tones are defined as:
- Type I: Very fair skin, always burns, never tans (pale white skin, often with red/blonde hair)
- Type II: Fair skin, usually burns, tans minimally (white skin, burns easily)
- Type III: Medium skin, sometimes burns, tans uniformly (cream white skin, burns moderately)
- Type IV: Olive skin, rarely burns, tans easily (moderate brown skin)
- Type V: Brown skin, very rarely burns, tans very easily (dark brown skin)
- Type VI: Dark brown to black skin, never burns, tans very easily (deeply pigmented dark brown to black skin)
The predicted Fitzpatrick type is:

$\hat{i} = \arg\max_{i \in \{1, \dots, 6\}} p_i$
Monk Skin Tone Classification
The model outputs a probability vector $(q_0, \dots, q_9)$, where each $q_j$ corresponds to the probability that the skin in the image belongs to Monk skin tone category $j$, and $\sum_{j=0}^{9} q_j = 1$.
The Monk skin tone categories are defined as:
- Category 0: Lightest skin tone
- Category 1: Very light skin tone
- Category 2: Light skin tone
- Category 3: Light-medium skin tone
- Category 4: Medium skin tone
- Category 5: Medium-dark skin tone
- Category 6: Dark skin tone
- Category 7: Very dark skin tone
- Category 8: Deeply dark skin tone
- Category 9: Darkest skin tone
The predicted Monk skin tone is:

$\hat{j} = \arg\max_{j \in \{0, \dots, 9\}} q_j$
Additional Outputs
For both the Fitzpatrick and Monk classifications, the model outputs a confidence score representing the certainty of the classification.
Figure: Sample images with their predicted Fitzpatrick and Monk skin tone categories.
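As an illustration of the decision rule described above, the argmax over each probability vector can be sketched as follows. The assumption that the confidence score is the maximum class probability is ours; the document does not mandate a particular confidence definition.

```python
import numpy as np

def classify_skin_tone(fitz_probs, monk_probs):
    """Return (predicted class, confidence) for each scale.

    Confidence here is taken as the maximum class probability,
    an assumption consistent with the argmax decision rule.
    Fitzpatrick types are indexed 1-6, Monk categories 0-9.
    """
    fitz_probs = np.asarray(fitz_probs, dtype=float)
    monk_probs = np.asarray(monk_probs, dtype=float)
    assert len(fitz_probs) == 6 and np.isclose(fitz_probs.sum(), 1.0)
    assert len(monk_probs) == 10 and np.isclose(monk_probs.sum(), 1.0)
    fitz_type = int(np.argmax(fitz_probs)) + 1   # types indexed 1..6
    monk_cat = int(np.argmax(monk_probs))        # categories indexed 0..9
    return ((fitz_type, float(fitz_probs.max())),
            (monk_cat, float(monk_probs.max())))
```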
Objectives
- Enable automated skin tone detection to support personalized dermatological AI models that require skin tone information for accurate predictions.
- Reduce assessment variability in skin tone classification, which shows moderate inter-observer agreement (κ = 0.50-0.65) even among dermatologists [213, 214].
- Support bias mitigation in AI models by identifying underrepresented skin tones in datasets and ensuring equitable performance across all Fitzpatrick and Monk skin tones.
- Facilitate treatment personalization by providing objective skin tone information relevant for phototherapy dosing, laser treatment parameters, and topical therapy selection.
- Enable research stratification by providing consistent skin tone classification for clinical trials and real-world evidence studies.
- Support regulatory compliance by ensuring AI models are validated across diverse skin tones as required by regulatory guidelines.
- Improve telemedicine accessibility by providing automated skin tone assessment in remote settings where patient-reported skin tone may be unreliable.
Justification (Clinical Evidence):
- Fitzpatrick skin tone is a critical factor in dermatological assessment, influencing disease presentation, treatment selection, and AI model performance [215, 216].
- Self-reported Fitzpatrick type shows poor accuracy, with concordance to expert assessment ranging from 40-60%, particularly for intermediate types (III-IV) [217, 218].
- AI model performance shows significant disparities across skin tones, with accuracy degradation of 10-30% for darker skin tones (V-VI) when models are trained on predominantly lighter skin datasets [219, 220].
- Automated skin tone detection enables adaptive AI models that adjust prediction thresholds or use skin tone-specific models, improving accuracy by 15-25% for underrepresented groups [221].
- Treatment dosing for phototherapy and laser procedures requires accurate skin tone assessment, with misclassification leading to suboptimal efficacy or adverse events in 15-20% of cases [222].
- Clinical trials increasingly require Fitzpatrick type stratification to demonstrate equitable treatment efficacy and safety across diverse populations [223].
- Studies show that objective skin tone classification improves inter-rater reliability from κ = 0.50-0.65 (manual) to κ = 0.75-0.85 (automated) [224].
- Automated detection addresses the limitation of visual assessment under different lighting conditions, which can shift perceived skin tone by 1-2 categories [225].
Endpoints and Requirements
Performance is evaluated using classification accuracy and mean absolute error compared to expert Fitzpatrick and Monk assessments.
| Metric | Threshold | Interpretation |
|---|---|---|
| Fitzpatrick Accuracy | ≥ inter-rater variability | Performance non-inferior to expert criteria. |
| Fitzpatrick MAE | ≤ 1 | Average error of at most one category from expert criteria. |
| Monk Accuracy | ≥ inter-rater variability | Performance non-inferior to expert criteria. |
| Monk MAE | ≤ 1 | Average error of at most one category from expert criteria. |
All thresholds must be achieved with 95% confidence intervals.
Threshold Justification:
- The difficulty of skin tone assessment varies significantly between clinical and non-clinical settings.
- The difficulty of skin tone assessment depends on the illuminance quality of the images, which varies significantly between datasets.
- Therefore, it is more appropriate to evaluate model performance against the inter-rater variability established for the specific evaluation dataset, rather than a fixed absolute threshold.
- The ordinal nature of skin tone categories means that adjacent-type misclassifications are more clinically acceptable than distant errors. Therefore, a Mean Absolute Error (MAE) of ≤ 1 category allows for acceptable errors in the vicinity of the true stage, reflecting real-world clinical variability.
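The accuracy and MAE endpoints above can be sketched as a small evaluation helper; this is illustrative only, assuming `pred` and `truth` are integer category labels on the same ordinal scale:

```python
import numpy as np

def ordinal_metrics(pred, truth):
    """Accuracy and mean absolute error for ordinal skin tone labels.

    MAE treats the categories as ordinal, so an adjacent-category
    error contributes 1 and a two-category error contributes 2.
    """
    pred = np.asarray(pred, dtype=float)
    truth = np.asarray(truth, dtype=float)
    accuracy = float(np.mean(pred == truth))
    mae = float(np.mean(np.abs(pred - truth)))
    return accuracy, mae
```

Because MAE weights errors by ordinal distance, it captures the requirement that adjacent-category misclassifications are more acceptable than distant ones.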
Requirements:
- Implement a deep learning classification architecture optimized for skin tone analysis.
- Output structured data including:
- Probability distribution across all categories
- Predicted skin tone with confidence score
- Demonstrate performance meeting or exceeding all thresholds:
- Overall accuracy ≥ inter-rater variability
- MAE ≤ 1
- Ensure outputs are compatible with:
- Downstream AI models that require skin tone information as input
- FHIR-based structured reporting for interoperability
- Clinical decision support systems for treatment personalization
- Bias monitoring dashboards tracking AI performance across skin tones
- Research data collection systems for clinical trial stratification
- Document the training strategy including:
- Data collection protocol ensuring balanced representation
- Multi-expert annotation protocol for ground truth establishment
- Handling of class imbalance (if present)
- Data augmentation strategies preserving skin tone characteristics
- Regularization and calibration techniques
- Transfer learning approach (if applicable)
- Provide evidence that:
- The model provides equitable performance for all categories (no systematic bias)
- Predictions align with expert dermatologist consensus
- Include bias assessment and mitigation:
- Regular auditing of performance disparities across skin tones
- Documentation of dataset composition by skin tone category
- Strategies for addressing underrepresentation in training data
- Transparency reporting on per-type performance metrics
- Continuous monitoring of real-world performance across diverse populations
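A minimal sketch of the per-category performance audit described above, assuming accuracy as the monitored metric; the disparity definition here (maximum minus minimum per-category accuracy) is our assumption, not a mandated formula:

```python
import numpy as np

def per_category_accuracy(pred, truth, categories):
    """Per-skin-tone accuracy plus the largest pairwise disparity,
    for auditing performance across skin tone categories."""
    pred, truth = np.asarray(pred), np.asarray(truth)
    acc = {}
    for c in categories:
        mask = truth == c
        if mask.any():  # skip categories absent from the audit set
            acc[c] = float(np.mean(pred[mask] == truth[mask]))
    disparity = max(acc.values()) - min(acc.values())
    return acc, disparity
```

Reporting the disparity alongside per-category accuracy makes systematic bias visible in a single number that can be tracked over time.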
Clinical Impact:
The Skin Tone Identification model serves multiple critical functions:
- Bias mitigation: Enables skin tone-aware AI models that maintain equitable performance across all populations
- Treatment personalization: Supports accurate dosing for phototherapy, laser procedures, and skin-tone-specific therapeutics
- Research equity: Ensures clinical trials include and stratify diverse skin tones for representative evidence
- Quality assurance: Validates that dermatological AI systems perform equitably across all skin tone categories
- Regulatory compliance: Demonstrates AI model validation across diverse populations as required by regulatory agencies
- Clinical workflow integration: Provides automated skin tone documentation for electronic health records
Skin 3D Reconstruction
Description
This method transforms 2D pixel coordinates from a standard 2D image into 3D metric world coordinates, enabling comprehensive and accurate spatial analysis of skin surfaces.
This method leverages 3D metric maps and camera calibration parameters to convert pixel coordinates into real-world measurements, accounting for depth variations and perspective distortion.
For any given pixel coordinate $(u, v)$, the 3D metric world coordinates $(X, Y, Z)$ are computed using the following equation:

$$\begin{pmatrix} X \\ Y \\ Z \end{pmatrix} = d(u, v) \, K^{-1} \begin{pmatrix} u \\ v \\ 1 \end{pmatrix}$$

Where:
- $K$ is the camera intrinsic matrix.
- $d(u, v)$ is the metric depth value at pixel $(u, v)$.
By applying this mathematical transformation to every point of a target surface segmentation, the method allows for straightforward geometric analysis, including the calculation of area, perimeter, axes, volume, and depth.
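Under the pinhole camera model implied by the description, the back-projection of a pixel with known metric depth can be sketched as follows; the intrinsic values in `K` are illustrative placeholders, not calibration results from the device:

```python
import numpy as np

def backproject(u, v, depth, K):
    """Back-project pixel (u, v) with metric depth d into 3D camera
    coordinates: X = d * K^{-1} [u, v, 1]^T."""
    pix = np.array([u, v, 1.0])
    return depth * np.linalg.inv(K) @ pix

# Illustrative intrinsics: focal lengths fx = fy = 800 px,
# principal point (cx, cy) = (320, 240). Assumed values only.
K = np.array([[800.0,   0.0, 320.0],
              [  0.0, 800.0, 240.0],
              [  0.0,   0.0,   1.0]])
```

As a sanity check, the principal point at depth 0.5 m back-projects to (0, 0, 0.5): the ray through the optical center has no lateral offset. Applying this to every pixel of a segmentation yields a 3D point grid whose triangulated faces give area, perimeter, and volume.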
Figure: Left: Sample image with a delineated target surface and 3 reference markers; results show the area (A), perimeter (P), depth (D), volume (V), and dimensions (width x height) in cm. Center: depth map of the image and marker detections. Right: 3D visualization of the target surface, markers, and surface axes (width, height, and depth), side perspective.
Objectives
- Enable accurate surface area quantification in body surface area (BSA) affected calculations for severity scoring systems (PASI, EASI, burn assessment, vitiligo VASI).
- Account for depth variation across non-planar body surfaces, providing more accurate measurements than simple 2D planimetry.
- Reduce measurement error associated with perspective distortion, camera angle, and irregular body surface curvature.
- Provide calibrated measurements in standardized physical units (e.g., mm²) for clinical documentation and research.
- Enable automated BSA percentage calculation by combining surface area measurements with body site identification.
- Support telemedicine workflows where physical ruler measurements are impractical or unavailable.
Justification (Clinical Evidence):
- Body surface area quantification is fundamental to severity scoring in dermatology, with PASI, EASI, and burn assessment all requiring accurate BSA affected estimates [275, 276].
- Manual BSA estimation shows high inter-observer variability (coefficient of variation 20-40%), particularly for irregular lesions or when visual estimation methods are used [277, 278].
- Simple 2D planimetry without depth correction introduces systematic errors of 15-35% when measuring non-planar body surfaces due to perspective distortion and surface curvature [279].
- Reference-based calibration has been validated in wound measurement showing accuracy within 5-10% of gold-standard methods (water displacement, 3D scanning) [280, 281].
- Monocular depth estimation combined with calibration markers achieves mean absolute error <8% for surface area quantification on curved surfaces [282].
- Automated BSA quantification improves reproducibility in clinical trials, with standardized measurements showing 50-70% reduction in outcome variability compared to visual estimation [283].
- Depth-aware surface area calculation is particularly critical for body sites with significant curvature (joints, torso, scalp) where 2D approximations introduce substantial error [284].