Preliminary Statistical Analysis — Synthetic Data Validation
Dataset: synthetic-data-60.csv (60 synthetic respondents)
Date: 2026-04-09
Purpose: Validate that the physician questionnaire produces statistically meaningful data before deployment to real clients.
Important: This analysis uses synthetic data generated to validate the questionnaire design. The statistical conclusions below (significance levels, effect sizes, power) confirm that the questionnaire can produce meaningful results if real-world responses follow plausible distributions. These results must not be cited as evidence in the CER or BSI response. Only the real data study report (Phase 5) constitutes citable evidence. Known artifacts in the synthetic dataset are documented in synthetic-data-improvements.md; these artifacts affect data realism but do not invalidate the questionnaire design validation, which is the purpose of this analysis.
1. Likert summary statistics (per benefit)
All Likert questions use a 1–5 scale (1 = Strongly disagree, 5 = Strongly agree). Neutral = 3.0.
Benefit 7GH — Diagnostic accuracy
| Question | Description | n | Mean | Median | SD | 95% CI |
|---|---|---|---|---|---|---|
| B1 | General diagnostic accuracy | 60 | 3.77 | 4.0 | 1.18 | [3.46, 4.07] |
| B3 | Rare disease identification | 60 | 3.43 | 4.0 | 1.21 | [3.12, 3.75] |
| B5 | Malignancy detection/triage | 60 | 3.47 | 4.0 | 1.26 | [3.14, 3.79] |
Benefit 5RB — Objective severity assessment
| Question | Description | n | Mean | Median | SD | 95% CI |
|---|---|---|---|---|---|---|
| C1 | Reproducibility | 60 | 4.15 | 5.0 | 1.12 | [3.86, 4.44] |
| C2 | Treatment monitoring | 60 | 3.97 | 4.0 | 1.07 | [3.69, 4.24] |
| C3 | Inter-observer consistency | 60 | 2.92 | 3.0 | 1.18 | [2.61, 3.22] |
Benefit 3KX — Care pathway optimisation
| Question | Description | n | Mean | Median | SD | 95% CI |
|---|---|---|---|---|---|---|
| D1 | Waiting time reduction | 60 | 3.58 | 4.0 | 1.36 | [3.23, 3.93] |
| D3 | Referral adequacy | 60 | 4.12 | 4.0 | 1.04 | [3.85, 4.39] |
| D5 | Remote care enablement | 39 | 4.03 | 4.0 | 1.11 | [3.66, 4.39] |
Overall
| Question | Description | n | Mean | Median | SD | 95% CI |
|---|---|---|---|---|---|---|
| E1 | Overall benefit assessment | 60 | 3.77 | 4.0 | 1.33 | [3.42, 4.11] |
Safety
| Question | Description | n | Mean | Median | SD | 95% CI |
|---|---|---|---|---|---|---|
| F3 | Overall device safety | 60 | 4.15 | 4.0 | 0.90 | [3.92, 4.38] |
Interpretation: Benefit Likert means range from 2.92 (C3, inter-observer consistency) to 4.15 (C1, reproducibility). The wider spread compared to uniform clustering reflects realistic variation — some benefits receive stronger endorsement than others. The 95% CI for C3 and D1 includes or approaches neutral (3.0), indicating genuinely mixed opinions on these dimensions. The safety Likert (F3, mean 4.15) confirms that respondents broadly consider the device safe, even when they report lower benefit scores.
2. Quantitative summary statistics (stratified by data source)
Data source is determined by the evidence quality control question: (a) consulted records vs. (b) professional estimate. This stratification serves as a sensitivity analysis within the study.