analytics-and-reliability
How does Burna AI measure agreement between AI suggestions and clinicians?
Cohen's kappa is the primary metric. It is computed over the clinician-versus-AI grade comparisons from all reviewed cases. Two further metrics are tracked alongside kappa: override rate (how often clinicians change the AI suggestion) and confidence calibration (how well the AI's stated confidence predicts the override rate). All three metrics feed the Quality tab in real time. Externally, we characterise the results as strong agreement with expert clinicians in ongoing internal testing. Specific accuracy percentages are reserved for investor materials and peer-reviewed publications and do not appear in marketing copy.
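For readers who want to see what these three metrics look like in practice, here is a minimal sketch in Python. It is illustrative only: this FAQ does not describe how Burna AI implements them, so the function names (`cohens_kappa`, `override_rate`, `calibration_table`), the unweighted form of kappa, the equal-width confidence bins, and the assumption that any grade mismatch counts as an override are all hypothetical.

```python
from collections import Counter


def cohens_kappa(clinician, ai):
    """Unweighted Cohen's kappa between two equal-length lists of grades."""
    if len(clinician) != len(ai) or not clinician:
        raise ValueError("need two non-empty lists of equal length")
    n = len(clinician)
    # Observed agreement: share of cases where both grades match.
    observed = sum(c == a for c, a in zip(clinician, ai)) / n
    # Chance agreement: from each rater's marginal grade frequencies.
    c_counts, a_counts = Counter(clinician), Counter(ai)
    expected = sum(c_counts[g] * a_counts[g] for g in c_counts) / (n * n)
    if expected == 1.0:
        return 1.0  # both raters used one identical grade throughout
    return (observed - expected) / (1 - expected)


def override_rate(clinician, ai):
    """Share of cases where the clinician's grade differs from the AI's.

    Assumption: a mismatch is treated as an override.
    """
    if len(clinician) != len(ai) or not ai:
        raise ValueError("need two non-empty lists of equal length")
    return sum(c != a for c, a in zip(clinician, ai)) / len(ai)


def calibration_table(confidences, clinician, ai, n_bins=5):
    """Group cases into equal-width confidence bins (assumed scheme) and
    compare mean stated confidence with the share of accepted suggestions."""
    bins = [[] for _ in range(n_bins)]
    for conf, c, a in zip(confidences, clinician, ai):
        idx = min(int(conf * n_bins), n_bins - 1)  # conf == 1.0 goes in last bin
        bins[idx].append((conf, c == a))
    rows = []
    for i, items in enumerate(bins):
        if not items:
            continue  # skip empty bins
        rows.append({
            "bin": i,
            "n": len(items),
            "mean_confidence": sum(conf for conf, _ in items) / len(items),
            "acceptance_rate": sum(ok for _, ok in items) / len(items),
        })
    return rows


# Made-up example grades, not real data.
clinician = ["A", "A", "B", "B", "A", "B", "A", "A"]
ai = ["A", "A", "B", "A", "A", "B", "B", "A"]
print(cohens_kappa(clinician, ai))
print(override_rate(clinician, ai))
```

In a well-calibrated system, each bin's `acceptance_rate` would sit close to its `mean_confidence`; a large gap in either direction suggests the stated confidence is over- or under-stating how often clinicians accept the suggestion. If grades are ordinal, a weighted kappa may be a better fit than the unweighted form shown here.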