Article (Scientific journals)
Specialized Large Language Model Outperforms Neurologists at Complex Diagnosis in Blinded Case-Based Evaluation
Barrit, Sami; Torcida, Nathan; Mazeraud, Aurelien et al.
2025In Brain Sciences, 15 (4), p. 347
Peer Reviewed verified by ORBi
 

Files


Full Text
brainsci-15-00347-v3.pdf
Author postprint (2.66 MB)
Download

All documents in ORBi UMONS are protected by a user license.

Send to



Details



Abstract :
[en] Background/Objectives: Artificial intelligence (AI), particularly large language models (LLMs), has demonstrated versatility in various applications but faces challenges in specialized domains like neurology. This study evaluates a specialized LLM’s capability and trustworthiness in complex neurological diagnosis, comparing its performance to neurologists in simulated clinical settings. Methods: We deployed GPT-4 Turbo (OpenAI, San Francisco, CA, US) through Neura (Sciense, New York, NY, US), an AI infrastructure with a dual-database architecture integrating “long-term memory” and “short-term memory” components on a curated neurological corpus. Five representative clinical scenarios were presented to 13 neurologists and the AI system. Participants formulated differential diagnoses based on initial presentations, followed by definitive diagnoses after receiving conclusive clinical information. Two senior academic neurologists blindly evaluated all responses, while an independent investigator assessed the verifiability of AI-generated information. Results: AI achieved a significantly higher normalized score (86.17%) compared to neurologists (55.11%, p < 0.001). For differential diagnosis questions, AI scored 85% versus 46.15% for neurologists, and for final diagnosis, 88.24% versus 70.93%. AI obtained 15 maximum scores in its 20 evaluations and responded in under 30 s compared to neurologists’ average of 9 min. All AI-provided references were classified as relevant with no hallucinatory content detected. Conclusions: A specialized LLM demonstrated superior diagnostic performance compared to practicing neurologists across complex clinical challenges. This indicates that appropriately harnessed LLMs with curated knowledge bases can achieve domain-specific relevance in complex clinical disciplines, suggesting potential for AI as a time-efficient asset in clinical practice.
Disciplines :
Neurology
Author, co-author :
Barrit, Sami ;  Neurosurgery, Université Libre de Bruxelles, 1070 Brussels, Belgium ; Neurosurgery, CHU Tivoli, 7110 La Louvière, Belgium ; Neurodynamics Laboratory, Department of Neurosurgery, Boston Children’s Hospital, Harvard Medical School, Boston, MA 02115, USA ; Sciense, New York, NY 10013, USA
Torcida, Nathan ;  Sciense, New York, NY 10013, USA ; Neurology, Université Libre de Bruxelles, 1050 Brussels, Belgium
Mazeraud, Aurelien ;  Anesthésie-Réanimation, GHU Paris, Pôle Neuro, 75014 Paris, France ; Neurosciences, Université de Paris, 75006 Paris, France
Boulogne, Sebastien ;  Neurophysiology and Epileptology, Universite de Lyon, 69007 Lyon, France
Benoit, Jeanne ;  Neurology, CHU de Nice, Université Côte d’Azur, UMR2CA, 06000 Nice, France
Carette, Timothée;  Neurology, Université Catholique de Louvain, Clinique Saint-Pierre Ottignies, 1348 Louvain-la-Neuve, Belgium
Carron, Thibault;  LIP6, CNRS, Sorbonne Université, 75005 Paris, France
Delsaut, Bertil;  Neurology, Université Libre de Bruxelles, 1050 Brussels, Belgium ; Neurology, CHU Tivoli, 7110 La Louvière, Belgium
Diab, Eva ;  Clinical Neurophysiology, CHU Amiens Picardie, CHIMERE UR 7516 UPJV, 80054 Amiens, France
Kermorvant, Hugo;  Neurophy Lab, Université Libre de Bruxelles, 1050 Brussels, Belgium
Maarouf, Adil ;  Neurology, La Timone Hospital, AP-HM, 13385 Marseille, France ; Department of Neurology, Maladie Inflammatoire du Cerveau et de la Moelle Epinière (MICeME), Aix Marseille Université (AMU), CNRS, CRMBM, 13385 Marseille, France
Maldonado Slootjes, Sofia ;  Department of Neurology, Universitair Ziekenhuis Brussel (UZ Brussel), 1090 Brussels, Belgium ; NEUR Research Group, Vrije Universiteit Brussel (VUB), 1090 Brussels, Belgium
Redon, Sylvain ;  Evaluation and Treatment of Pain, FHU INOVPAIN, La Timone Hospital, AP-HM, 13385 Marseille, France
Robin, Alexis ;  Neurology, CHU Grenoble, 38700 Grenoble, France
Hadidane, Sofiene;  Cabinets de Neurologie d’Allauch et Plan de Cuques, 13190 Allauch, France
Harlay, Vincent ;  Neuro-Oncology, AMU, La Timone Hospital, AP-HM, 13005 Marseille, France
Tota, Vito  ;  Université de Mons - UMONS > Faculté de Médecine et de Pharmacie > Service de Neurosciences ; Neurology, CHU Helora, 7000 Mons, Belgium
Madec, Tanguy ;  Neurology, Hospital of Noumea, 98800 Nouméa, France
Niset, Alexandre;  Sciense, New York, NY 10013, USA ; Emergency Medicine, Université Catholique de Louvain, 1348 Louvain-la-Neuve, Belgium ; Pediatric Intensive Care Unit, Cliniques Universitaires Saint-Luc, 1200 Brussels, Belgium
Al Barajraji, Mejdeddine ;  Sciense, New York, NY 10013, USA ; Département des Neurosciences Cliniques, Centre Hospitalier Universitaire Vaudois (CHUV), 1005 Lausanne, Switzerland
Madsen, Joseph R.;  Neurodynamics Laboratory, Department of Neurosurgery, Boston Children’s Hospital, Harvard Medical School, Boston, MA 02115, USA
El Hadwe, Salim;  Neurosurgery, Université Libre de Bruxelles, 1070 Brussels, Belgium ; Sciense, New York, NY 10013, USA ; Clinical Neuroscience, University of Cambridge, Cambridge CB2 1TN, UK
Massager, Nicolas ;  Université de Mons - UMONS > Faculté de Médecine et de Pharmacie > Service du Doyen de la Faculté de Médecine et Pharmacie ; Neurosurgery, Université Libre de Bruxelles, 1070 Brussels, Belgium ; Neurosurgery, CHU Tivoli, 7110 La Louvière, Belgium
Lagarde, Stanislas ;  AMU, INSERM, Institut Neuroscience des Systèmes (INS), 13005 Marseille, France ; APHM, Timone Hospital, Epileptology and Cerebral Rhythmology, 13005 Marseille, France
Carron, Romain;  Sciense, New York, NY 10013, USA ; AMU, INSERM, Institut Neuroscience des Systèmes (INS), 13005 Marseille, France ; Stereotactic and Functional Neurosurgery, La Timone Hospital, AP-HM, 13385 Marseille, France
More authors (15 more) Less
Language :
English
Title :
Specialized Large Language Model Outperforms Neurologists at Complex Diagnosis in Blinded Case-Based Evaluation
Publication date :
27 March 2025
Journal title :
Brain Sciences
eISSN :
2076-3425
Publisher :
MDPI AG
Volume :
15
Issue :
4
Pages :
347
Peer reviewed :
Peer Reviewed verified by ORBi
Available on ORBi UMONS :
since 03 April 2025

Statistics


Number of views
16 (2 by UMONS)
Number of downloads
4 (1 by UMONS)

Scopus citations®
 
1
Scopus citations®
without self-citations
0
OpenCitations
 
0
OpenAlex citations
 
1

Bibliography


Similar publications



Contact ORBi UMONS