Article (Scientific journals)
Validity and reliability of an instrument evaluating the performance of intelligent chatbot: the Artificial Intelligence Performance Instrument (AIPI).
Lechien, Jérome; Maniaci, Antonino; Gengler, Isabelle et al.
2023 In European Archives of Oto-Rhino-Laryngology
Peer Reviewed verified by ORBi
 

Keywords :
Artificial; ChatGPT; Chatbot; Comparison; Diagnosis; GPT; Head neck; Instrument; Intelligence; Medicine; Otolaryngology; Performance; Surgery; Tool; Treatment; Otorhinolaryngology; General Medicine
Abstract :
[en] OBJECTIVES: To evaluate the reliability and validity of the Artificial Intelligence Performance Instrument (AIPI).
METHODS: Medical records of patients consulting in otolaryngology were evaluated by physicians and ChatGPT for differential diagnosis, management, and treatment. The ChatGPT performance was rated twice using AIPI within a 7-day period to assess test-retest reliability. Internal consistency was evaluated using Cronbach's α. Internal validity was evaluated by comparing the AIPI scores of the clinical cases rated by ChatGPT and 2 blinded practitioners. Convergent validity was measured by comparing the AIPI score with a modified version of the Ottawa Clinical Assessment Tool (OCAT). Interrater reliability was assessed using Kendall's tau.
RESULTS: Forty-five patients completed the evaluations (28 females). The AIPI Cronbach's alpha analysis suggested an adequate internal consistency (α = 0.754). The test-retest reliability was moderate-to-strong for items and the total score of AIPI (rs = 0.486, p = 0.001). The mean AIPI score of the senior otolaryngologist was significantly higher compared to the score of ChatGPT, supporting adequate internal validity (p = 0.001). Convergent validity reported a moderate and significant correlation between AIPI and modified OCAT (rs = 0.319; p = 0.044). The interrater reliability reported significant positive concordance between both otolaryngologists for the patient feature, diagnostic, additional examination, and treatment subscores as well as for the AIPI total score.
CONCLUSIONS: AIPI is a valid and reliable instrument in assessing the performance of ChatGPT in ear, nose and throat conditions. Future studies are needed to investigate the usefulness of AIPI in medicine and surgery, and to evaluate the psychometric properties in these fields.
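The abstract reports internal consistency as Cronbach's α. As a reading aid, here is a minimal sketch of how that coefficient is conventionally computed from a cases-by-items score table, using the standard formula α = k/(k−1) · (1 − Σ item variances / variance of total scores). The function name `cronbach_alpha` and the toy ratings are hypothetical illustrations; they are not the authors' code or the study data.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a table of scores.

    scores: list of rows, one row per rated case, one column per item.
    Uses sample variances (n - 1 denominator) throughout.
    """
    k = len(scores[0])  # number of items in the instrument
    if k < 2 or len(scores) < 2:
        raise ValueError("need at least 2 items and 2 cases")
    # Variance of each item (column) across cases
    item_vars = [variance([row[i] for row in scores]) for i in range(k)]
    # Variance of the per-case total score
    total_var = variance([sum(row) for row in scores])
    if total_var == 0:
        raise ValueError("total scores have zero variance")
    return (k / (k - 1)) * (1 - sum(item_vars) / total_var)


# Made-up ratings for illustration only (4 cases x 2 items); not study data.
toy = [[1, 2], [2, 1], [3, 4], [4, 3]]
print(round(cronbach_alpha(toy), 3))
```

Values near 1 indicate that the items vary together across cases. The other statistics named in the abstract (Spearman's rs, Kendall's tau) are rank correlations and would normally be computed with a statistics package.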
Disciplines :
Otolaryngology
Author, co-author :
Lechien, Jérome; Université de Mons - UMONS > Faculté de Psychologie et des Sciences de l'Educatio > Service de Métrologie et Sciences du langage
Maniaci, Antonino; Research Committee of Young Otolaryngologists of the International Federation of Otorhinolaryngological Societies (IFOS), Paris, France ; Department of Medical, Surgical Sciences and Advanced Technologies G.F. Ingrassia, ENT Section, University of Catania, 95123, Catania, Italy
Gengler, Isabelle; Research Committee of Young Otolaryngologists of the International Federation of Otorhinolaryngological Societies (IFOS), Paris, France ; Department of Otolaryngology-Head and Neck Surgery, University of Cincinnati Medical Center, Cincinnati, OH, USA
Hans, Stephane; Research Committee of Young Otolaryngologists of the International Federation of Otorhinolaryngological Societies (IFOS), Paris, France ; Phonetics and Phonology Laboratory (UMR 7018 CNRS, Université Sorbonne Nouvelle/Paris 3), Department of Otorhinolaryngology and Head and Neck Surgery, Foch Hospital, School of Medicine, UFR Simone Veil, Université Versailles Saint-Quentin-en-Yvelines (Paris Saclay University), Paris, France
Chiesa-Estomba, Carlos M; Research Committee of Young Otolaryngologists of the International Federation of Otorhinolaryngological Societies (IFOS), Paris, France ; Young Confederation of the European Oto-Rhino-Laryngological Head and Neck Surgery Societies (Y-CEORLHNS), Dublin, Ireland ; Department of Otorhinolaryngology - Head and Neck Surgery, Donostia University Hospital - Biodonostia Research Institute, St. Sebastian, Spain
Vaira, Luigi A; Research Committee of Young Otolaryngologists of the International Federation of Otorhinolaryngological Societies (IFOS), Paris, France ; Maxillofacial Surgery Operative Unit, Department of Medicine, Surgery and Pharmacy, University of Sassari, Sassari, Italy ; Biomedical Science Department, Biomedical Science PhD School, University of Sassari, Sassari, Italy
Language :
English
Title :
Validity and reliability of an instrument evaluating the performance of intelligent chatbot: the Artificial Intelligence Performance Instrument (AIPI).
Publication date :
12 September 2023
Journal title :
European Archives of Oto-Rhino-Laryngology
ISSN :
0937-4477
eISSN :
1434-4726
Publisher :
Springer Science and Business Media Deutschland GmbH, Germany
Peer reviewed :
Peer Reviewed verified by ORBi
Research unit :
M112 - Anatomie humaine et Oncologie expérimentale
Research institute :
R550 - Institut des Sciences et Technologies de la Santé
R350 - Institut de recherche en sciences et technologies du langage
Available on ORBi UMONS :
since 25 December 2023

Statistics


Number of views
14 (1 by UMONS)
Number of downloads
487 (1 by UMONS)

Scopus citations®
48
Scopus citations® without self-citations
28
OpenAlex citations
52
