[en] Chatbot Generative Pre-trained Transformer (ChatGPT)-4 indicated more than twice additional examinations than practitioners in the management of clinical cases in rhinology. The consistency between ChatGPT-4 and practitioner in the indication of additional examinations may significantly vary from one examination to another. The ChatGPT-4 proposed a plausible and correct primary diagnosis in 62.5% cases, while pertinent and necessary additional examinations and therapeutic regimen were indicated in 7.5%-30.0% and 7.5%-32.5% of cases, respectively. The stability of ChatGPT-4 responses is moderate-to-high. The performance of ChatGPT-4 was not influenced by the human-reported level of difficulty of clinical cases.
Disciplines :
Otolaryngology
Author, co-author :
Radulesco, Thomas ; Research Committee of Young Otolaryngologists of the International Federation of Otorhinolaryngological Societies, Paris, France ; Aix Marseille University, APHM, CNRS, IUSTI, La Conception University Hospital, ENT-HNS Department, Marseille, France
Saibene, Alberto Maria ; Research Committee of Young Otolaryngologists of the International Federation of Otorhinolaryngological Societies, Paris, France ; Otolaryngology Unit, ASST Santi Paolo E Carlo, Department of Health Sciences, Università Degli Studi Di Milano, Milan, Italy
Michel, Justin; Aix Marseille University, APHM, CNRS, IUSTI, La Conception University Hospital, ENT-HNS Department, Marseille, France
Vaira, Luigi Angelo ; Research Committee of Young Otolaryngologists of the International Federation of Otorhinolaryngological Societies, Paris, France ; Maxillofacial Surgery Operative Unit, Department of Medicine, Surgery and Pharmacy, University of Sassari, Sassari, Italy ; PhD School of Biomedical Sciences, Department of Biomedical Sciences, University of Sassari, Sassari, Italy
Lechien, Jérome ; Université de Mons - UMONS > Faculté de Psychologie et des Sciences de l'Education > Service de Métrologie et Sciences du langage ; Université de Mons - UMONS > Faculté de Médecine et de Pharmacie > Service de Chirurgie
Language :
English
Title :
ChatGPT-4 performance in rhinology: A clinical case series.
Yoshiyasu Y, Wu F, Dhanda AK, Gorelik D, Takashima M, Ahmed OG. GPT-4 accuracy and completeness against International Consensus Statement on Allergy and Rhinology: rhinosinusitis. Int Forum Allergy Rhinol. 2023;13(12):2231-2234. doi:10.1002/alr.23201
Lechien JR, Maniaci A, Gengler I, Hans S, Chiesa-Estomba CM, Vaira LA. Validity and reliability of an instrument evaluating the performance of intelligent chatbot: the artificial intelligence performance instrument (AIPI). Eur Arch Otorhinolaryngol. 2023. doi:10.1007/s00405-023-08219-y
Patel ZM, Holbrook EH, Turner JH, et al. International consensus statement on allergy and rhinology: olfaction. Int Forum Allergy Rhinol. 2022;12(4):327-680. doi:10.1002/alr.22929
Wise SK, Damask C, Roland LT, et al. International consensus statement on allergy and rhinology: allergic rhinitis—2023. Int Forum Allergy Rhinol. 2023;13(4):293-859. doi:10.1002/alr.23090.5
Orlandi RR, Kingdom TT, Smith TL, et al. International consensus statement on allergy and rhinology: rhinosinusitis 2021. Int Forum Allergy Rhinol. 2021;11(3):213-739. doi:10.1002/alr.22741
Gercama AJ, de Haan M, van der Vleuten CPM. Reliability of the Amsterdam Clinical Challenge Scale (ACCS): a new instrument to assess the level of difficulty of patient cases in medical education. Med Educ. 2000;34(7):519-524.
Chiesa-Estomba CM, Lechien JR, Vaira LA, et al. Exploring the potential of Chat-GPT as a supportive tool for sialendoscopy clinical decision making and patient information support. Eur Arch Otorhinolaryngol. 2023. doi:10.1007/s00405-023-08104-8
Lechien JR, Georgescu BM, Hans S, Chiesa-Estomba CM. ChatGPT performance in laryngology and head and neck surgery: a clinical case-series. Eur Arch Otorhinolaryngol. 2024;281(1):319-333. doi:10.1007/s00405-023-08282-5
Ayoub NF, Lee YJ, Grimm D, Divi V. Head-to-head comparison of ChatGPT versus google search for medical knowledge acquisition. Otolaryngol Head Neck Surg. 2023. doi:10.1002/ohn.465
Perlis RH. Research Letter: application of GPT-4 to select next-step antidepressant treatment in major depression. medRxiv. 2023. doi:10.1101/2023.04.14.23288595