ChatGPT-4 performance in rhinology: A clinical case series.

ChatGPT‐4; artificial intelligence; head neck surgery; otolaryngology; performance; rhinology; Humans; Rhinitis/diagnosis; Rhinitis/therapy; Rhinitis/drug therapy; Male; Female; Otolaryngology; Rhinitis; Immunology and Allergy; Otorhinolaryngology

Abstract :

[en] Chatbot Generative Pre-trained Transformer (ChatGPT)-4 indicated more than twice additional examinations than practitioners in the management of clinical cases in rhinology. The consistency between ChatGPT-4 and practitioner in the indication of additional examinations may significantly vary from one examination to another. The ChatGPT-4 proposed a plausible and correct primary diagnosis in 62.5% cases, while pertinent and necessary additional examinations and therapeutic regimen were indicated in 7.5%-30.0% and 7.5%-32.5% of cases, respectively. The stability of ChatGPT-4 responses is moderate-to-high. The performance of ChatGPT-4 was not influenced by the human-reported level of difficulty of clinical cases.

Disciplines :

Otolaryngology

Author, co-author :

Radulesco, Thomas ; Research Committee of Young Otolaryngologists of the International Federation of Otorhinolaryngological Societies, Paris, France ; Aix Marseille University, APHM, CNRS, IUSTI, La Conception University Hospital, ENT-HNS Department, Marseille, France

Saibene, Alberto Maria ; Research Committee of Young Otolaryngologists of the International Federation of Otorhinolaryngological Societies, Paris, France ; Otolaryngology Unit, ASST Santi Paolo E Carlo, Department of Health Sciences, Università Degli Studi Di Milano, Milan, Italy

Michel, Justin; Aix Marseille University, APHM, CNRS, IUSTI, La Conception University Hospital, ENT-HNS Department, Marseille, France

Vaira, Luigi Angelo ; Research Committee of Young Otolaryngologists of the International Federation of Otorhinolaryngological Societies, Paris, France ; Maxillofacial Surgery Operative Unit, Department of Medicine, Surgery and Pharmacy, University of Sassari, Sassari, Italy ; PhD School of Biomedical Sciences, Department of Biomedical Sciences, University of Sassari, Sassari, Italy

Lechien, Jérome ; Université de Mons - UMONS > Faculté de Psychologie et des Sciences de l'Education > Service de Métrologie et Sciences du langage ; Université de Mons - UMONS > Faculté de Médecine et de Pharmacie > Service de Chirurgie

Language :

English

Title :

ChatGPT-4 performance in rhinology: A clinical case series.

Publication date :

June 2024

Journal title :

International Forum of Allergy and Rhinology

ISSN :

2042-6976

eISSN :

2042-6984

Publisher :

John Wiley and Sons Inc, United States

Volume :

Issue :

Pages :

1123 - 1130

Peer reviewed :

Peer Reviewed verified by ORBi

Additional URL :

https://onlinelibrary.wiley.com/doi/pdf/10.1002/alr.23323

Research unit :

M120 - Service de Chirurgie

Research institute :

Santé

Available on ORBi UMONS :

since 19 December 2024

Statistics

Number of views

44 (0 by UMONS)

Number of downloads

24 (0 by UMONS)

More statistics

Scopus citations^®

Scopus citations^®
without self-citations

OpenCitations

OpenAlex citations

Bibliography

Yoshiyasu Y, Wu F, Dhanda AK, Gorelik D, Takashima M, Ahmed OG. GPT-4 accuracy and completeness against International Consensus Statement on Allergy and Rhinology: rhinosinusitis. Int Forum Allergy Rhinol. 2023;13(12):2231-2234. doi:10.1002/alr.23201
Lechien JR, Maniaci A, Gengler I, Hans S, Chiesa-Estomba CM, Vaira LA. Validity and reliability of an instrument evaluating the performance of intelligent chatbot: the artificial intelligence performance instrument (AIPI). Eur Arch Otorhinolaryngol. 2023. doi:10.1007/s00405-023-08219-y
Patel ZM, Holbrook EH, Turner JH, et al. International consensus statement on allergy and rhinology: olfaction. Int Forum Allergy Rhinol. 2022;12(4):327-680. doi:10.1002/alr.22929
Wise SK, Damask C, Roland LT, et al. International consensus statement on allergy and rhinology: allergic rhinitis—2023. Int Forum Allergy Rhinol. 2023;13(4):293-859. doi:10.1002/alr.23090.5
Orlandi RR, Kingdom TT, Smith TL, et al. International consensus statement on allergy and rhinology: rhinosinusitis 2021. Int Forum Allergy Rhinol. 2021;11(3):213-739. doi:10.1002/alr.22741
Gercama AJ, de Haan M, van der Vleuten CPM. Reliability of the Amsterdam Clinical Challenge Scale (ACCS): a new instrument to assess the level of difficulty of patient cases in medical education. Med Educ. 2000;34(7):519-524.
Chiesa-Estomba CM, Lechien JR, Vaira LA, et al. Exploring the potential of Chat-GPT as a supportive tool for sialendoscopy clinical decision making and patient information support. Eur Arch Otorhinolaryngol. 2023. doi:10.1007/s00405-023-08104-8
Lechien JR, Georgescu BM, Hans S, Chiesa-Estomba CM. ChatGPT performance in laryngology and head and neck surgery: a clinical case-series. Eur Arch Otorhinolaryngol. 2024;281(1):319-333. doi:10.1007/s00405-023-08282-5
Ayoub NF, Lee YJ, Grimm D, Divi V. Head-to-head comparison of ChatGPT versus google search for medical knowledge acquisition. Otolaryngol Head Neck Surg. 2023. doi:10.1002/ohn.465
Perlis RH. Research Letter: application of GPT-4 to select next-step antidepressant treatment in major depression. medRxiv. 2023. doi:10.1101/2023.04.14.23288595