Role of ChatGPT and Gemini in the Urology Field: A Case-Based Study
Abstract
Introduction
The healthcare sector is witnessing a transformation with the advent of artificial intelligence (AI), exemplified by ChatGPT and Gemini AI. These AI systems emulate human conversation and provide accurate medical responses. This study explores their integration into medical decision-making in the urology field.
Methods
The study presented a collection of 20 medical case scenarios, carefully crafted and revised by a team of authors in the field of urology. Each case was presented to ChatGPT and Gemini in September of 2023, and their responses were recorded and analyzed.
Results
Both AI tools displayed varying accuracy in diagnoses and management recommendations. ChatGPT failed in identifying congenital penile curvature, while Gemini succeeded. Conversely, ChatGPT excelled in recommending a management plan for renal artery aneurysms. Gemini outperformed in explaining iodinated contrast material toxicity. Both struggled with a bladder prolapse prevention question.
Conclusion
AI integration in urology is promising but has limitations. AI provides valuable insights but cannot replace human expertise. Research is vital to improve AI's role in urology. Clinicians should view AI suggestions as supplements to their judgment, fostering collaborative healthcare decisions.
References
- Malik P, Pathania M, Rathaur VK. Overview of artificial intelligence in medicine. Journal of family medicine and primary care. 2019;8(7):2328-31. doi:10.4103/jfmpc.jfmpc_440_19
- Aydın Ö. Google Bard Generated Literature Review: Metaverse. Journal of AI. 2023;7(1): 1-14. doi:10.61969/jai.1311271
- Ray PP. ChatGPT: A comprehensive review on background, applications, key challenges, bias, ethics, limitations and future scope. Internet of Things and Cyber-Physical Systems. 2023 Apr 14. doi:10.1016/j.iotcps.2023.04.003
- Gilson A, Safranek CW, Huang T, Socrates V, Chi L, Taylor RA, et al. How Does ChatGPT Perform on the United States Medical Licensing Examination? The Implications of Large Language Models for Medical Education and Knowledge Assessment. JMIR Medical Education. 2023;9:e45312 doi:10.2196/45312
- Ayers JW, Zhu Z, Poliak A, Leas EC, Dredze M, Hogarth M, et al. Evaluating Artificial Intelligence Responses to Public Health Questions. JAMA Network Open. 2023;6(6):e2317517. doi:10.1001/jamanetworkopen.2023.17517
- Doddi S, Hibshman T, Salichs O, Bera K, Tippareddy C, Ramaiya N, Tirumani SH. Assessing Appropriate Responses to ACR Urologic Imaging Scenarios using ChatGPT and Bard. Current Problems in Diagnostic Radiology. 2023 Oct 20. doi:10.1067/j.cpradiol.2023.10.022
- Caruccio L, Cirillo S, Polese G, Solimando G, Sundaramurthy S, Tortora G. Can ChatGPT provide intelligent diagnoses? A comparative study between predictive models and ChatGPT to define a new medical diagnostic bot. Expert Systems with Applications. 2023:121186. doi:10.1016/j.eswa.2023.121186
- Cakir H, Caglar U, Yildiz O, Meric A, Ayranci A, Ozgor F. Evaluating the performance of ChatGPT in answering questions related to urolithiasis. International Urology and Nephrology. 2023:1-5. doi:10.5152/tud.2023.23171
- Harika J, Baleeshwar P, Navya K, Shanmugasundaram H. A review on artificial intelligence with deep human reasoning. In2022 International Conference on Applied Artificial Intelligence and Computing (ICAAIC) 2022 9 (pp. 81-84). IEEE. doi:10.1109/ICAAIC53929.2022.9793310
- Wang F, Casalino LP, Khullar D. Deep learning in medicine—promise, progress, and challenges. JAMA Internal Medicine. 2019;179(3):293-4. doi:10.1001/jamainternmed.2018.7117
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.