Comparative performance analysis of global and chinese-domain large language models for myopia

Betzler BK, Chen H, Cheng CY, Lee CS, Ning G, Song SJ, et al. Large language models and their impact in ophthalmology. Lancet Digit Health. 2023;5:e917–e924.

Article  CAS  PubMed  PubMed Central  Google Scholar 

Antaki F, Touma S, Milad D, El-Khoury J, Duval R. Evaluating the performance of ChatGPT in ophthalmology: an analysis of its successes and shortcomings. Ophthalmol Sci. 2023;3:100324.

Mihalache A, Popovic MM, Muni RH. Performance of an artificial intelligence chatbot in ophthalmic knowledge assessment. JAMA Ophthalmol. 2023;141:589–97.

Article  PubMed  PubMed Central  Google Scholar 

Momenaei B, Wakabayashi T, Shahlaee A, Durrani AF, Pandit SA, Wang K, et al. Appropriateness and readability of ChatGPT-4-generated responses for surgical treatment of retinal diseases. Ophthalmol Retina. 2023;7:862–8.

Xu L, Sanders L, Li K, Chow JCL. Chatbot for health care and oncology applications using artificial intelligence and machine learning: systematic review. JMIR Cancer. 2021;7:e27850.

Article  PubMed  PubMed Central  Google Scholar 

Moor M, Banerjee O, Abad ZSH, Krumholz HM, Leskovec J, Topol EJ, et al. Foundation models for generalist medical artificial intelligence. Nature. 2023;616:259–65.

Article  CAS  PubMed  Google Scholar 

Haupt CE, Marks M. AI-generated medical advice—GPT and beyond. JAMA. 2023;329:1349–50.

Article  PubMed  Google Scholar 

OpenAI. Is ChatGPT biased? 2023. https://help.openai.com/en/articles/8313359-is-chatgpt-biased.

Anil R, Dai AM, Firat O, Johnson M, Lepikhin D, Passos A, et al. PaLM 2 technical report. 2023. Preprint at https://arxiv.org/abs/2305.10403.

Wang X, Gong Z, Wang G, Jia J, Xu Y, Zhao J, et al. ChatGPT performs on the chinese national medical licensing examination. J Med Syst. 2023;47. 86.

Article  PubMed  Google Scholar 

Liu X, Wu J, Shao A, Shen W, Ye P, Wang Y, et al. Uncovering language disparity of chatgpt on retinal vascular disease classification: cross-sectional study. J Med Internet Res. 2024;26:e51926.

Article  PubMed  PubMed Central  Google Scholar 

Wang H, Wu W, Dou Z, He L, Yang L. Performance and exploration of ChatGPT in medical examination, records and education in Chinese: pave the way for medical AI. Int J Med Inf. 2023;177:105173.

Article  Google Scholar 

Li R, Zhang K, Li SM, Zhang Y, Tian J, Lu Z, et al. Implementing a digital comprehensive myopia prevention and control strategy for children and adolescents in China: a cost-effectiveness analysis. Lancet Reg Health West Pac. 2023;38:100837.

PubMed  PubMed Central  Google Scholar 

Chen M, Wu A, Zhang L, Wang W, Chen X, Yu X, et al. The increasing prevalence of myopia and high myopia among high school students in Fenghua city, eastern China: a 15-year population-based survey. BMC Ophthalmol. 2018;18:159.

Article  PubMed  PubMed Central  Google Scholar 

Baird PN, Saw SM, Lanca C, Guggenheim JA, Smith Iii EL, Zhou X, et al. Myopia. Nat Rev Dis Prim. 2020;6:99.

Article  PubMed  Google Scholar 

Morgan IG, French AN, Ashby RS, Guo X, Ding X, He M, et al. The epidemics of myopia: aetiology and prevention. Prog Retin Eye Res. 2018;62:134–49.

Article  PubMed  Google Scholar 

Baidu index searching. 2023. https://index.baidu.com/v2/index.html#/.

Lim ZW, Pushpanathan K, Yew SME, Lai Y, Sun CH, Lam JSH, et al. Benchmarking large language models’ performances for myopia care: a comparative analysis of ChatGPT-3.5, ChatGPT-4.0, and Google Bard. EBioMedicine. 2023;95. 104770.

Article  PubMed  PubMed Central  Google Scholar 

Chinese J Sci. https://finance.sina.com.cn/wm/2023-06-20/doc-imyxyaxf8235213.shtml 2023.

Wall Street News. Introducing qwen-7B: open foundation and human-aligned models (of the state-of-the-art). 2023. https://zhuanlan.zhihu.com/p/648007297?utm_id=0.

Hongbo Z, Junying C, Feng J, Fei Yu, Zhihong C, Jianquan L, et al. HuatuoGPT, towards taming language models to be a doctor. Preprint at arXiv:2305.15075 [csCL] 2023.

Kraljevic Z, Shek A, Bean D, Bendayan R, Teo J, Dobson R. MedGPT: medical concept prediction from clinical narratives. 2021.

Baidu ERNIE. https://baijiahao.baidu.com/s?id=1775669682987813307&wfr=spider&for=pc.202.

Llama2-Chinese. https://ollama.com/library/llama2-chinese:7b-chat.S 2023.

Li Y, Li Z, Zhang K, Dan R, Jiang S, Zhang Y. ChatDoctor: a medical chat model fine-tuned on a large language model meta-AI (LLaMA) using medical domain knowledge. Cureus. 2023;15:e40895.

PubMed  PubMed Central  Google Scholar 

Rasmussen MLR, Larsen AC, Subhi Y, Potapenko I. Artificial intelligence-based ChatGPT chatbot responses for patient and parent questions on vernal keratoconjunctivitis. Graefes Arch Clin Exp Ophthalmol. 2023;261:3041–43.

Lahat A, Shachar E, Avidan B, Glicksberg B, Klang E. Evaluating the utility of a large language model in answering common patients’ gastrointestinal health-related questions: are we there yet?. Diagnostics. 2023;13:1950.

Article  PubMed  PubMed Central  Google Scholar 

Johnson D, Goodman R, Patrinely J, Stone C, Zimmerman E, Donald R, et al. Assessing the accuracy and reliability of AI-generated medical responses: an evaluation of the Chat-GPT model. Res Square. 2023:rs.3.rs-2566942.

Li H, Moon JT, Purkayastha S, Celi LA, Trivedi H, Gichoya JW. Ethics of large language models in medicine and medical research. Lancet Digital Health. 2023;5:e333–e335.

Article  CAS  PubMed  Google Scholar 

Luo MJ, Pang J, Bi S, Lai Y, Zhao J, Shang Y, et al. Development and evaluation of a retrieval-augmented large language model framework for ophthalmology. JAMA Ophthalmol. 2024;142:798–805.

Article  PubMed  PubMed Central  Google Scholar 

Comments (0)

No login
gif