Evaluating Large Language Models for Automated CPT Code Prediction in Endovascular Neurosurgery

Zhu C, Attaluri PK, Wirth PJ, et al. Current Applications of Artificial Intelligence in Billing Practices and Clinical Plastic Surgery. Plast Reconstr Surg Glob Open. 2024;12:e5939. doi: https://doi.org/10.1097/GOX.0000000000005939

Article  PubMed  PubMed Central  Google Scholar 

Burns ML, Mathis MR, Vandervest J, et al. Classification of Current Procedural Terminology Codes from Electronic Health Record Data Using Machine Learning. Anesthesiology. 2020;132:738–49. doi: https://doi.org/10.1097/ALN.0000000000003150

Article  PubMed  Google Scholar 

Tseng P, Kaplan RS, Richman BD, et al. Administrative Costs Associated With Physician Billing and Insurance-Related Activities at an Academic Health Care System. JAMA. 2018;319:691–7. doi: https://doi.org/10.1001/jama.2017.19148

Article  PubMed  PubMed Central  Google Scholar 

Isch EL, Sarikonda A, Sambangi A, et al. Evaluating the Efficacy of Large Language Models in CPT Coding for Craniofacial Surgery: A Comparative Analysis. J Craniofac Surg. Published Online First: 2 September 2024. doi: https://doi.org/10.1097/SCS.0000000000010575

Article  Google Scholar 

Levy J, Vattikonda N, Haudenschild C, et al. Comparison of Machine-Learning Algorithms for the Prediction of Current Procedural Terminology (CPT) Codes from Pathology Reports. J Pathol Inform. 2022;13:3. doi: https://doi.org/10.4103/jpi.jpi_52_21

Article  PubMed  Google Scholar 

O’Malley GR, Sarwar SA, Cassimatis ND, et al. Can Publicly Available Artificial Intelligence Successfully Identify Current Procedural Terminology Codes for Common Procedures in Neurosurgery? World Neurosurg. 2024;183:e860–70. doi: https://doi.org/10.1016/j.wneu.2024.01.043

Article  PubMed  Google Scholar 

Zaidat B, Tang J, Arvind V, et al. Can a Novel Natural Language Processing Model and Artificial Intelligence Automatically Generate Billing Codes From Spine Surgical Operative Notes? Global Spine J. 2024;14:2022–30. doi: https://doi.org/10.1177/21925682231164935

Article  PubMed  Google Scholar 

Matplotlib: A 2D Graphics Environment| IEEE Journals & Magazine| IEEE Xplore. https://ieeexplore.ieee.org/document/4160265 (accessed 9 October 2024)

Ali R, Tang OY, Connolly ID, et al. Performance of ChatGPT and GPT-4 on Neurosurgery Written Board Examinations. Neurosurgery. 2023;93:1353–65. doi: https://doi.org/10.1227/neu.0000000000002632

Article  PubMed  Google Scholar 

Ali R, Tang OY, Connolly ID, et al. Performance of ChatGPT, GPT-4, and Google Bard on a Neurosurgery Oral Boards Preparation Question Bank. Neurosurgery. 2023;93:1090–8. doi: https://doi.org/10.1227/neu.0000000000002551

Article  PubMed  Google Scholar 

Lang SP, Yoseph ET, Gonzalez-Suarez AD, et al. Analyzing Large Language Models’ Responses to Common Lumbar Spine Fusion Surgery Questions: A Comparison Between ChatGPT and Bard. Neurospine. 2024;21:633–41. doi: https://doi.org/10.14245/ns.2448098.049

Article  PubMed  PubMed Central  Google Scholar 

Goodman RS, Patrinely JR, Stone CA, et al. Accuracy and Reliability of Chatbot Responses to Physician Questions. JAMA Netw Open. 2023;6:e2336483. doi: https://doi.org/10.1001/jamanetworkopen.2023.36483

Article  PubMed  PubMed Central  Google Scholar 

Duszak R, Sacks D, Manowczak J. CPT coding by interventional radiologists: accuracy and implications. J Vasc Interv Radiol. 2001;12:447–54. doi: https://doi.org/10.1016/s1051-0443(07)61883-1

Article  PubMed  Google Scholar 

Duszak R, Blackham WC, Kusiak GM, et al. CPT coding by interventional radiologists: a multi-institutional evaluation of accuracy and its economic implications. J Am Coll Radiol. 2004;1:734–40. doi: https://doi.org/10.1016/j.jacr.2004.05.003

Article  PubMed  Google Scholar 

Himmelstein DU, Jun M, Busse R, et al. A comparison of hospital administrative costs in eight nations: US costs exceed all others by far. Health Aff (Millwood). 2014;33:1586–94. doi: https://doi.org/10.1377/hlthaff.2013.1327

Article  PubMed  Google Scholar 

Morra D, Nicholson S, Levinson W, et al. US physician practices versus Canadians: spending nearly four times as much money interacting with payers. Health Aff (Millwood). 2011;30:1443–50. doi: https://doi.org/10.1377/hlthaff.2010.0893

Article  PubMed  Google Scholar 

Friedman C, Shagina L, Lussier Y, et al. Automated encoding of clinical documents based on natural language processing. J Am Med Inform Assoc. 2004;11:392–402. doi: https://doi.org/10.1197/jamia.M1552

Article  PubMed  PubMed Central  Google Scholar 

Savova GK, Masanz JJ, Ogren PV, et al. Mayo clinical Text Analysis and Knowledge Extraction System (cTAKES): architecture, component evaluation and applications. J Am Med Inform Assoc. 2010;17:507–13. doi: https://doi.org/10.1136/jamia.2009.001560

Article  PubMed  PubMed Central  Google Scholar 

Zaidat B, Lahoti YS, Yu A, et al. Artificially Intelligent Billing in Spine Surgery: An Analysis of a Large Language Model. Global Spine J. 2023;21925682231224753. doi: https://doi.org/10.1177/21925682231224753

Hopkins BS, Carter B, Lord J, et al. Editorial. AtlasGPT: dawn of a new era in neurosurgery for intelligent care augmentation, operative planning, and performance. J Neurosurg. 2024;140:1211–4. doi: https://doi.org/10.3171/2024.2.JNS232997

Article  PubMed  Google Scholar 

Loftus TJ, Haider A, Upchurch GR Jr. Practical Guide to Artificial Intelligence, Chatbots, and Large Language Models in Conducting and Reporting Research. JAMA Surgery. Published Online First: 8 January 2025. doi: https://doi.org/10.1001/jamasurg.2024.6025

Patil A, Serrato P, Chisvo N, et al. Large language models in neurosurgery: a systematic review and meta-analysis. Acta Neurochir (Wien). 2024;166:475. doi: https://doi.org/10.1007/s00701-024-06372-9

Article  PubMed  Google Scholar 

Xu R, Hong Y, Zhang F, et al. Evaluation of the integration of retrieval-augmented generation in large language model for breast cancer nursing care responses. Sci Rep. 2024;14:30794. doi: https://doi.org/10.1038/s41598-024-81052-3

Article  CAS  PubMed  PubMed Central  Google Scholar 

Comments (0)

No login
gif