Science Cast

A UMLS-Augmented Framework for Improving Factuality in Large Language Models within Healthcare

Rui YangOctober 7, 2023 8:38pm

Views (47)
Comments (0)

Export Citation

Voice is AI-generated

Connected to paperThis paper is a preprint and has not been certified by peer review

A UMLS-Augmented Framework for Improving Factuality in Large Language Models within Healthcare

arXivPDFOctober 4, 2023 12:00am

Authors

Rui Yang, Edison Marrese-Taylor, Yuhe Ke, Lechao Cheng, Qingyu Chen, Irene Li

Abstract

Large language models (LLMs) have demonstrated powerful text generation capabilities, bringing unprecedented innovation to the healthcare field. While LLMs hold immense promise for applications in healthcare, applying them to real clinical scenarios presents significant challenges, as these models may generate content that deviates from established medical facts and even exhibit potential biases. In our research, we develop an augmented LLM framework based on the Unified Medical Language System (UMLS), aiming to better serve the healthcare community. We employ LLaMa2-13b-chat and ChatGPT-3.5 as our benchmark models, and conduct automatic evaluations using the ROUGE Score and BERTScore on 104 questions from the LiveQA test set. Additionally, we establish criteria for physician-evaluation based on four dimensions: Factuality, Completeness, Readability and Relevancy. ChatGPT-3.5 is used for physician evaluation with 20 questions on the LiveQA test set. Multiple resident physicians conducted blind reviews to evaluate the generated content, and the results indicate that this framework effectively enhances the factuality, completeness, and relevance of generated content. Our research demonstrates the effectiveness of using UMLS-augmented LLMs and highlights the potential application value of LLMs in in medical question-answering.

TwitterandLinkedIn

0 comments

Add comment

A UMLS-Augmented Framework for Improving Factuality in Large Language Models within Healthcare

A UMLS-Augmented Framework for Improving Factuality in Large Language Models within Healthcare

AI-powered Paper ChatBeta

AI-powered Paper ChatBeta

0 comments