Science Cast

Qilin-Med: Multi-stage Knowledge Injection Advanced Medical Large Language Model

Qichen YeOctober 16, 2023 8:12am

Views (48)
Comments (0)

Export Citation

Voice is AI-generated

Connected to paperThis paper is a preprint and has not been certified by peer review

Qilin-Med: Multi-stage Knowledge Injection Advanced Medical Large Language Model

arXivPDFOctober 13, 2023 12:00am

Authors

Qichen Ye, Junling Liu, Dading Chong, Peilin Zhou, Yining Hua, Andrew Liu

Abstract

Integrating large language models (LLMs) into healthcare presents potential but faces challenges. Directly pre-training LLMs for domains like medicine is resource-heavy and sometimes unfeasible. Sole reliance on Supervised Fine-tuning (SFT) can result in overconfident predictions and may not tap into domain specific insights. Addressing these challenges, we present a multi-stage training method combining Domain-specific Continued Pre-training (DCPT), SFT, and Direct Preference Optimization (DPO). A notable contribution of our study is the introduction of a 3Gb Chinese Medicine (ChiMed) dataset, encompassing medical question answering, plain texts, knowledge graphs, and dialogues, segmented into three training stages. The medical LLM trained with our pipeline, Qilin-Med, exhibits significant performance boosts. In the CPT and SFT phases, it achieves 38.4% and 40.0% accuracy on the CMExam, surpassing Baichuan-7B's 33.5%. In the DPO phase, on the Huatuo-26M test set, it scores 16.66 in BLEU-1 and 27.44 in ROUGE1, outperforming the SFT's 12.69 and 24.21. This highlights the strength of our training approach in refining LLMs for medical applications.

TwitterandLinkedIn

0 comments

Add comment

Qilin-Med: Multi-stage Knowledge Injection Advanced Medical Large Language Model

Qilin-Med: Multi-stage Knowledge Injection Advanced Medical Large Language Model

AI-powered Paper ChatBeta

AI-powered Paper ChatBeta

0 comments