PMC-LLaMA: toward building open-source language models for medicine
Shanghai Jiao Tong University · Shandong Jiaotong University · Shanghai Artificial Intelligence Laboratory
Abstract
Recently, large language models (LLMs) have showcased remarkable capabilities in natural language understanding. While proficient in everyday conversation and question-answering (QA) settings, these models frequently struggle in domains that require precision, such as medical applications, because they lack domain-specific knowledge. In this article, we describe the procedure for building a powerful, open-source language model specifically designed for medical applications, termed PMC-LLaMA.
We adapt a general-purpose LLM to the medical domain through data-centric knowledge injection, integrating 4.8M biomedical academic papers and 30K medical textbooks, and through comprehensive domain-specific instruction fine-tuning on 202M tokens encompassing medical QA, rationale for reasoning, and conversational dialogues.
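To make the two kinds of training data concrete, the following is a minimal sketch, not the authors' pipeline. It assumes that knowledge injection consumes plain document text for next-token prediction and that instruction fine-tuning consumes prompt/completion pairs. The `InstructionSample` type, the function names, and the `### Instruction:` prompt template are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass
class InstructionSample:
    """One instruction-tuning example (hypothetical schema)."""
    instruction: str
    context: str
    response: str


def format_knowledge_sample(document_text: str) -> str:
    """Knowledge injection: plain text with whitespace normalised."""
    return " ".join(document_text.split())


def format_instruction_sample(sample: InstructionSample) -> dict:
    """Instruction tuning: build a prompt/completion pair.

    The template below is an assumption, not the one used for PMC-LLaMA.
    """
    prompt = f"### Instruction:\n{sample.instruction}\n\n"
    if sample.context:
        prompt += f"### Input:\n{sample.context}\n\n"
    prompt += "### Response:\n"
    return {"prompt": prompt, "completion": sample.response}


def build_stages(documents, samples) -> dict:
    """Group the formatted data by training stage."""
    return {
        "knowledge_injection": [format_knowledge_sample(d) for d in documents],
        "instruction_tuning": [format_instruction_sample(s) for s in samples],
    }
```

In this sketch, papers and textbooks would be passed as `documents`, while medical QA items, reasoning rationales, and dialogue turns would each be mapped to an `InstructionSample` before formatting.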
Citation impact
- FWCI: 27.15
- Percentile: 100%
- References: 12
Authors (6)
- Chaoyi Wu
Shanghai Jiao Tong University, Shandong Jiaotong University, Shanghai Artificial Intelligence Laboratory
- Weixiong Lin
Shanghai Jiao Tong University, Shandong Jiaotong University, Shanghai Artificial Intelligence Laboratory
- Xiaoman Zhang
Shanghai Jiao Tong University, Shandong Jiaotong University, Shanghai Artificial Intelligence Laboratory
- Ya Zhang
Shanghai Jiao Tong University, Shandong Jiaotong University, Shanghai Artificial Intelligence Laboratory
- Weidi Xie
Shanghai Jiao Tong University, Shandong Jiaotong University, Shanghai Artificial Intelligence Laboratory
Topics & keywords
- Computer science
- Open source
- Natural language processing
- Programming language
- Software
- Quality Education