Development of a liver disease–specific large language model chat interface using retrieval-augmented generation

Ge, Jin; Sun, Steve; Owens, Joseph F.; Galvez, Victor; Gologorskaya, Oksana; Lai, Jennifer C.; Pletcher, Mark J.; Lai, Ki

doi:10.1097/hep.0000000000000834

articleHepatologyMar 7, 2024GREEN OA

Development of a liver disease–specific large language model chat interface using retrieval-augmented generation

JGJin Ge SSSteve Sun JFJoseph F. Owens VGVictor Galvez OGOksana Gologorskaya

University of California, San Francisco

PubMed

Indexed incrossrefpubmed

Abstract

Results

We evaluated LiVersa's performance by conducting 2 rounds of testing. First, we compared LiVersa's outputs versus those of trainees from a previously published knowledge assessment. LiVersa answered all 10 questions correctly. Second, we asked 15 hepatologists to evaluate the outputs of 10 hepatology topic questions generated by LiVersa, OpenAI's ChatGPT 4, and Meta's Large Language Model Meta AI 2. LiVersa's outputs were more accurate but were rated less comprehensive and safe compared to those of ChatGPT 4.

Conclusions

In this demonstration, we built disease-specific and protected health information-compliant LLMs using RAG. While LiVersa demonstrated higher accuracy in answering questions related to hepatology, there were some deficiencies due to limitations set by the number of documents used for RAG. LiVersa will likely require further refinement before potential live deployment. The LiVersa prototype, however, is a proof of concept for utilizing RAG to customize LLMs for clinical use cases.

Citation impact

126

total citations

FWCI: 13.41
Percentile: 100%
References: 31

Citations per year

Authors

8

Topics & keywords

Topics

Keywords

Hepatology
Computer science
Set (abstract data type)
Medicine
Information retrieval
Internal medicine
Programming language

No related works found for this paper.

Funding

UO
University of California, San Francisco