articleBioinformaticsFeb 1, 2024GOLD OA

GeneGPT: augmenting large language models with domain tools for improved access to biomedical information

National Institutes of Health · National Center for Biotechnology Information

PubMed
Indexed inarxivcrossrefdoajpubmed

Abstract

MOTIVATION: While large language models (LLMs) have been successfully applied to various tasks, they still face challenges with hallucinations. Augmenting LLMs with domain-specific tools such as database utilities can facilitate easier and more precise access to specialized knowledge. In this article, we present GeneGPT, a novel method for teaching LLMs to use the Web APIs of the National Center for Biotechnology Information (NCBI) for answering genomics questions. Specifically, we prompt Codex to solve the GeneTuring tests with NCBI Web APIs by in-context learning and an augmented decoding algorithm that can detect and execute API calls. RESULTS: Experimental results show that GeneGPT achieves…

No related works found for this paper.

Funding