GeneGPT: augmenting large language models with domain tools for improved access to biomedical information
National Institutes of Health · National Center for Biotechnology Information
Abstract
MOTIVATION: While large language models (LLMs) have been successfully applied to various tasks, they still face challenges with hallucinations. Augmenting LLMs with domain-specific tools such as database utilities can facilitate easier and more precise access to specialized knowledge. In this article, we present GeneGPT, a novel method for teaching LLMs to use the Web APIs of the National Center for Biotechnology Information (NCBI) for answering genomics questions. Specifically, we prompt Codex to solve the GeneTuring tests with NCBI Web APIs by in-context learning and an augmented decoding algorithm that can detect and execute API calls. RESULTS: Experimental results show that GeneGPT achieves…
Citation impact
- FWCI
- 14.54
- Percentile
- 100%
- References
- 19
Authors
4- QJQiao JinCorresponding
National Institutes of Health, National Center for Biotechnology Information
- YYYifan Yang
National Institutes of Health, National Center for Biotechnology Information
- QCQingyu Chen
National Institutes of Health, National Center for Biotechnology Information
- ZLZhiyong Lu
National Institutes of Health, National Center for Biotechnology Information
Topics & keywords
- Computer science
- Domain (mathematical analysis)
- Domain-specific language
- Software
- Software engineering
- Programming language
- Mathematics