Instruction Tuning with GPT-4

Peng, Baolin; Li, Chunyuan; He, Pengcheng; Galley, Michel; Gao, Jianfeng

doi:10.48550/arxiv.2304.03277

preprintarXiv (Cornell University)Apr 6, 2023GREEN OA

Instruction Tuning with GPT-4

BPBaolin Peng CLChunyuan Li PHPengcheng He MGMichel Galley JGJianfeng Gao

Indexed inarxivdatacite

Abstract

Prior work has shown that finetuning large language models (LLMs) using machine-generated instruction-following data enables such models to achieve remarkable zero-shot capabilities on new tasks, and no human-written instructions are needed. In this paper, we present the first attempt to use GPT-4 to generate instruction-following data for LLM finetuning. Our early experiments on instruction-tuned LLaMA models show that the 52K English and Chinese instruction-following data generated by GPT-4 leads to superior zero-shot performance on new tasks to the instruction-following data generated by previous state-of-the-art models. We also collect feedback and comparison data from GPT-4 to enable a comprehensive…

Citation impact

188

total citations

FWCI: —
Percentile: —
References: 0

Citations per year

Authors

5

Topics & keywords

Topics

Keywords

Computer science
Codebase
One shot
Language model
Artificial intelligence
Programming language

UN Sustainable Development Goals

Quality Education

No related works found for this paper.