A systematic evaluation of large language models of code

Xu, Frank F.; Alon, Uri; Neubig, Graham; Hellendoorn, Vincent J.

doi:10.1145/3520312.3534862

articleJun 10, 2022GOLD OA

A systematic evaluation of large language models of code

FFFrank F. Xu UAUri Alon GNGraham Neubig VJVincent J. Hellendoorn

Carnegie Mellon University

Indexed incrossref

Abstract

Large language models (LMs) of code have recently shown tremendous promise in completing code and synthesizing code from natural language descriptions. However, the current state-of-the-art code LMs (e.g., Codex) are not publicly available, leaving many questions about their model and data design decisions. We aim to fill in some of these blanks through a systematic evaluation of the largest existing models: Codex, GPT-J, GPT-Neo, GPT-NeoX-20B, and CodeParrot, across various programming languages. Although Codex itself is not open-source, we find that existing opensource models do achieve close results in some programming languages, although targeted mainly for natural language modeling. We further identify an…

Citation impact

495

total citations

FWCI: 147.82
Percentile: 100%
References: 20

Citations per year

Authors

4

Topics & keywords

Topics

Keywords

Computer science
Programming language
Source code
Code (set theory)
Natural language
Open source
Artificial intelligence
Natural language processing

UN Sustainable Development Goals

Quality Education

No related works found for this paper.

Funding

NS
National Science Foundation
Award: 1815287