An Empirical Study of the Non-Determinism of ChatGPT in Code Generation

Ouyang, Shuyin; Zhang, Jie M.; Harman, Mark; Wang, Meng

doi:10.1145/3697010

articleACM Transactions on Software Engineering and MethodologySep 26, 2024HYBRID OA

An Empirical Study of the Non-Determinism of ChatGPT in Code Generation

SOShuyin Ouyang JMJie M. Zhang MHMark Harman MWMeng Wang

King's College London · Harman (United Kingdom) · +3 more institutions

Indexed incrossref

Abstract

There has been a recent explosion of research on Large Language Models (LLMs) for software engineering tasks, in particular code generation. However, results from LLMs can be highly unstable; non-deterministically returning very different code for the same prompt. Such non-determinism affects the correctness and consistency of the generated code, undermines developers’ trust in LLMs, and yields low reproducibility in LLM-based papers. Nevertheless, there is no work investigating how serious this non-determinism threat is. To fill this gap, this article conducts an empirical study on the non-determinism of ChatGPT in code generation. We chose to study ChatGPT because it is already highly prevalent in the code…

Citation impact

135

total citations

FWCI: 14.29
Percentile: 100%
References: 42

Citations per year

Authors

4

Topics & keywords

Topics

Keywords

Computer science
Determinism
Code generation
Programming language
Code (set theory)
Software engineering
Computer security
Epistemology

No related works found for this paper.

Funding

UR
UK Research and Innovation
Award: EP/S023356/1