An Analysis of the Automatic Bug Fixing Performance of ChatGPT
Johannes Gutenberg University Mainz · University College London
Abstract
To support software developers in finding and fixing software bugs, several automated program repair techniques have been introduced. Given a test suite, standard methods usually either synthesize a repair, or navigate a search space of software edits to find test-suite passing variants. Recent program repair methods are based on deep learning approaches. One of these novel methods, which is not primarily intended for automated program repair, but is still suitable for it, is ChatGPT. The bug fixing performance of ChatGPT, however, is so far unclear. Therefore, in this paper we evaluate ChatGPT on the standard bug fixing benchmark set, QuixBugs, and compare the performance with the results of several other…
Citation impact
- FWCI
- 61.60
- Percentile
- 100%
- References
- 26
Authors
4Topics & keywords
- Computer science
- Test suite
- Suite
- Benchmark (surveying)
- Software bug
- Software
- Set (abstract data type)
- Deep learning