CodaMosa: Escaping Coverage Plateaus in Test Generation with Pre-trained Large Language Models

Lemieux, Caroline; Inala, Jeevana Priya; Lahiri, Shuvendu K.; Sen, Siddhartha

doi:10.1109/icse48619.2023.00085

articleMay 1, 2023Closed access

CodaMosa: Escaping Coverage Plateaus in Test Generation with Pre-trained Large Language Models

CLCaroline Lemieux JPJeevana Priya Inala SKShuvendu K. Lahiri SSSiddhartha Sen

University of British Columbia · Microsoft (United States)

Indexed incrossref

Abstract

Search-based software testing (SBST) generates high-coverage test cases for programs under test with a combination of test case generation and mutation. SBST's performance relies on there being a reasonable probability of generating test cases that exercise the core logic of the program under test. Given such test cases, SBST can then explore the space around them to exercise various parts of the program. This paper explores whether Large Language Models (LLMs) of code, such as OpenAI's Codex, can be used to help SBST's exploration. Our proposed algorithm, CodaMosa, conducts SBST until its coverage improvements stall, then asks Codex to provide example test cases for under-covered functions. These examples…

Citation impact

221

total citations

FWCI: 46.25
Percentile: 100%
References: 67

Citations per year

Authors

4

Topics & keywords

Topics

Keywords

Test (biology)
Computer science
Code coverage
Language model
Machine learning
Code (set theory)
Space (punctuation)
Artificial intelligence

UN Sustainable Development Goals

Quality Education

No related works found for this paper.