articleIEEE Transactions on Software EngineeringApr 1, 2005Closed access

Predicting the location and number of faults in large software systems

AT&T (United States)

Indexed incrossref

Abstract

Advance knowledge of which files in the next release of a large software system are most likely to contain the largest numbers of faults can be a very valuable asset. To accomplish this, a negative binomial regression model has been developed and used to predict the expected number of faults in each file of the next release of a system. The predictions are based on the code of the file in the current release, and fault and modification history of the file from previous releases. The model has been applied to two large industrial systems, one with a history of 17 consecutive quarterly releases over 4 years, and the other with nine releases over 2 years. The predictions were quite accurate: for each release of…

Citation impact

718
total citations
FWCI
84.76
Percentile
100%
References
20
Citations per year

Authors

3

Topics & keywords

Keywords
  • Computer science
  • Fault (geology)
  • Software
  • Code (set theory)
  • Set (abstract data type)
  • Fault model
  • Operating system
  • Programming language
No related works found for this paper.