A comparative analysis of the efficiency of change metrics and static code attributes for defect prediction

Moser, Raimund; Pedrycz, Witold; Succi, Giancarlo

doi:10.1145/1368088.1368114

articleMay 10, 2008Closed access

A comparative analysis of the efficiency of change metrics and static code attributes for defect prediction

RMRaimund Moser WPWitold Pedrycz GSGiancarlo Succi

Free University of Bozen-Bolzano · University of Alberta

Indexed incrossref

Abstract

In this paper we present a comparative analysis of the predictive power of two different sets of metrics for defect prediction. We choose one set of product related and one set of process related software metrics and use them for classifying Java files of the Eclipse project as defective respective defect-free. Classification models are built using three common machine learners: logistic regression, Naïve Bayes, and decision trees. To allow different costs for prediction errors we perform cost-sensitive classification, which proves to be very successful: >75% percentage of correctly classified files, a recall of >80%, and a false positive rate

Citation impact

703

total citations

FWCI: 64.76
Percentile: 100%
References: 32

Citations per year

Authors

3

Topics & keywords

Topics

Keywords

Computer science
Eclipse
Data mining
Naive Bayes classifier
Java
Machine learning
Set (abstract data type)
Software bug

UN Sustainable Development Goals

Peace, Justice and strong institutions

No related works found for this paper.