articleMay 10, 2008Closed access

A comparative analysis of the efficiency of change metrics and static code attributes for defect prediction

Free University of Bozen-Bolzano · University of Alberta

Indexed incrossref

Abstract

In this paper we present a comparative analysis of the predictive power of two different sets of metrics for defect prediction. We choose one set of product related and one set of process related software metrics and use them for classifying Java files of the Eclipse project as defective respective defect-free. Classification models are built using three common machine learners: logistic regression, Naïve Bayes, and decision trees. To allow different costs for prediction errors we perform cost-sensitive classification, which proves to be very successful: >75% percentage of correctly classified files, a recall of >80%, and a false positive rate

Citation impact

703
total citations
FWCI
64.76
Percentile
100%
References
32
Citations per year

Authors

3

Topics & keywords

Keywords
  • Computer science
  • Eclipse
  • Data mining
  • Naive Bayes classifier
  • Java
  • Machine learning
  • Set (abstract data type)
  • Software bug
UN Sustainable Development Goals
  • Peace, Justice and strong institutions
No related works found for this paper.