preprintarXiv (Cornell University)Sep 21, 2016GREEN OA

Imbalanced-learn: A Python Toolbox to Tackle the Curse of Imbalanced Datasets in Machine Learning

University of Patras

Indexed inarxivdatacite

Abstract

Imbalanced-learn is an open-source python toolbox aiming at providing a wide range of methods to cope with the problem of imbalanced dataset frequently encountered in machine learning and pattern recognition. The implemented state-of-the-art methods can be categorized into 4 groups: (i) under-sampling, (ii) over-sampling, (iii) combination of over- and under-sampling, and (iv) ensemble learning methods. The proposed toolbox only depends on numpy, scipy, and scikit-learn and is distributed under MIT license. Furthermore, it is fully compatible with scikit-learn and is part of the scikit-learn-contrib supported project. Documentation, unit tests as well as integration tests are provided to ease usage and…

Citation impact

1,560
total citations
FWCI
Percentile
References
13
Citations per year

Authors

3

Topics & keywords

Keywords
  • Toolbox
  • Python (programming language)
  • Computer science
  • Machine learning
  • Artificial intelligence
  • Programming language
No related works found for this paper.