TPOT: A Tree-Based Pipeline Optimization Tool for Automating Machine Learning

Midwest Orthopaedic Research Foundation · University of Pennsylvania

Indexed incrossref

Abstract

As data science becomes increasingly mainstream, there will be an ever-growing demand for data science tools that are more accessible, flexible, and scalable. In response to this demand, automated machine learning (AutoML) researchers have begun building systems that automate the process of designing and optimizing machine learning pipelines. In this chapter we present TPOT v0.3, an open source genetic programming-based AutoML system that optimizes a series of feature preprocessors and machine learning models with the goal of maximizing classification accuracy on a supervised classification task. We benchmark TPOT on a series of 150 supervised classification tasks and find that it significantly outperforms a…

Citation impact

552
total citations
FWCI
89.82
Percentile
100%
References
24
Citations per year

Authors

2

Topics & keywords

Keywords
  • Machine learning
  • Genetic programming
  • Computer science
  • Artificial intelligence
  • Benchmark (surveying)
  • Pipeline (software)
  • Feature (linguistics)
  • Domain (mathematical analysis)
No related works found for this paper.