Random Forest: A Classification and Regression Tool for Compound Classification and QSAR Modeling
Indexed incrossrefpubmed
Abstract
A new classification and regression tool, Random Forest, is introduced and investigated for predicting a compound's quantitative or categorical biological activity based on a quantitative description of the compound's molecular structure. Random Forest is an ensemble of unpruned classification or regression trees created by using bootstrap samples of the training data and random feature selection in tree induction. Prediction is made by aggregating (majority vote or averaging) the predictions of the ensemble. We built predictive models for six cheminformatics data sets. Our analysis demonstrates that Random Forest is a powerful tool capable of delivering performance that is among the most accurate methods to…
Citation impact
3,552
total citations
- FWCI
- 18.20
- Percentile
- 100%
- References
- 23
Citations per year
Authors
6Topics & keywords
Topics
Keywords
- Random forest
- Cheminformatics
- Categorical variable
- Feature selection
- Artificial intelligence
- Computer science
- Regression
- Ensemble learning
UN Sustainable Development Goals
- Life in Land
No related works found for this paper.