articleFeb 1, 2016Closed access

Analysis of k-Fold Cross-Validation over Hold-Out Validation on Colossal Datasets for Quality Classification

Maulana Azad National Institute of Technology

Indexed incrossref

Abstract

While training a model with data from a dataset, we have to think of an ideal way to do so. The training should be done in such a way that while the model has enough instances to train on, they should not over-fit the model and at the same time, it must be considered that if there are not enough instances to train on, the model would not be trained properly and would give poor results when used for testing. Accuracy is important when it comes to classification and one must always strive to achieve the highest accuracy, provided there is not trade off with inexcusable time. While working on small datasets, the ideal choices are k-fold cross-validation with large value of k (but smaller than number of instances)…

Citation impact

695
total citations
FWCI
25.18
Percentile
100%
References
23
Citations per year

Authors

2

Topics & keywords

Keywords
  • Cross-validation
  • Computer science
  • Model validation
  • Ideal (ethics)
  • Quality (philosophy)
  • Fold (higher-order function)
  • Value (mathematics)
  • Data mining
No related works found for this paper.