Analysis of k-Fold Cross-Validation over Hold-Out Validation on Colossal Datasets for Quality Classification

Yadav, Sanjay Kumar; Shukla, Sanyam

doi:10.1109/iacc.2016.25

articleFeb 1, 2016Closed access

Analysis of k-Fold Cross-Validation over Hold-Out Validation on Colossal Datasets for Quality Classification

SKSanjay Kumar Yadav SSSanyam Shukla

Maulana Azad National Institute of Technology

Indexed incrossref

Abstract

While training a model with data from a dataset, we have to think of an ideal way to do so. The training should be done in such a way that while the model has enough instances to train on, they should not over-fit the model and at the same time, it must be considered that if there are not enough instances to train on, the model would not be trained properly and would give poor results when used for testing. Accuracy is important when it comes to classification and one must always strive to achieve the highest accuracy, provided there is not trade off with inexcusable time. While working on small datasets, the ideal choices are k-fold cross-validation with large value of k (but smaller than number of instances)…

Citation impact

695

total citations

FWCI: 25.18
Percentile: 100%
References: 23

Citations per year

Authors

2

Topics & keywords

Topics

Keywords

Cross-validation
Computer science
Model validation
Ideal (ethics)
Quality (philosophy)
Fold (higher-order function)
Value (mathematics)
Data mining

No related works found for this paper.