Analysis of k-Fold Cross-Validation over Hold-Out Validation on Colossal Datasets for Quality Classification
Maulana Azad National Institute of Technology
Abstract
While training a model with data from a dataset, we have to think of an ideal way to do so. The training should be done in such a way that while the model has enough instances to train on, they should not over-fit the model and at the same time, it must be considered that if there are not enough instances to train on, the model would not be trained properly and would give poor results when used for testing. Accuracy is important when it comes to classification and one must always strive to achieve the highest accuracy, provided there is not trade off with inexcusable time. While working on small datasets, the ideal choices are k-fold cross-validation with large value of k (but smaller than number of instances)…
Citation impact
- FWCI
- 25.18
- Percentile
- 100%
- References
- 23
Authors
2Topics & keywords
- Cross-validation
- Computer science
- Model validation
- Ideal (ethics)
- Quality (philosophy)
- Fold (higher-order function)
- Value (mathematics)
- Data mining