Rethinking ImageNet Pre-Training
Meta (Israel) · Facility for Antiproton and Ion Research
Abstract
We report competitive results on object detection and instance segmentation on the COCO dataset using standard models trained from random initialization. The results are no worse than their ImageNet pre-training counterparts even when using the hyper-parameters of the baseline system (Mask R-CNN) that were optimized for fine-tuning pre-trained models, with the sole exception of increasing the number of training iterations so the randomly initialized models may converge. Training from random initialization is surprisingly robust; our results hold even when: (i) using only 10% of the training data, (ii) for deeper and wider models, and (iii) for multiple tasks and metrics. Experiments show that ImageNet…
Citation impact
- FWCI
- 65.83
- Percentile
- 100%
- References
- 73
Authors
3Topics & keywords
- Initialization
- Computer science
- Training (meteorology)
- Artificial intelligence
- Object detection
- Machine learning
- Regularization (linguistics)
- Training set