Places: A 10 Million Image Database for Scene Recognition
Massachusetts Institute of Technology · Universitat Oberta de Catalunya
Abstract
The rise of multi-million-item dataset initiatives has enabled data-hungry machine learning algorithms to reach near-human semantic classification performance at tasks such as visual object and scene recognition. Here we describe the Places Database, a repository of 10 million scene photographs, labeled with scene semantic categories, comprising a large and diverse list of the types of environments encountered in the world. Using the state-of-the-art Convolutional Neural Networks (CNNs), we provide scene classification CNNs (Places-CNNs) as baselines, that significantly outperform the previous approaches. Visualization of the CNNs trained on Places shows that object detectors emerge as an intermediate…
Citation impact
- FWCI
- 103.32
- Percentile
- 100%
- References
- 49
Authors
5Topics & keywords
- Computer science
- Artificial intelligence
- Computer vision
- Image processing
- Image segmentation
- Pattern recognition (psychology)
- Image (mathematics)
- Database