Data augmentation: A comprehensive survey of modern approaches
Cape Coast Technical University · University of Mines and Technology
Abstract
To ensure good performance, modern machine learning models typically require large amounts of quality annotated data. Meanwhile, the data collection and annotation processes are usually performed manually, and consume a lot of time and resources. The quality and representativeness of curated data for a given task is usually dictated by the natural availability of clean data in the particular domain as well as the level of expertise of developers involved. In many real-world application settings it is often not feasible to obtain sufficient training data. Currently, data augmentation is the most effective way of alleviating this problem. The main goal of data augmentation is to increase the volume, quality and…
Citation impact
- FWCI
- 68.91
- Percentile
- 100%
- References
- 324
Authors
2Topics & keywords
- Computer science
- Machine learning
- Data quality
- Representativeness heuristic
- Rendering (computer graphics)
- Data science
- Artificial intelligence
- Computer graphics
- Industry, innovation and infrastructure