Machine Learning Methods for Small Data Challenges in Molecular Science
Wuhan Textile University · Michigan State University
Abstract
Small data are often used in scientific and engineering research due to the presence of various constraints, such as time, cost, ethics, privacy, security, and technical limitations in data acquisition. However, big data have been the focus for the past decade, small data and their challenges have received little attention, even though they are technically more severe in machine learning (ML) and deep learning (DL) studies. Overall, the small data challenge is often compounded by issues, such as data diversity, imputation, noise, imbalance, and high-dimensionality. Fortunately, the current big data era is characterized by technological breakthroughs in ML, DL, and artificial intelligence (AI), which enable…
Citation impact
- FWCI
- 81.59
- Percentile
- 100%
- References
- 591
Authors
10Topics & keywords
- Chemistry
- Data science
- Nanotechnology
- Computer science
Funding
- NSNational Science FoundationAwards: R01AI164266, 2052983, 1761320, 1900473, IIS-1900473, DMS-1761320, DMS-2052983
- NANational Aeronautics and Space AdministrationAward: 80NSSC21M0023
- BSBristol-Myers SquibbAward: 65109
- PPfizer
- MSMichigan State University Foundation
- NNNational Natural Science Foundation of ChinaAwards: IIS-1900473, 11972266, 12271416, 11971367
- NSNuclear Safety and Security Commission
- NINational Institutes of HealthAwards: R01GM126189, R01AI164266, IIS-1900473
- NINational Institute of General Medical SciencesAward: R01GM126189
- NINational Institute of Allergy and Infectious DiseasesAward: R01AI164266
- DODivision of Mathematical SciencesAwards: DMS-1761320, DMS-2052983, DMS-1721024
- DODivision of Information and Intelligent SystemsAward: IIS-1900473