Theory-Guided Data Science: A New Paradigm for Scientific Discovery from Data

University of Minnesota · University of Cincinnati · +3 more institutions

Indexed inarxivcrossref

Abstract

Data science models, although successful in a number of commercial domains, have had limited applicability in scientific problems involving complex physical phenomena. Theory-guided data science (TGDS) is an emerging paradigm that aims to leverage the wealth of scientific knowledge for improving the effectiveness of data science models in enabling scientific discovery. The overarching vision of TGDS is to introduce scientific consistency as an essential component for learning generalizable models. Further, by producing scientifically interpretable models, TGDS aims to advance our scientific understanding by discovering novel domain insights. Indeed, the paradigm of TGDS has started to gain prominence in a…

Citation impact

1,470
total citations
FWCI
158.58
Percentile
100%
References
104
Citations per year

Authors

9

Topics & keywords

Keywords
  • Data science
  • Computer science
  • Leverage (statistics)
  • Scientific discovery
  • Data discovery
  • Domain (mathematical analysis)
  • Artificial intelligence
  • Cognitive science
UN Sustainable Development Goals
  • Climate action
No related works found for this paper.

Funding