Causal inference and the data-fusion problem
University of California, Los Angeles · Purdue University West Lafayette
Abstract
We review concepts, principles, and tools that unify current approaches to causal analysis and attend to new challenges presented by big data. In particular, we address the problem of data fusion-piecing together multiple datasets collected under heterogeneous conditions (i.e., different populations, regimes, and sampling methods) to obtain valid answers to queries of interest. The availability of multiple heterogeneous datasets presents new opportunities to big data analysts, because the knowledge that can be acquired from combined data would not be possible from any individual source alone. However, the biases that emerge in heterogeneous environments require new analytical tools. Some of these biases,…
Citation impact
- FWCI
- 53.01
- Percentile
- 100%
- References
- 64
Authors
2Topics & keywords
- Inference
- Causal inference
- Computer science
- Artificial intelligence
- Machine learning
- Econometrics
- Mathematics