preprintACM Computing SurveysMar 28, 2026HYBRID OA

Data-Centric Foundation Models in Computational Healthcare: A Survey

Shanghai Jiao Tong University · Rutgers Sexual and Reproductive Health and Rights

Indexed inarxivcrossrefdatacite

Abstract

The advent of foundation models (FMs) as an emerging suite of AI techniques has struck a wave of opportunities in computational healthcare. The interactive nature of these models, guided by pre-training data and human instructions, has ignited a data-centric AI paradigm that emphasizes better data characterization, quality, and scale. In healthcare AI, obtaining and processing high-quality clinical data records has been a longstanding challenge, encompassing data quantity, annotation, patient privacy, and ethics. In this survey, we investigate a wide range of data-centric approaches in the FM era (from model pre-training to inference) towards improving the healthcare workflow. We discuss key perspectives in AI…

No related works found for this paper.