Med-R1: Reinforcement Learning for Generalizable Medical Reasoning in Vision-Language Models

Lai, Yuxiang; Zhong, Jike; Li, Ming; Zhao, Shitian; Li, Yuheng; Psounis, Konstantinos; Yang, Xiaofeng

doi:10.1109/tmi.2026.3661001

Article · IEEE Transactions on Medical Imaging · Jan 1, 2026 · Closed access

Emory University · University of Southern California · +3 more institutions

Indexed in: Crossref, PubMed

Abstract

Vision-language models (VLMs) have achieved impressive progress in natural image reasoning, yet their potential in medical imaging remains underexplored. Medical vision-language tasks demand precise understanding and clinically coherent answers, which are difficult to achieve due to the complexity of medical data and the scarcity of high-quality expert annotations. These challenges limit the effectiveness of conventional supervised fine-tuning (SFT) and Chain-of-Thought (CoT) strategies that work well in general domains. To address them, we propose Med-R1, a reinforcement learning (RL)-enhanced VLM designed to improve generalization and reliability in medical reasoning. Med-R1 adopts Group Relative…

Citation impact

Total citations: 6

FWCI: 131.02
Percentile: 100%
References: 0

Authors: 7

Topics & keywords

Keywords

Generalization
Reinforcement learning
Process (computing)
Reliability (semiconductor)
Medical imaging
Limit (mathematics)
Component (thermodynamics)
Quality (philosophy)

Funding

National Institutes of Health
Awards: R01EB032680, R01DE033512, R01CA272991, U54CA274513