SAM2-UNet: segment anything 2 makes strong encoder for natural and medical image segmentation

Xiong, Xinyu; Wu, Zihuang; Tan, Shuangyi; Li, Wenxue; Tang, Feilong; Chen, Ying; Li, Siying; Ma, Jie; Li, Guanbin

doi:10.1007/s44267-025-00106-w

preprintVisual IntelligenceJan 13, 2026DIAMOND OA

SAM2-UNet: segment anything 2 makes strong encoder for natural and medical image segmentation

XXXinyu Xiong ZWZihuang Wu STShuangyi Tan WLWenxue Li FTFeilong Tang

Sun Yat-sen University · Jiangxi Normal University · +4 more institutions

Indexed inarxivcrossrefdatacitedoaj

Abstract

Abstract Image segmentation plays an important role in vision understanding. Recently, the emerging vision foundation models continuously achieved superior performance on various tasks. Following such success, in this paper, we prove that the Segment Anything Model 2 (SAM2) can be a strong encoder for U-shaped segmentation models. We propose a simple but effective framework, termed SAM2-UNet, for versatile image segmentation. Specifically, SAM2-UNet adopts the Hiera backbone of SAM2 as the encoder, while the decoder uses the classic U-shaped design. Additionally, adapters are inserted into the encoder to enable parameter-efficient fine-tuning. Preliminary experiments on various downstream tasks, such as…

Citation impact

43

total citations

FWCI: 369.38
Percentile: 100%
References: 20

Citations per year

Authors

9

Topics & keywords

Topics

Keywords

Natural (archaeology)
Segmentation
Computer vision
Encoder
Artificial intelligence
Image (mathematics)
Computer science
Image segmentation

No related works found for this paper.