articleJun 16, 2024Closed access
EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything
Indexed incrossref
Abstract
Segment Anything Model (SAM) has emerged as a powerful tool for numerous vision applications. A key component that drives the impressive performance for zero-shot trans-fer and high versatility is a super large Transformer model trained on the extensive high-quality SA -1 B dataset. While beneficial, the huge computation cost of SAM model has limited its applications to wider real-world applications. To address this limitation, we propose EfficientSAMs, light-weight SAM models that exhibits decent performance with largely reduced complexity. Our idea is based on leveraging masked image pretraining, SAMI, which learns to reconstruct features from SAM image encoder for effective visual representation learning.…
Citation impact
189
total citations
- FWCI
- 43.03
- Percentile
- 100%
- References
- 94
Citations per year
Authors
12Topics & keywords
Topics
Keywords
- Computer science
- Image (mathematics)
- Artificial intelligence
- Computer vision
No related works found for this paper.