OverLoCK: An Overview-first-Look-Closely-next ConvNet with Context-Mixing Dynamic Kernels

Lou, Meng; Yu, Yizhou

doi:10.1109/cvpr52734.2025.00021

articleJun 10, 2025GREEN OA

OverLoCK: An Overview-first-Look-Closely-next ConvNet with Context-Mixing Dynamic Kernels

MLMeng Lou YYYizhou Yu

University of Hong Kong

Indexed inarxivcrossref

Abstract

Top-down attention plays a crucial role in the human vision system, wherein the brain initially obtains a rough overview of a scene to discover salient cues (i.e., overview first), followed by a more careful finer-grained examination (i.e., look closely next). However, modern ConvNets remain confined to a pyramid structure that successively downsamples the feature map for receptive field expansion, neglecting this crucial biomimetic principle. We present OverLoCK, the first pure ConvNet backbone architecture that explicitly incorporates a top-down attention mechanism. Unlike pyramid backbone networks, our design features a branched architecture with three synergistic sub-networks: 1) a Base-Net that encodes…

Citation impact

47

total citations

FWCI: 88.02
Percentile: 100%
References: 0

Citations per year

Authors

2

Topics & keywords

Topics

Keywords

Computer science
Mixing (physics)
Context (archaeology)
Physics

No related works found for this paper.