preprintarXiv (Cornell University)Nov 21, 2019GREEN OA

Learning Spatial Fusion for Single-Shot Object Detection

Indexed inarxivdatacite

Abstract

Pyramidal feature representation is the common practice to address the challenge of scale variation in object detection. However, the inconsistency across different feature scales is a primary limitation for the single-shot detectors based on feature pyramid. In this work, we propose a novel and data driven strategy for pyramidal feature fusion, referred to as adaptively spatial feature fusion (ASFF). It learns the way to spatially filter conflictive information to suppress the inconsistency, thus improving the scale-invariance of features, and introduces nearly free inference overhead. With the ASFF strategy and a solid baseline of YOLOv3, we achieve the best speed-accuracy trade-off on the MS COCO dataset,…

Citation impact

466
total citations
FWCI
Percentile
References
44
Citations per year

Authors

3

Topics & keywords

Keywords
  • Computer science
  • Feature (linguistics)
  • Pyramid (geometry)
  • Artificial intelligence
  • Inference
  • Filter (signal processing)
  • Overhead (engineering)
  • Pattern recognition (psychology)
No related works found for this paper.