SeqTrack: Sequence to Sequence Learning for Visual Object Tracking

Chen, Xin; Peng, Houwen; Wang, Dong; Lu, Huchuan; Han, Hu

doi:10.1109/cvpr52729.2023.01400

Article · Jun 1, 2023 · Closed access

Dalian University of Technology · Microsoft Research (United Kingdom) · +1 more institution

Indexed in Crossref

Abstract

In this paper, we present a new sequence-to-sequence learning framework for visual tracking, dubbed SeqTrack. It casts visual tracking as a sequence generation problem, which predicts object bounding boxes in an autoregressive fashion. This is different from prior Siamese trackers and transformer trackers, which rely on designing complicated head networks, such as classification and regression heads. SeqTrack only adopts a simple encoder-decoder transformer architecture. The encoder extracts visual features with a bidirectional transformer, while the decoder generates a sequence of bounding box values autoregressively with a causal transformer. The loss function is a plain cross-entropy. Such a sequence…
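As a rough illustration of the idea in the abstract, the sketch below treats a bounding box as a short sequence of discrete tokens, scores a predicted distribution with cross-entropy, and decodes tokens one at a time. It is a minimal sketch under stated assumptions, not the authors' implementation: the bin count `N_BINS`, the four-value box layout, the normalization to [0, 1], and the helper names (`box_to_tokens`, `tokens_to_box`, `cross_entropy`, `greedy_decode`, `step_fn`) are all hypothetical and do not come from the abstract.

```python
import math

N_BINS = 1000  # assumed vocabulary size; the abstract does not state one


def box_to_tokens(box, n_bins=N_BINS):
    """Quantize a box with values normalized to [0, 1] into integer tokens."""
    return [min(n_bins - 1, max(0, int(v * n_bins))) for v in box]


def tokens_to_box(tokens, n_bins=N_BINS):
    """Map tokens back to the centers of their bins in [0, 1]."""
    return [(t + 0.5) / n_bins for t in tokens]


def cross_entropy(logits, target):
    """Cross-entropy of softmax(logits) against a single target index."""
    m = max(logits)  # subtract the max for numerical stability
    log_z = m + math.log(sum(math.exp(l - m) for l in logits))
    return log_z - logits[target]


def greedy_decode(step_fn, length=4):
    """Generate `length` tokens; each step is conditioned on the tokens so far.

    `step_fn` stands in for a causal decoder: it takes the list of tokens
    generated so far and returns a list of logits over the vocabulary.
    """
    tokens = []
    for _ in range(length):
        logits = step_fn(tokens)
        tokens.append(max(range(len(logits)), key=logits.__getitem__))
    return tokens
```

In this sketch, training would sum `cross_entropy` over the target tokens of each box, and inference would call `greedy_decode` with the decoder in place of `step_fn`, then convert the result with `tokens_to_box`.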

Citation impact

Total citations: 383

FWCI: 43.50
Percentile: 100%
References: 59

Authors: 5

Topics & keywords

Keywords

Computer science
Minimum bounding box
Artificial intelligence
Transformer
Sequence learning
BitTorrent tracker
Encoder
Sequence (biology)

Funding

National Natural Science Foundation of China
Award: 62022021, 62293542