See More, Know More: Unsupervised Video Object Segmentation With Co-Attention Siamese Networks
Inception Institute of Artificial Intelligence · Shanghai Jiao Tong University · +2 more institutions
Abstract
We introduce a novel network, called as CO-attention Siamese Network (COSNet), to address the unsupervised video object segmentation task from a holistic view. We emphasize the importance of inherent correlation among video frames and incorporate a global co-attention mechanism to improve further the state-of-the-art deep learning based solutions that primarily focus on learning discriminative foreground representations over appearance and motion in short-term temporal segments. The co-attention layers in our network provide efficient and competent stages for capturing global correlations and scene context by jointly computing and appending co-attention responses into a joint feature space. We train COSNet…
Citation impact
- FWCI
- 34.55
- Percentile
- 100%
- References
- 101
Authors
6- XLXiankai LuCorresponding
Inception Institute of Artificial Intelligence
- WWWenguan Wang
Inception Institute of Artificial Intelligence
- CMChao Ma
Shanghai Jiao Tong University
- JSJianbing Shen
Beijing Institute of Technology, Inception Institute of Artificial Intelligence
- LSLing Shao
Inception Institute of Artificial Intelligence
Topics & keywords
- Computer science
- Discriminative model
- Artificial intelligence
- Segmentation
- Context (archaeology)
- Margin (machine learning)
- Focus (optics)
- Object (grammar)
- Reduced inequalities