Learning Video Object Segmentation from Static Images

Perazzi, Federico; Khoreva, Anna; Benenson, Rodrigo; Schiele, Bernt; Sorkine‐Hornung, Alexander

doi:10.1109/cvpr.2017.372

articleJul 1, 2017Closed access

Learning Video Object Segmentation from Static Images

FPFederico Perazzi AKAnna Khoreva RBRodrigo Benenson BSBernt Schiele ASAlexander Sorkine‐Hornung

ETH Zurich · Walt Disney (United States) · +1 more institution

Indexed incrossref

Abstract

Inspired by recent advances of deep learning in instance segmentation and object tracking, we introduce the concept of convnet-based guidance applied to video object segmentation. Our model proceeds on a per-frame basis, guided by the output of the previous frame towards the object of interest in the next frame. We demonstrate that highly accurate object segmentation in videos can be enabled by using a convolutional neural network (convnet) trained with static images only. The key component of our approach is a combination of offline and online learning strategies, where the former produces a refined mask from the previous frame estimate and the latter allows to capture the appearance of the specific object…

Citation impact

610

total citations

FWCI: 24.97
Percentile: 100%
References: 74

Citations per year

Authors

5

Topics & keywords

Topics

Keywords

Computer science
Artificial intelligence
Segmentation
Object (grammar)
Frame (networking)
Convolutional neural network
Computer vision
Bounding overwatch

No related works found for this paper.