Discovering important people and objects for egocentric video summarization
The University of Texas at Austin
Abstract
We present a video summarization approach for egocentric or “wearable” camera data. Given hours of video, the proposed method produces a compact storyboard summary of the camera wearer's day. In contrast to traditional keyframe selection techniques, the resulting summary focuses on the most important objects and people with which the camera wearer interacts. To accomplish this, we develop region cues indicative of high-level saliency in egocentric video — such as the nearness to hands, gaze, and frequency of occurrence — and learn a regressor to predict the relative importance of any new region based on these cues. Using these predictions and a simple form of temporal event detection, our method selects frames…
Citation impact
- FWCI
- 38.84
- Percentile
- 100%
- References
- 35
Authors
3Topics & keywords
- Automatic summarization
- Computer science
- Artificial intelligence
- Computer vision
- Information retrieval
- Computer graphics (images)
- Multimedia
- Reduced inequalities