TALL: Temporal Activity Localization via Language Query
Southern California University for Professional Studies · University of Southern California · +1 more institution
Abstract
This paper focuses on temporal localization of actions in untrimmed videos. Existing methods typically train classifiers for a pre-defined list of actions and apply them in a sliding window fashion. However, activities in the wild consist of a wide combination of actors, actions and objects; it is difficult to design a proper activity list that meets users' needs. We propose to localize activities by natural language queries. Temporal Activity Localization via Language (TALL) is challenging as it requires: (1) suitable design of text and video representations to allow cross-modal matching of actions and language queries; (2) ability to locate actions accurately given features from sliding windows of limited…
Citation impact
- FWCI
- 14.42
- Percentile
- 100%
- References
- 52
Authors
4- JGJiyang GaoCorresponding
Southern California University for Professional Studies, University of Southern California
- CSChen Sun
Google (United States)
- ZYZhenheng Yang
Southern California University for Professional Studies, University of Southern California
- RNRam Nevatia
University of Southern California, Southern California University for Professional Studies
Topics & keywords
- Computer science
- Sentence
- Task (project management)
- Matching (statistics)
- Artificial intelligence
- CLIPS
- Modal
- Sliding window protocol
- Quality Education