TEA: Temporal Excitation and Aggregation for Action Recognition
Tencent (China) · Nanjing University · +1 more institution
Abstract
Temporal modeling is key for action recognition in videos. It normally considers both short-range motions and long-range aggregations. In this paper, we propose a Temporal Excitation and Aggregation (TEA) block, including a motion excitation (ME) module and a multiple temporal aggregation (MTA) module, specifically designed to capture both short- and long-range temporal evolution. In particular, for short-range motion modeling, the ME module calculates the feature-level temporal differences from spatiotemporal features. It then utilizes the differences to excite the motion-sensitive channels of the features. The long-range temporal aggregations in previous works are typically achieved by stacking a large…
Citation impact
- FWCI
- 36.58
- Percentile
- 100%
- References
- 64
Authors
6Topics & keywords
- Computer science
- Convolution (computer science)
- Temporal resolution
- Block (permutation group theory)
- Residual
- Feature (linguistics)
- Artificial intelligence
- Temporal database
- Sustainable cities and communities