AIGC video detection based on the fusion of spatial-frequency-optical flow multimodal features
Beihang University · Nanchang University · +3 more institutions
Abstract
The rapid evolution of generative artificial intelligence (AI) (e.g., Sora, Hunyuan) makes it essential to develop effective detection strategies that can generalize across ever-evolving synthesis techniques. This study is motivated by the observation of a fundamental challenge in generative models: the inherent difficulty of maintaining cross-modal consistency between appearance and motion. To this end, we propose a multi-modal framework for AI generated content (AIGC) video forgery detection tasks, named cross-attention based video forgery detector (CrossAtt-VFD), based on joint multi-view analysis of content. Methodologically, we introduce a dual-branch architecture that simultaneously extracts…
Citation impact
- FWCI
- 142.54
- Percentile
- 100%
- References
- 0
Authors
6- HSHong ShengCorresponding
Beihang University
- WXWang Xuanqi
Nanchang University
- ZCZhang Chang
Beihang University
- WJWang Jiacheng
Beihang University
- DPDuan Pingxia
Alibaba Group (China)
Topics & keywords
- Fusion
- Flow (mathematics)
- Sensor fusion
- Optical flow
- Object detection