AIGC video detection based on the fusion of spatial-frequency-optical flow multimodal features

Sheng, Hong; Xuanqi, Wang; Chang, Zhang; Jiacheng, Wang; Pingxia, Duan; Yuwei, Wang

doi:10.23919/jsee.2026.000049

articleJournal of Systems Engineering and ElectronicsJan 1, 2026DIAMOND OA

AIGC video detection based on the fusion of spatial-frequency-optical flow multimodal features

HSHong ShengWXWang Xuanqi ZCZhang Chang WJWang Jiacheng DPDuan Pingxia

Beihang University · Nanchang University · +3 more institutions

Indexed incrossref

Abstract

The rapid evolution of generative artificial intelligence (AI) (e.g., Sora, Hunyuan) makes it essential to develop effective detection strategies that can generalize across ever-evolving synthesis techniques. This study is motivated by the observation of a fundamental challenge in generative models: the inherent difficulty of maintaining cross-modal consistency between appearance and motion. To this end, we propose a multi-modal framework for AI generated content (AIGC) video forgery detection tasks, named cross-attention based video forgery detector (CrossAtt-VFD), based on joint multi-view analysis of content. Methodologically, we introduce a dual-branch architecture that simultaneously extracts…

Citation impact

6

total citations

FWCI: 142.54
Percentile: 100%
References: 0

Too recent for citation history.

Authors

6

HS
Hong ShengCorresponding
Beihang University
WX
Wang Xuanqi
Nanchang University
ZC
Zhang Chang
Beihang University
WJ
Wang Jiacheng
Beihang University
DP
Duan Pingxia
Alibaba Group (China)

Topics & keywords

Topics

Keywords

Fusion
Flow (mathematics)
Sensor fusion
Optical flow
Object detection

No related works found for this paper.