PhysFormer: Facial Video-based Physiological Measurement with Temporal Difference Transformer
University of Oulu · University of Oxford · +2 more institutions
Abstract
Remote photoplethysmography (rPPG), which aims at measuring heart activities and physiological signals from facial video without any contact, has great potential in many applications. Recent deep learning approaches focus on mining subtle rPPG clues using convolutional neural networks with limited spatio-temporal receptive fields, which neglect the long-range spatio-temporal perception and interaction for rPPG modeling. In this paper, we propose the PhysFormer, an end-to-end video transformer based architecture, to adaptively aggregate both local and global spatio-temporal features for rPPG representation enhancement. As key modules in PhysFormer, the temporal difference transformers first enhance the…
Citation impact
- FWCI
- 76.68
- Percentile
- 100%
- References
- 107
Authors
6Topics & keywords
- Computer science
- Artificial intelligence
- Transformer
- Feature learning
- Overfitting
- Deep learning
- Convolutional neural network
- Segmentation