UniRepLKNet: A Universal Perception Large-Kernel ConvNet for Audio, Video, Point Cloud, Time-Series and Image Recognition
Tencent (China) · Chinese University of Hong Kong
Abstract
Large-kernel convolutional neural networks (ConvNets) have recently received extensive research attention, but two unresolved and critical issues demand further investigation. 1) The architectures of existing large-kernel ConvNets largely follow the design principles of conventional ConvNets or transformers, while the architectural design for large-kernel ConvNets remains under-addressed. 2) As transformers have dominated multiple modalities, it re-mains to be investigated whether ConvNets also have a strong universal perception ability in domains beyond vision. In this paper, we contribute from two aspects. 1) We propose four architectural guidelines for designing large- kernel ConvNets, the core of which is…
Citation impact
- FWCI
- 59.79
- Percentile
- 100%
- References
- 0
Authors
7Topics & keywords
- Computer science
- Series (stratigraphy)
- Cloud computing
- Kernel (algebra)
- Perception
- Artificial intelligence
- Point cloud
- Point (geometry)