Distributed Foundation Models for Multi-Modal Learning in 6G Wireless Networks
Tsinghua University · Zhejiang University · +3 more institutions
Abstract
Benefiting from the ability to process and integrate data from various modalities, multi-modal foundation models (FMs) facilitate potential applications across a range of fields, including computer vision (CV), natural language processing (NLP), and diverse multi-modal applications such as imagetext retrieval. Currently, FMs are deployed on computing clusters for training and inference to meet their considerable computational demands. In the foreseeable future, the parameter size of FMs is expected to evolve further, posing challenges to both computation resources and energy supply. Fortunately, leveraging the next-generation wireless networks (6G) to aggregate substantial computation resources and multi-modal…
Citation impact
- FWCI
- 28.36
- Percentile
- 100%
- References
- 14
Authors
6Topics & keywords
- Computer science
- Data parallelism
- Pipeline (software)
- Artificial intelligence
- Wireless
- Context (archaeology)
- Distributed computing
- Deep learning