Vision-Language Models in Remote Sensing: Current progress and future trends
King Abdullah University of Science and Technology · New York University Abu Dhabi · +4 more institutions
Abstract
The remarkable achievements of ChatGPT and Generative Pre-trained Transformer 4 (GPT-4) have sparked a wave of interest and research in the field of large language models (LLMs) for artificial general intelligence (AGI). These models provide intelligent solutions that are closer to human thinking, enabling us to use general artificial intelligence (AI) to solve problems in various applications. However, in the field of remote sensing (RS), the scientific literature on the implementation of AGI remains relatively scant. Existing AI-related research in RS focuses primarily on visual-understanding tasks while neglecting the semantic understanding of the objects and their relationships. This is where vision-LMs…
Citation impact
- FWCI
- 230.07
- Percentile
- 100%
- References
- 306
Authors
5- LXLi XiangCorresponding
King Abdullah University of Science and Technology
- CWCongcong Wen
New York University Abu Dhabi
- YHYuan Hu
Peking University, State Key Laboratory of Remote Sensing Science
- ZYZhenghang Yuan
Technical University of Munich
- XXXiao Xiang Zhu
Munich Center for Machine Learning, Technical University of Munich
Topics & keywords
- Remote sensing
- Computer science
- Current (fluid)
- Geography
- Engineering
- Electrical engineering