A Size-Aware Graph Embedding Approach to Remote Sensing Image Captioning With Object Relative Size Information
China University of Petroleum, East China
Abstract
Remote sensing image captioning is the task of automatically generating descriptive texts for remotely sensed scenes and objects. A common shortcoming of existing methods is the inadequate consideration of object size, which often leads to captions that either omit size information or provide imprecise size descriptions. To overcome this deficiency, we develop a novel framework composed of three modules: (a) an object confirmation and relative size estimation module, (b) a graph construction and graph convolution module, and (c) a caption generation module. Our framework comprehensively characterizes object size to generate more quantitatively informative captions. Furthermore, we introduce a new evaluation…
Citation impact
- FWCI
- 139.72
- Percentile
- 100%
- References
- 28
Authors
5- ZNZihao NiCorresponding
China University of Petroleum, East China
- YXYinghao Xu
China University of Petroleum, East China
- WZWeibo Zhang
China University of Petroleum, East China
- ZZZhaoyun Zong
China University of Petroleum, East China
- PRPeng Ren
China University of Petroleum, East China
Topics & keywords
- Correctness
- Object (grammar)
- Graph
- Benchmark (surveying)
- Embedding
- Construct (python library)
- Object detection
- Closed captioning