Learning to Compose Dynamic Tree Structures for Visual Contexts

Tang, Kaihua; Zhang, Hanwang; Wu, Baoyuan; Luo, Wenhan; Liu, Wei

doi:10.1109/cvpr.2019.00678

articleJun 1, 2019Closed access

Learning to Compose Dynamic Tree Structures for Visual Contexts

KTKaihua Tang HZHanwang Zhang BWBaoyuan Wu WLWenhan Luo WLWei Liu

Nanyang Technological University · Tencent (China)

Indexed incrossref

Abstract

We propose to compose dynamic tree structures that place the objects in an image into a visual context, helping visual reasoning tasks such as scene graph generation and visual Q&A. Our visual context tree model, dubbed VCTree, has two key advantages over existing structured object representations including chains and fully-connected graphs: 1) The efficient and expressive binary tree encodes the inherent parallel/hierarchical relationships among objects, e.g., ``clothes'' and ``pants'' are usually co-occur and belong to ``person''; 2) the dynamic structure varies from image to image and task to task, allowing more content-/task-specific message passing among objects. To construct a VCTree, we design a score…

Citation impact

533

total citations

FWCI: 23.27
Percentile: 100%
References: 91

Citations per year

Authors

5

Topics & keywords

Topics

Keywords

Computer science
Artificial intelligence
Tree (set theory)
Tree structure
Scene graph
Context (archaeology)
Graph
Visualization

No related works found for this paper.