Deep Visual Attention Prediction

Wang, Wenguan; Shen, Jianbing

doi:10.1109/tip.2017.2787612

articleIEEE Transactions on Image ProcessingDec 27, 2017GREEN OA

Deep Visual Attention Prediction

WWWenguan WangJSJianbing Shen

Beijing Institute of Technology

PubMed

Indexed inarxivcrossrefpubmed

Abstract

In this paper, we aim to predict human eye fixation with view-free scenes based on an end-to-end deep learning architecture. Although convolutional neural networks (CNNs) have made substantial improvement on human attention prediction, it is still needed to improve the CNN-based attention models by efficiently leveraging multi-scale features. Our visual attention network is proposed to capture hierarchical saliency information from deep, coarse layers with global saliency information to shallow, fine layers with local saliency response. Our model is based on a skip-layer network structure, which predicts human attention from multiple convolutional layers with various reception fields. Final saliency prediction…

Citation impact

581

total citations

FWCI: 23.66
Percentile: 100%
References: 65

Citations per year

Authors

2

WW
Wenguan WangCorresponding
Beijing Institute of Technology
JS
Jianbing Shen
Beijing Institute of Technology

Topics & keywords

Topics

Keywords

Redundancy (engineering)
Deep learning
Visual attention
Saliency map
Benchmark (surveying)
Convolutional neural network
Inference
Visualization

No related works found for this paper.