SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation
Abstract
We present a novel and practical deep fully convolutional neural network architecture for semantic pixel-wise segmentation termed SegNet. This core trainable segmentation engine consists of an encoder network, a corresponding decoder network followed by a pixel-wise classification layer. The architecture of the encoder network is topologically identical to the 13 convolutional layers in the VGG16 network [1] . The role of the decoder network is to map the low resolution encoder feature maps to full input resolution feature maps for pixel-wise classification. The novelty of SegNet lies is in the manner in which the decoder upsamples its lower resolution input feature map(s). Specifically, the decoder uses…
Citation impact
- FWCI
- 89.58
- Percentile
- 100%
- References
- 97
Authors
3- VBVijay BadrinarayananCorresponding
University of Cambridge
- AKAlex Kendall
University of Cambridge
- RCRoberto Cipolla
University of Cambridge
Topics & keywords
- Computer science
- Artificial intelligence
- Encoder
- Upsampling
- Feature (linguistics)
- Segmentation
- Convolutional neural network
- Pattern recognition (psychology)