An End-to-End Trainable Neural Network for Image-Based Sequence Recognition and Its Application to Scene Text Recognition
Huazhong University of Science and Technology
Abstract
Image-based sequence recognition has been a long-standing research topic in computer vision. In this paper, we investigate the problem of scene text recognition, which is among the most important and challenging tasks in image-based sequence recognition. A novel neural network architecture, which integrates feature extraction, sequence modeling and transcription into a unified framework, is proposed. Compared with previous systems for scene text recognition, the proposed architecture possesses four distinctive properties: (1) It is end-to-end trainable, in contrast to most of the existing algorithms whose components are separately trained and tuned. (2) It naturally handles sequences in arbitrary lengths,…
Citation impact
- FWCI
- 98.15
- Percentile
- 100%
- References
- 51
Authors
3Topics & keywords
- Artificial intelligence
- Computer science
- Artificial neural network
- End-to-end principle
- Computer vision
- Pattern recognition (psychology)
- Sequence (biology)
- Image (mathematics)