Robust Scene Text Recognition with Automatic Rectification
Huazhong University of Science and Technology
Abstract
Recognizing text in natural images is a challenging task with many unsolved problems. Different from those in documents, words in natural images often possess irregular shapes, which are caused by perspective distortion, curved character placement, etc. We propose RARE (Robust text recognizer with Automatic REctification), a recognition model that is robust to irregular text. RARE is a speciallydesigned deep neural network, which consists of a Spatial Transformer Network (STN) and a Sequence Recognition Network (SRN). In testing, an image is firstly rectified via a predicted Thin-Plate-Spline (TPS) transformation, into a more "readable" image for the following SRN, which recognizes text through a sequence…
Citation impact
- FWCI
- 27.37
- Percentile
- 100%
- References
- 60
Authors
5- BSBaoguang ShiCorresponding
Huazhong University of Science and Technology
- XWXinggang Wang
Huazhong University of Science and Technology
- PLPengyuan Lyu
Huazhong University of Science and Technology
- CYCong Yao
Huazhong University of Science and Technology
- XBXiang Bai
Huazhong University of Science and Technology
Topics & keywords
- Computer science
- Text recognition
- Artificial intelligence
- Rectification
- Perspective distortion
- Transformer
- Pattern recognition (psychology)
- Perspective (graphical)