Patrick McGuire: Telugu OCR Framework using Deep Learning. (arXiv:1509.05962v1 [stat.ML])

Monday, September 21, 2015

In this paper, we address the task of Optical Character Recognition (OCR) for the Telugu script. We present an end-to-end framework that segments the text image, classifies the characters, and extracts lines using a language model. The segmentation is based on mathematical morphology. The classification module, which is the most challenging of the three tasks, is a deep convolutional neural network. The language is modelled as a third-order Markov chain at the glyph level. Telugu script is a complex abugida and the language is agglutinative, which makes the problem hard. We apply the latest advances in neural networks to achieve acceptable error rates.
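
To make the glyph-level language model more concrete, here is a minimal Python sketch of a Markov chain in which each glyph is conditioned on the three glyphs before it. It is an illustration only, not the paper's implementation: the class name `GlyphMarkovModel`, the add-alpha smoothing, the `<s>`/`</s>` padding tokens, and the toy training data (Latin letters standing in for Telugu glyph labels) are all assumptions, since the abstract does not say how the chain is estimated or how it is combined with the classifier's output.

```python
import math
from collections import defaultdict

ORDER = 3  # each glyph is conditioned on the previous three glyphs


class GlyphMarkovModel:
    """Hypothetical glyph-level Markov chain with add-alpha smoothing."""

    def __init__(self, order=ORDER, alpha=1.0):
        self.order = order
        self.alpha = alpha  # smoothing constant (an assumption, not from the paper)
        self.counts = defaultdict(lambda: defaultdict(int))
        self.vocab = set()

    def _pad(self, glyphs):
        # Start and end markers so the first and last glyphs have full contexts.
        return ["<s>"] * self.order + list(glyphs) + ["</s>"]

    def train(self, sequences):
        # Count how often each glyph follows each three-glyph context.
        for seq in sequences:
            padded = self._pad(seq)
            self.vocab.update(padded[self.order:])
            for i in range(self.order, len(padded)):
                context = tuple(padded[i - self.order:i])
                self.counts[context][padded[i]] += 1

    def log_prob(self, glyphs):
        # Sum of smoothed log P(glyph | previous three glyphs) over the sequence.
        padded = self._pad(glyphs)
        vocab_size = len(self.vocab) + 1  # one extra slot for unseen glyphs
        total = 0.0
        for i in range(self.order, len(padded)):
            context = tuple(padded[i - self.order:i])
            seen = self.counts.get(context, {})
            numerator = seen.get(padded[i], 0) + self.alpha
            denominator = sum(seen.values()) + self.alpha * vocab_size
            total += math.log(numerator / denominator)
        return total


# Toy data: placeholder labels, not real Telugu glyphs.
model = GlyphMarkovModel()
model.train([["a", "b", "c", "d"], ["a", "b", "c", "d"], ["a", "b", "c", "e"]])

better = model.log_prob(["a", "b", "c", "d"])  # ending seen twice in training
worse = model.log_prob(["a", "b", "c", "e"])   # ending seen once in training
```

In a pipeline like the one described, a score of this kind could be used to rank alternative glyph sequences proposed by the classifier, preferring the sequence the model finds more probable. How the paper actually does this is not stated in the abstract.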

from cs.AI updates on arXiv.org http://ift.tt/1KIvpiF
via IFTTT
