Detail publikace

Offline Handwritten Text Recognition Using Support Vector Machines

RAJNOHA, M. BURGET, R. DUTTA, M. K. ISSAC, A.

Originální název

Offline Handwritten Text Recognition Using Support Vector Machines

Typ

článek ve sborníku ve WoS nebo Scopus

Jazyk

angličtina

Originální abstrakt

Comenia script is a novel handwritten text introduced at primary schools in the Czech Republic. This paper describes a method for handwritten text recognition (HWR) of this font. In particular it proposes a method for preprocessing and normalization of data and optical character recognition based on SVM classifier. We have trained and statistically evaluated several models, where we have focused on recognition of different styles of writing of the same characters - for the forensic purposes and identification of the author of a document. The best model has achieved 92.86 % accuracy without any further postprocessing, e.g. a spellchecker. We also proposed using more than one classification model for character recognition that has shown to increase accuracy when compared to a single model approach.

Klíčová slova

HWR; OCR; SVM; support vector machines; text recognition

Autoři

RAJNOHA, M.; BURGET, R.; DUTTA, M. K.; ISSAC, A.

Vydáno

2. 2. 2017

ISBN

978-1-5090-2796-5

Kniha

2017 4th International Conference on Signal Processing and Integrated Networks (SPIN)

Strany od

132

Strany do

136

Strany počet

5

URL

BibTex

@inproceedings{BUT133620,
  author="Martin {Rajnoha} and Radim {Burget} and Malay Kishore {Dutta} and Ashish {Issac}",
  title="Offline Handwritten Text Recognition Using Support Vector Machines",
  booktitle="2017 4th International Conference on Signal Processing and Integrated Networks (SPIN)",
  year="2017",
  pages="132--136",
  doi="10.1109/SPIN.2017.8049930",
  isbn="978-1-5090-2796-5",
  url="https://ieeexplore.ieee.org/document/8049930"
}