- J. R. Rico-Juan; J. I Abreu
"A new editing scheme based on a fast two-string median computation applied to OCR"
Structural, Syntactic, and Statistical Pattern Recognition, ISBN: 978-3-642-14979-5, pp. 748--756, Cesme, Izmir, Turkey
This paper presents a new fast algorithm to compute an approximation to the median between two strings of characters representing a 2D shape and its application to a new classification scheme to decrease its error rate. The median string results from the application of certain edit operations from the minimum cost edit sequence to one of the original strings. The new dataset editing scheme relaxes the criterion to delete instances proposed by the Wilson Editing Proce- dure. In practice, not all instances misclassified by its near neighbors are pruned. Instead, an artificial instance is added to the dataset expecting to successfully classify the instance on the future. The new artificial instance is the median from the misclassified sample and its same-class nearest neighbor. The experiments over two widely used datasets of handwritten characters show this preprocessing scheme can reduce the classification error in about 78% of trials.
author = "J. R. Rico-Juan; J. I Abreu",
title = "A new editing scheme based on a fast two-string median computation applied to OCR",
address = "Cesme, Izmir, Turkey",
booktitle = "Structural, Syntactic, and Statistical Pattern Recognition",
editor = "E. R. Hancok; R. C. Wilson; T. W. Ilkay; F. Escolano",
isbn = "978-3-642-14979-5",
month = "aug",
pages = "748--756",
publisher = "Springer",
year = "2010"