Two-Stage Rejection Algorithm to Reduce Search Space for Character Recognition in OCR

Published: December 26, 2012
Srivardhini Mandipati, Gottumukkala Asisha, Preethi Raj S, and Chitrakala S

Department of Computer Science and Engineering, Easwari Engineering College, Chennai, India

Abstract. Optical Character Recognition (OCR) converts text in images into a form that a computer can manipulate. The need for faster OCRs stems from the abundance of such text. This paper presents a Two-Stage Rejection Algorithm for reducing the search space of an OCR. It is understood that reducing the search space speeds up an OCR. Preprocessing operations are applied to the input, and features are extracted from it. The resulting feature vectors are clustered, and the Two-Stage Rejection Algorithm is applied for character recognition. With about the same character recognition rate as other OCRs, an OCR reinforced with the Two-Stage Rejection Algorithm is considerably faster.

Keywords: Optical Character Recognition, Feature Extraction, K-means.

1 Introduction

Optical character recognition has been an active area of research for many decades. The potential of OCRs to simplify data entry adds value to research in this area. OCRs use various pattern matching techniques for character recognition, and most rely on classifiers such as SVMs or neural networks. The training process for these classifiers is time-consuming. Moreover, as the number of classes increases, the number of comparisons increases, and consequently so does the time taken for character recognition. Hence, such OCRs cannot be easily extended to recognize characters from additional languages. The proposed system uses a structural approach, as opposed to a statistical approach, for feature extraction. The strength of the structural method over the statistical one is that it represents a pattern in a way similar to how humans perceive it. The structural features help retain the local shape description of the characters. As in all other OCRs, every input image undergoes preprocessing. Additionally, the dataset is clustered and a Two-Stage Rejection Algorithm is applied to it to reduce the search space for character recognition. A considerable increase in performance was observed during experimentation.

2 Related Works

Numerous works have been carried out in the field of OCR. When an OCR is extended to recognize characters from multiple languages, the dataset grows, which considerably increases the number of comparisons required to recognize a character. This is all the more true when a single document contains characters from different languages. In our paper, we focus on the reduction of the search space for character recognition. This is done by clustering the training dataset and reordering the clustering. Weijie Su and Xin Jin [1] propose a hidden Markov model (HMM) with parameter-optimized K-means clustering for handwritten character recognition. They improve K-means clustering by considering the influence of neighboring pixels and the different weights of pixels in different places. This model aims at improving the average accuracy of HMM with K-means clustering for handwritten character recognition. Karthik Sheshadri et al. [2] address the problem of Kannada character recognition and propose a recognition mechanism based on K-means clustering. They propose a segmentation technique that decomposes each character into components from 3 base classes, thus reducing the magnitude of the problem. They also use probabilistic and geometric seeding as heuristics to ensure uniformity of the centroids from the extracted character with the centroids in the training database. Mu-King Tsay, Keh-Hwa Shyu, and Pao-Chung Chang [3] designed a feature transformation module to extract discriminative features from the input scanned document to enhance the recognition performance. The initial feature transformation matrix is...
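
The general idea of reducing the search space by clustering the training dataset can be illustrated with a minimal Python sketch. This is not the Two-Stage Rejection Algorithm itself, whose details are not given in this excerpt; it is a generic cluster-then-search scheme under stated assumptions. The 2-D toy feature vectors, the class labels "A", "B", "C", and the helper names dist, kmeans, and recognize are all hypothetical and chosen only for illustration.

```python
import math
import random

def dist(a, b):
    """Euclidean distance between two equal-length feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def nearest(p, centroids):
    """Index of the centroid closest to p."""
    return min(range(len(centroids)), key=lambda c: dist(p, centroids[c]))

def kmeans(points, k, iters=20, seed=0):
    """Plain Lloyd's k-means; returns (centroids, cluster index per point)."""
    rng = random.Random(seed)
    centroids = rng.sample(points, k)  # start from k distinct training points
    for _ in range(iters):
        assign = [nearest(p, centroids) for p in points]
        for c in range(k):
            members = [p for p, a in zip(points, assign) if a == c]
            if members:  # keep the old centroid if a cluster empties
                centroids[c] = tuple(sum(col) / len(members) for col in zip(*members))
    return centroids, [nearest(p, centroids) for p in points]

def recognize(query, points, labels, centroids, assign):
    """Return (label, comparisons), searching only the nearest cluster."""
    best_c = nearest(query, centroids)  # one comparison per centroid
    candidates = [i for i, a in enumerate(assign) if a == best_c]
    if not candidates:  # defensive fallback: search everything
        candidates = list(range(len(points)))
    best_i = min(candidates, key=lambda i: dist(query, points[i]))
    return labels[best_i], len(centroids) + len(candidates)

# Hypothetical training set: three character classes, 30 samples each.
rng = random.Random(1)
centers = {"A": (0.0, 0.0), "B": (10.0, 0.0), "C": (0.0, 10.0)}
points, labels = [], []
for name, (cx, cy) in centers.items():
    for _ in range(30):
        points.append((cx + rng.uniform(-1, 1), cy + rng.uniform(-1, 1)))
        labels.append(name)

centroids, assign = kmeans(points, k=3)
label, comparisons = recognize((9.5, 0.4), points, labels, centroids, assign)
print(label, comparisons, "comparisons instead of", len(points))
```

In this sketch a query is compared against the k centroids and then only against the members of the closest cluster, instead of against every stored feature vector. The trade-off is that a query lying near a cluster boundary may be routed to the wrong cluster, which is one reason a practical system would need additional rejection or verification steps.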