Printed character preclassification based on word structure
作者:
Highlights:
•
摘要
In today's office environment, electronic representation of information, intended as digital representation of a document, has been raised to the part of a protagonist. This paper advances a mixed topological/statistical approach to printed character preclassification, a way of separating a character set into disjointed categories. What is suggested is strictly related to the examination of particular character aggregates, for extracting special features in order to establish the membership of a character to one of seven categories. An improvement of correct recognition rate can be obtained by means of such an approach to optical character recognition.
论文关键词:Document segmentation,Text row analysis,Word structure,Character preclassification,Recognition accuracy
论文评审过程:Received 3 January 1991, Revised 12 December 1991, Available online 19 May 2003.
论文官网地址:https://doi.org/10.1016/0031-3203(91)90028-4