Text line extraction from multi-skewed handwritten documents
作者:
Highlights:
•
摘要
A novel text line extraction technique is presented for multi-skewed document images of handwritten English or Bengali text. It assumes that hypothetical water flows, from both left and right sides of the image frame, face obstruction from characters of text lines. The stripes of areas left unwetted on the image frame are finally labelled for extraction of text lines. The success rate of the technique, as observed experimentally, are 90.34% and 91.44% for handwritten Bengali and English document images, respectively. The work may contribute significantly for the development of applications related to optical character recognition of Bengali/English text.
论文关键词:OCR,Multi-skewed documents,Text line extraction,Connected component labelling,Skew angle detection,Touching line segmentation
论文评审过程:Received 8 November 2005, Revised 18 August 2006, Accepted 2 October 2006, Available online 30 November 2006.
论文官网地址:https://doi.org/10.1016/j.patcog.2006.10.002