A hybrid approach for line segmentation in handwritten documents
Author
Adıgüzel, Hande
Şahin, Emre
Duygulu, Pınar
Date
2012
Source Title
2012 International Conference on Frontiers in Handwriting Recognition
Print ISSN
1550-5235
Pages
503 - 508
Language
English
Type
Conference Paper
Abstract
This paper presents an approach to text line
segmentation that combines connected-component-based
and projection-based information to exploit the
strengths of both methods. The proposed system finds
the baseline of each connected component. Lines are
detected by using projection information to group the
baselines of the connected components that belong to
the same line. Components are then assigned to lines
using different distance metrics, depending on their
size. This is one of the few studies that apply line
segmentation to Ottoman documents. Further, it
proposes a new method, Fourier curve fitting, to
detect the peaks in a projection profile. The
algorithm is demonstrated on several printed and
handwritten Ottoman datasets. Results show that the
method segments lines in both printed and handwritten
documents, under different writing conditions, with
at least 92% accuracy.
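
The peak-detection idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes that "Fourier curve fitting" means fitting a truncated Fourier series to the horizontal projection profile and reading text-line positions from the local maxima of the fitted curve. The function names, the `harmonics` parameter, and the toy image are all hypothetical.

```python
import math


def projection_profile(image):
    """Count ink pixels (1 = ink, 0 = background) in each row of a binary image."""
    return [sum(row) for row in image]


def fourier_fit(profile, harmonics):
    """Evaluate a truncated Fourier series fitted to a uniformly sampled profile.

    Assumption: for uniform samples and harmonics < len(profile) / 2, the
    least-squares coefficients are the plain discrete Fourier sums used here.
    """
    n = len(profile)
    mean = sum(profile) / n
    coeffs = []
    for k in range(1, harmonics + 1):
        a = 2.0 / n * sum(y * math.cos(2 * math.pi * k * i / n)
                          for i, y in enumerate(profile))
        b = 2.0 / n * sum(y * math.sin(2 * math.pi * k * i / n)
                          for i, y in enumerate(profile))
        coeffs.append((a, b))

    fitted = []
    for i in range(n):
        value = mean
        for k, (a, b) in enumerate(coeffs, start=1):
            angle = 2 * math.pi * k * i / n
            value += a * math.cos(angle) + b * math.sin(angle)
        fitted.append(value)
    return fitted


def find_peaks(curve):
    """Return the interior indices where the curve has a local maximum."""
    return [i for i in range(1, len(curve) - 1)
            if curve[i - 1] < curve[i] >= curve[i + 1]]


# Toy page (hypothetical data): 30 rows, 20 columns, three 3-row bands of ink.
ink_rows = {3, 4, 5, 13, 14, 15, 23, 24, 25}
page = [[1] * 20 if r in ink_rows else [0] * 20 for r in range(30)]

profile = projection_profile(page)
smooth = fourier_fit(profile, harmonics=4)
line_rows = find_peaks(smooth)
print(line_rows)
```

Keeping only a few harmonics acts as a low-pass filter, so small jagged maxima in the raw profile do not register as separate lines. The number of harmonics is a tuning choice in this sketch, not a value taken from the paper.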
Keywords
Connected component
Distance metrics
Fourier
Handwritten document
Hybrid approach
Line segmentation
Projection profile
Text line segmentation
Curve fitting