Processing the manuscripts of Atatürk
Can, E. F.
SIU 2010 - IEEE 18th Signal Processing and Communications Applications Conference
882 - 885
As a first step toward convenient access to the manuscripts of Atatürk through a word-based search engine, this paper studies the preprocessing of digitized documents and their segmentation into lines and words. Techniques designed for printed documents may not yield satisfactory results on handwritten manuscripts. For this reason, more advanced techniques are applied: a technique based on the Hough transform for line segmentation, and a technique that accounts for the skew of lines for word segmentation. Experiments on 30 pages of documents provided by Afet İnan yield highly accurate results that are promising for future research. ©2010 IEEE.
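The abstract does not give the details of the line segmentation step, so the following is only a minimal sketch of a generic, textbook Hough transform used to locate near-horizontal text lines in a binarized page. It is not the authors' implementation. The function names (`hough_accumulator`, `detect_lines`), the angle range, and the `min_rho_gap` suppression window are all assumptions made for illustration.

```python
import numpy as np


def hough_accumulator(binary, thetas_deg):
    """Vote in (rho, theta) space for every foreground pixel.

    A line is parameterized as rho = x*cos(theta) + y*sin(theta),
    so a horizontal text line at row c has theta = 90 degrees, rho = c.
    """
    ys, xs = np.nonzero(binary)
    thetas = np.deg2rad(thetas_deg)
    diag = int(np.ceil(np.hypot(*binary.shape)))
    # Shift rho by diag so that every index is non-negative.
    rhos = np.rint(
        xs[:, None] * np.cos(thetas) + ys[:, None] * np.sin(thetas)
    ).astype(int) + diag
    acc = np.zeros((2 * diag + 1, len(thetas)), dtype=int)
    for j in range(len(thetas)):
        acc[:, j] = np.bincount(rhos[:, j], minlength=2 * diag + 1)
    return acc, diag


def detect_lines(binary, thetas_deg, num_lines, min_rho_gap=5):
    """Return up to num_lines (rho, theta_deg) peaks, sorted by rho.

    After each peak is taken, nearby rho bins are cleared (a crude
    non-maximum suppression; min_rho_gap is a hypothetical parameter).
    """
    acc, diag = hough_accumulator(binary, thetas_deg)
    lines = []
    for _ in range(num_lines):
        r, t = np.unravel_index(np.argmax(acc), acc.shape)
        if acc[r, t] == 0:
            break  # no votes left
        lines.append((int(r - diag), float(thetas_deg[t])))
        acc[max(0, r - min_rho_gap): r + min_rho_gap + 1, :] = 0
    return sorted(lines)


# Synthetic "page": two perfectly horizontal strokes at rows 10 and 30.
page = np.zeros((40, 60), dtype=np.uint8)
page[10, :] = 1
page[30, :] = 1
found = detect_lines(page, np.arange(85, 96), num_lines=2)
print(found)
```

On real manuscript pages the peaks would generally fall at angles other than 90 degrees, and the offset from 90 degrees gives an estimate of each line's skew, which is the kind of information a skew-aware word segmentation step could then use.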