Word retrieval in Ottoman documents [Osmanlica belgelerde keli̇me eri̇şi̇mi̇]
2011 IEEE 19th Signal Processing and Communications Applications Conference, SIU 2011
526 - 529
MetadataShow full item record
Please cite this item using this persistent URLhttp://hdl.handle.net/11693/28383
In this paper, two image matching methods are adapted to retrieve words in Ottoman documents. The first method is based on Dynamic Time Warping (DTW) method proposed in , while the second method is based on the Shape Context descriptor . Firstly, all sub-words in a given Ottoman document are extracted. In the first method, a 4-variant feature vector (upper and lower word profiles, background to ink transition, vertical projection) is calculated for each subword and feature vectors' distance to each other is found by DTW algorithm. In the second method, shape context descriptor is used to calculate the distances of sub-word images. The methods are tested on an Ottoman data set, which consists of 10 pages of Leyla and Mecnun Divan of Fuzuli. © 2011 IEEE.