Integrated segmentation and recognition of connected Ottoman script

buir.advisorUlusoy, Özgür
dc.contributor.authorYalnız, İsmet Zeki
dc.date.accessioned2016-01-08T18:06:40Z
dc.date.available2016-01-08T18:06:40Z
dc.date.issued2008
dc.departmentDepartment of Computer Engineeringen_US
dc.descriptionAnkara : The Department of Computer Engineering and the Institute of Engineering and Science of Bilkent University, 2008.en_US
dc.descriptionThesis (Master's) -- Bilkent University, 2008.en_US
dc.descriptionIncludes bibliographical references leaves 43-45.en_US
dc.description.abstractIn this thesis, a novel context-sensitive segmentation and recognition method for connected letters in Ottoman script is proposed. This method first extracts a set of possible segments from a connected script and determines the candidate letters to which extracted segments are most similar. Next, a function is defined for scoring each different syntactically correct sequence of these candidate letters. To find the candidate letter sequence that maximizes the score function, a directed acyclic graph is constructed. The letters are finally recognized by computing the longest path in this graph. Experiments using a collection of printed Ottoman documents reveal that the proposed method provides very high precision and recall figures in terms of character recognition. In a further set of experiments we also demonstrate that the framework can be used as a building block for an information retrieval system for digital Ottoman archives.en_US
dc.description.degreeM.S.en_US
dc.description.statementofresponsibilityYalnız, İsmet Zekien_US
dc.format.extentxii, 52 leavesen_US
dc.identifier.itemidBILKUTUPB109234
dc.identifier.urihttp://hdl.handle.net/11693/14738
dc.language.isoEnglishen_US
dc.publisherBilkent Universityen_US
dc.rightsinfo:eu-repo/semantics/openAccessen_US
dc.subjectOptical character recognition (OCR)en_US
dc.subjectSegmentation and recognition of connected scriptsen_US
dc.subjectConnected scriptsen_US
dc.subjectInformation retrieval (IR)en_US
dc.subject.lccTA1640 .Y34 2008en_US
dc.subject.lcshOptical character recognition devices.en_US
dc.subject.lcshWriting--Identification--Data processing.en_US
dc.titleIntegrated segmentation and recognition of connected Ottoman scripten_US
dc.typeThesisen_US

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
0003601.pdf
Size:
30.34 MB
Format:
Adobe Portable Document Format