Integrated segmentation and recognition of connected Ottoman script
buir.advisor | Ulusoy, Özgür | |
dc.contributor.author | Yalnız, İsmet Zeki | |
dc.date.accessioned | 2016-01-08T18:06:40Z | |
dc.date.available | 2016-01-08T18:06:40Z | |
dc.date.issued | 2008 | |
dc.description | Cataloged from PDF version of article. | en_US |
dc.description | Includes bibliographical references leaves 43-45. | en_US |
dc.description.abstract | In this thesis, a novel context-sensitive segmentation and recognition method for connected letters in Ottoman script is proposed. This method first extracts a set of possible segments from a connected script and determines the candidate letters to which extracted segments are most similar. Next, a function is defined for scoring each different syntactically correct sequence of these candidate letters. To find the candidate letter sequence that maximizes the score function, a directed acyclic graph is constructed. The letters are finally recognized by computing the longest path in this graph. Experiments using a collection of printed Ottoman documents reveal that the proposed method provides very high precision and recall figures in terms of character recognition. In a further set of experiments we also demonstrate that the framework can be used as a building block for an information retrieval system for digital Ottoman archives. | en_US |
dc.description.provenance | Made available in DSpace on 2016-01-08T18:06:40Z (GMT). No. of bitstreams: 1 0003601.pdf: 31817635 bytes, checksum: 72d0106b7ad39cd018b0e0fe4d1be1ab (MD5) | en |
dc.description.statementofresponsibility | Yalnız, İsmet Zeki | en_US |
dc.format.extent | xii, 52 leaves | en_US |
dc.identifier.itemid | BILKUTUPB109234 | |
dc.identifier.uri | http://hdl.handle.net/11693/14738 | |
dc.language.iso | English | en_US |
dc.rights | info:eu-repo/semantics/openAccess | en_US |
dc.subject | Optical character recognition (OCR) | en_US |
dc.subject | Segmentation and recognition of connected scripts | en_US |
dc.subject | Connected scripts | en_US |
dc.subject | Information retrieval (IR) | en_US |
dc.subject.lcc | TA1640 .Y34 2008 | en_US |
dc.subject.lcsh | Optical character recognition devices. | en_US |
dc.subject.lcsh | Writing--Identification--Data processing. | en_US |
dc.title | Integrated segmentation and recognition of connected Ottoman script | en_US |
dc.type | Thesis | en_US |
thesis.degree.discipline | Computer Engineering | |
thesis.degree.grantor | Bilkent University | |
thesis.degree.level | Master's | |
thesis.degree.name | MS (Master of Science) |
Files
Original bundle
1 - 1 of 1