Osmanlıca kelimeleri eşleme

Ataer, Esra; Duygulu, Pınar

Osmanlıca kelimeleri eşleme

dc.contributor.author	Ataer, Esra	en_US
dc.contributor.author	Duygulu, Pınar	en_US
dc.coverage.spatial	Eskisehir, Turkey
dc.date.accessioned	2016-02-08T11:39:55Z
dc.date.available	2016-02-08T11:39:55Z
dc.date.issued	2007-06	en_US
dc.department	Department of Computer Engineering	en_US
dc.description	Date of Conference: 11-13 June 2007
dc.description	Conference name: IEEE 15th Signal Processing and Communications Applications, 2007
dc.description.abstract	Osmanlı arşivleri dünyanın pek çok yerinden araştırmacının ilgi alanına girmektedir. Fakat bu belgelerin elle çevirisi zor bir iş olduğu için, bu arşivler kullanılamaz durumdadır. Otomatik çeviri gerekmektedir, fakat Osmanlıca’nın yazma özelliklerinden dolayı karakter tabanlı tanıma sistemleri istenen başarıyı gösterememektedir. Ayrıca, belgeler minyatür ve tuğra gibi önemli kısımlar içerdiği için, imge formatında saklanmaları gerekmektedir. Bu nedenle, bu çalışmada Osmanlıca kelimeleri imge olarak görerek probleme imge erişim problemi olarak yaklaşıldı ve kelime eşleme tekniği üzerine bir çözüm önerisinde bulunuldu. Nesne tanımada başarılı olan görsel öğeler kümesi (bag-of-visterms) tekniği kelime eşleme işlemine uyarlandı ve böylece her kelime imgesi taç noktalarından çıkarılan SIFT özelliklerinin ¨ vektor¨ nicemlemesiyle sembolize edildi. Benzer kelimeler görsel ögelerin dağılımına göre eşlendi. Deneyler 10,000 kelimenin üzerindeki matbu ve elyazması belge üzerinde yapıldı. Sonuçlar sistemin benzer kelimeleri yüksek doğrulukla eşlediğini ve anlamsal benzerlikleri bulduğunu gösteriyor Large archives of Ottoman documents are challenging to many historians all over the world. However, these archives remain inaccessible since manual transcription of such a huge volume is difficult. Automatic transcription is required, but due to the characteristics of Ottoman documents, character recognition based systems may not yield satisfactory results. It is also desirable to store the documents in image form since the documents may contain important drawings, especially the signatures. Due to these reasons, in this study we treat the problem as an image retrieval problem with the view that Ottoman words are images, and we propose a solution based on image matching techniques. The bag-of-visterms approach, which is shown to be successful to classify objects and scenes, is adapted for matching word images. Each word image is represented by a set of visual terms which are obtained by vector quantization of SIFT descriptors extracted from salient points. Similar words are then matched based on the similarity of the distributions of the visual terms. The experiments are carried out on printed and handwritten documents which included over 10,000 words. The results show that, the proposed system is able to retrieve words with high accuracies, and capture the semantic similarities between words.	en_US
dc.identifier.doi	10.1109/SIU.2007.4298650	en_US
dc.identifier.uri	http://hdl.handle.net/11693/26932	en_US
dc.language.iso	Turkish	en_US
dc.publisher	IEEE	en_US
dc.relation.isversionof	http://dx.doi.org/10.1109/SIU.2007.4298650	en_US
dc.source.title	IEEE 15th Signal Processing and Communications Applications, SIU 2007	en_US
dc.subject	All over the world	en_US
dc.subject	Automatic transcription	en_US
dc.subject	Handwritten documents	en_US
dc.subject	Salient points	en_US
dc.subject	SIFT descriptors	en_US
dc.subject	Word images	en_US
dc.subject	Character recognition	en_US
dc.subject	Image enhancement	en_US
dc.subject	Image matching	en_US
dc.subject	Image retrieval	en_US
dc.subject	Information theory	en_US
dc.subject	Natural language processing systems	en_US
dc.subject	Photocopying	en_US
dc.subject	Signal processing	en_US
dc.subject	Transcription	en_US
dc.subject	Vector quantization	en_US
dc.title	Osmanlıca kelimeleri eşleme	en_US
dc.title.alternative	Matching Ottoman words	en_US
dc.type	Conference Paper	en_US

Files

Original bundle

Now showing 1 - 1 of 1

Name:: Matching Ottoman words [Osmanlica kelimeleri eşleme].pdf
Size:: 691.18 KB
Format:: Adobe Portable Document Format
Description:: Full printable version

Download

Collections

Scholarly Publications - Computer Engineering