Osmanlıca belgelerde kelime erişimi

Arifoğlu, Damla; Duygulu, Pınar

dc.citation.epage	529	en_US
dc.citation.spage	526	en_US
dc.contributor.author	Arifoğlu, Damla	en_US
dc.contributor.author	Duygulu, Pınar	en_US
dc.coverage.spatial	Antalya, Turkey
dc.date.accessioned	2016-02-08T12:19:12Z
dc.date.available	2016-02-08T12:19:12Z
dc.date.issued	2011-04	en_US
dc.department	Department of Computer Engineering	en_US
dc.description	Date of Conference: 20-22 April 2011
dc.description	Conference name: IEEE 19th Signal Processing and Communications Applications Conference, SIU 2011
dc.description.abstract	In this paper, two image matching methods are adapted to the word retrieval problem in order to analyze Ottoman archives. The first method is the Dynamic Time Warping (DTW) based word matching method proposed in [7]; the second is based on the Shape Context descriptor [10]. First, all sub-words in a given Ottoman document are extracted. In the first method, a four-part feature vector (upper word profile, lower word profile, number of transitions between ink and background pixels, and vertical projection) is computed for each sub-word, and the distances between these feature vectors are found with the DTW algorithm. In the second method, the Shape Context descriptor is used to compute the distances between sub-word images. Both methods are tested on an Ottoman data set consisting of 10 pages of Fuzuli's Leyla and Mecnun divan. © 2011 IEEE.	en_US
dc.identifier.doi	10.1109/SIU.2011.5929703	en_US
dc.identifier.uri	http://hdl.handle.net/11693/28383	en_US
dc.language.iso	Turkish	en_US
dc.publisher	IEEE	en_US
dc.relation.isversionof	https://doi.org/10.1109/SIU.2011.5929703	en_US
dc.source.title	IEEE 19th Signal Processing and Communications Applications Conference, SIU 2011	en_US
dc.subject	Data sets	en_US
dc.subject	Descriptors	en_US
dc.subject	Feature vectors	en_US
dc.subject	Matching methods	en_US
dc.subject	ON dynamics	en_US
dc.subject	Shape contexts	en_US
dc.subject	Subwords	en_US
dc.subject	Vertical projection	en_US
dc.subject	Word profiles	en_US
dc.subject	Word retrieval	en_US
dc.subject	Signal processing	en_US
dc.subject	Image matching	en_US
dc.title	Osmanlıca belgelerde kelime erişimi	en_US
dc.title.alternative	Word retrieval in Ottoman documents	en_US
dc.type	Conference Paper	en_US
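
The first method summarized in the abstract (four column-wise features per sub-word, compared with DTW) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `column_features` and `dtw_distance` are hypothetical, and the input format (a binary array with 1 for ink), the handling of empty columns, the squared Euclidean local cost, and the normalization by the summed sequence lengths are all assumptions made for illustration.

```python
import numpy as np


def column_features(img):
    """Return a (width, 4) feature array for a binary sub-word image.

    Assumption: img is a 2-D array with 1 = ink and 0 = background.
    Per column: upper profile, lower profile, number of
    background-to-ink transitions, and vertical projection.
    """
    img = np.asarray(img, dtype=np.uint8)
    h, w = img.shape
    has_ink = img.any(axis=0)
    # Distance from the top edge to the first ink pixel (h if the column is empty).
    upper = np.where(has_ink, img.argmax(axis=0), h)
    # Distance from the bottom edge to the last ink pixel (h if the column is empty).
    lower = np.where(has_ink, img[::-1].argmax(axis=0), h)
    # Count 0 -> 1 changes going down each column; a background row is
    # prepended so that ink touching the top edge counts as one transition.
    padded = np.vstack([np.zeros((1, w), dtype=np.int8), img.astype(np.int8)])
    transitions = (np.diff(padded, axis=0) == 1).sum(axis=0)
    # Vertical projection: number of ink pixels in each column.
    projection = img.sum(axis=0)
    feats = np.stack([upper, lower, transitions, projection], axis=1)
    # Dividing by the image height keeps the four features on comparable scales.
    return feats.astype(float) / h


def dtw_distance(a, b):
    """Dynamic Time Warping distance between two feature sequences.

    a and b have shapes (n, d) and (m, d). The local cost is the squared
    Euclidean distance; the total is divided by n + m (an assumed
    normalization) so that sub-words of different widths are comparable.
    """
    n, m = len(a), len(b)
    cost = np.full((n + 1, m + 1), np.inf)
    cost[0, 0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = np.sum((a[i - 1] - b[j - 1]) ** 2)
            cost[i, j] = d + min(cost[i - 1, j], cost[i, j - 1], cost[i - 1, j - 1])
    return cost[n, m] / (n + m)


# Toy usage: compare a tiny "sub-word" with a horizontally stretched copy.
query = np.array([[0, 1, 0],
                  [1, 1, 0],
                  [0, 1, 1]])
stretched = np.repeat(query, 2, axis=1)
score = dtw_distance(column_features(query), column_features(stretched))
```

In a retrieval setting, a query sub-word would be compared against every extracted sub-word and the candidates ranked by increasing distance. Because DTW may align one column with several columns of the other image, it tolerates the horizontal stretching that is common in handwriting.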

Files

Original bundle

Name: Osmanlıca_belgelerde_kelime_erişimi.pdf
Size: 1.23 MB
Format: Adobe Portable Document Format

Collections

Scholarly Publications - Computer Engineering