• About
  • Policies
  • What is openaccess
  • Library
  • Contact
Advanced search
      View Item 
      •   BUIR Home
      • Scholarly Publications
      • Faculty of Engineering
      • Department of Computer Engineering
      • View Item
      •   BUIR Home
      • Scholarly Publications
      • Faculty of Engineering
      • Department of Computer Engineering
      • View Item
      JavaScript is disabled for your browser. Some features of this site may not work without it.

      Matching ottoman words: an image retrieval approach to historical document indexing

      Thumbnail
      View / Download
      612.7 Kb
      Author
      Ataer, Esra
      Duygulu, Pınar
      Date
      2007-07
      Source Title
      Proceedings of the 6th ACM International Conference on Image and Video Retrieval, CIVR 2007
      Publisher
      ACM
      Pages
      341 - 347
      Language
      English
      Type
      Conference Paper
      Item Usage Stats
      109
      views
      115
      downloads
      Abstract
      Large archives of Ottoman documents are challenging to many historians all over the world. However, these archives remain inaccessible since manual transcription of such a huge volume is difficult. Automatic transcription is required, but due to the characteristics of Ottoman documents, character recognition based systems may not yield satisfactory results. It is also desirable to store the documents in image form since the documents may contain important drawings, especially the signatures. Due to these reasons, in this study we treat the problem as an image retrieval problem with the view that Ottoman words are images, and we propose a solution based on image matching techniques. The bag-of-visterms approach, which is shown to be successful to classify objects and scenes, is adapted for matching word images. Each word image is represented by a set of visual terms which are obtained by vector quantization of SIFT descriptors extracted from salient points. Similar words are then matched based on the similarity of the distributions of the visual terms. The experiments are carried out on printed and handwritten documents which included over 10,000 words. The results show that, the proposed system is able to retrieve words with high accuracies, and capture the semantic similarities between words. Copyright 2007 ACM.
      Keywords
      Bag-of-features
      Indexing
      Word-image matching
      Character recognition equipment
      Historic preservation
      Image matching
      Indexing (of information)
      Semantics
      Vector quantization
      Automatic transcription
      Historical document indexing
      Manual transcription
      Ottoman documents
      Recognition based systems
      Image retrieval
      Permalink
      http://hdl.handle.net/11693/26913
      Published Version (Please cite this version)
      http://dx.doi.org/10.1145/1282280.1282332
      Collections
      • Department of Computer Engineering 1368
      Show full item record

      Browse

      All of BUIRCommunities & CollectionsTitlesAuthorsAdvisorsBy Issue DateKeywordsTypeDepartmentsThis CollectionTitlesAuthorsAdvisorsBy Issue DateKeywordsTypeDepartments

      My Account

      Login

      Statistics

      View Usage StatisticsView Google Analytics Statistics

      Bilkent University

      If you have trouble accessing this page and need to request an alternate format, contact the site administrator. Phone: (312) 290 1771
      Copyright © Bilkent University - Library IT

      Contact Us | Send Feedback | Off-Campus Access | Admin | Privacy