Now showing items 1-16 of 16

    • Algorithms for within-cluster searches using inverted files 

      Altıngövde, İsmail Şengör; Can, Fazlı; Ulusoy, Özgür (Springer, 2006-11)
      Information retrieval over clustered document collections has two successive stages: first identifying the best-clusters and then the best-documents in these clusters that are most similar to the user query. In this paper, ...
    • An archiving model for a hierarchical information storage environment 

      Moinzadeh, K.; Berk, E. (Elsevier, 2000)
      We consider an archiving model for a database consisting of secondary and tertiary storage devices in which the query rate for a record declines as it ages. We propose a `dynamic' archiving policy based on the number of ...
    • Automatic Ranking of Retrieval Systems in Imperfect Environments 

      Nuray, Rabia; Can, Fazlı (ACM, 2003-07-08)
      The empirical investigation of the effectiveness of information retrieval (IR) systems requires a test collection, a set of query topics, and a set of relevance judgments made by human assessors for each query. Previous ...
    • First large-scale information retrieval experiments on Turkish texts 

      Can, Fazlı; Koçberber, Seyit; Balcık, Erman; Kaynak, Cihan; Öcalan, H. Çağdaş; Vursavaş, Onur M. (ACM, 2006-08)
      We present the results of the first large-scale Turkish information retrieval experiments performed on a TREC-like test collection. The test bed, which has been created for this study, contains 95.5 million words, 408,305 ...
    • A graph based approach to estimating lexical cohesion 

      Gürkök, Hayrettin; Karamuftuoglu, Murat; Schaal, Markus (ACM, 2008)
      Traditionally, information retrieval systems rank documents according to the query terms they contain. However, even if a document may contain all query terms, this does not guarantee that it is relevant to the query. The ...
    • Incremental cluster-based retrieval using compressed cluster-skipping inverted files 

      Altingovde, I. S.; Demir, E.; Can, F.; Ulusoy, Özgür (Association for Computing Machinery, 2008-06)
      We propose a unique cluster-based retrieval (CBR) strategy using a new cluster-skipping inverted file for improving query processing efficiency. The new inverted file incorporates cluster membership and centroid information ...
    • Novelty detection for topic tracking 

      Aksoy, C.; Can, F.; Kocberber, S. (John Wiley & Sons, Inc., 2012)
      Multisource web news portals provide various advantages such as richness in news content and an opportunity to follow developments from different perspectives. However, in such environments, news variety and quantity can ...
    • Ordinal evaluation and assignment problems 

      Atmaca, Abdullah; Oruç, A. Yavuz (IEEE, 2010)
      In many assignment problems, a set of documents such as research proposals, promotion dossiers, resumes of job applicants is assigned to a set of experts for ordinal evaluation, ranking, and classification. A desirable ...
    • Osmanlı arşivleri içerik-bazlı sorgulama (İBS) sistemi 

      Altıngövde, İsmail Şengör; Şaykol, Ediz; Ulusoy, Özgür; Güdükbay, Uğur; Çetin, A. Enis; Göçmen, M. (2006-04)
      We propose a content-based retrieval (CBR) system for digital Ottoman archive documents. In this system, the symbols extracted from the documents are matched with the most similar one in the symbol library, which is created ...
    • Performance of query processing implementations in ranking-based text retrieval systems using inverted indices 

      Cambazoglu, B. B.; Aykanat, Cevdet (Elsevier Ltd, 2006-07)
      Similarity calculations and document ranking form the computationally expensive parts of query processing in ranking-based text retrieval. In this work, for these calculations, 11 alternative implementation techniques are ...
    • Space efficient caching of query results in search engines 

      Özcan, Rıfat; Altıngövde, İsmail Şengör; Ulusoy Özgür (IEEE, 2008-10)
      Web search engines serve millions of query requests per day. Caching query results is one of the most crucial mechanisms to cope with such a demanding load. In this paper, we propose an efficient storage model to cache ...
    • Static index pruning in web search engines: combining term and document popularities with query views 

      Altingovde, I. S.; Ozcan, R.; Ulusoy, O. (Association for Computing Machinery, 2012)
      Static index pruning techniques permanently remove a presumably redundant part of an inverted file, to reduce the file size and query processing time. These techniques differ in deciding which parts of an index can be ...
    • Subband coding of binary textual images for document retrieval 

      Gerek, Ömer N.; Çetin, A. Enis; Tevfik, A. H. (IEEE, 1996)
      Efficient compression of binary textual images is very important for applications such as document archiving and retrieval, digital libraries and facsimile. The basic property of a textual image is the repetitions of small ...
    • Topic tracking using chronological term ranking 

      Acun, Bilge; Başpınar, Alper; Oǧuz, Ekin; Saraç, M.İlker; Can, Fazlı (2013-10)
      Topic tracking (TT) is an important component of topic detection and tracking (TDT) applications. TT algorithms aim to determine all subsequent stories of a certain topic based on a small number of initial sample stories. ...
    • Towards auto-documentary: Tracking the evolution of news stories 

      Duygulu, Pınar; Pan J.-Y.; Forsyth, D.A. (ACM, 2004)
      News videos constitute an important source of information for tracking and documenting important events. In these videos, news stories are often accompanied by short video shots that tend to be repeated during the course ...
    • Turkish keyphrase extraction using multi-criterion ranking 

      Özdemir, Bahadır; Çiçekli, İlyas (IEEE, 2009-09)
      Keyphrases have been extensively used for indexing and searching in databases and information retrieval systems. In addition, they provide useful information about semantic content of a document. In this paper, we propose ...