    • Algorithms for within-cluster searches using inverted files 

      Altıngövde, İsmail Şengör; Can, Fazlı; Ulusoy, Özgür (Springer, 2006-11)
      Information retrieval over clustered document collections has two successive stages: first identifying the best-clusters and then the best-documents in these clusters that are most similar to the user query. In this paper, ...
    • Automatic categorization of Ottoman literary texts by poet and time period 

      Can, Ethem F.; Can, Fazlı; Duygulu, Pınar; Kalpaklı, Mehmet (Springer, London, 2012)
      Millions of manuscripts and printed texts are available in the Ottoman language. The automatic categorization of Ottoman texts would make these documents much more accessible in various applications ranging from historical ...
    • Automatic Ranking of Retrieval Systems in Imperfect Environments 

      Nuray, Rabia; Can, Fazlı (ACM, 2003-07-08)
      The empirical investigation of the effectiveness of information retrieval (IR) systems requires a test collection, a set of query topics, and a set of relevance judgments made by human assessors for each query. Previous ...
    • Bilkent news portal: A personalizable system with new event detection and tracking capabilities 

      Can, Fazlı; Koçberber, Seyit; Bağlıoğlu, Özgür; Kardaş, Süleyman; Öcalan, Hüseyin Çağdaş; Uyar, Erkan (ACM, 2008)
    • Chat mining for gender prediction 

      Küçükyılmaz, Tayfun; Cambazoğlu, B. Barla; Aykanat, Cevdet; Can, Fazlı (Springer, 2006-10)
      The aim of this paper is to investigate the feasibility of predicting the gender of a text document's author using linguistic evidence. For this purpose, term- and style-based classification techniques are evaluated over ...
    • CoDet: Sentence-based containment detection in news corpora 

      Varol, Emre; Can, Fazlı; Aykanat, Cevdet; Kaya, Oğuz (ACM, 2011)
      We study a generalized version of the near-duplicate detection problem which concerns whether a document is a subset of another document. In text-based applications, document containment can be observed in exact-duplicates, ...
    • Compressed multi-framed signature files: an index structure for fast information retrieval 

      Koçberber, Seyit; Can, Fazlı (ACM, 1999-02-03)
      A new indexing method, called Compressed Multi-Framed Signature File (C-MFSF), that uses a partial query evaluation strategy with compressed signature bit slices is presented. In C-MFSF, a signature file is divided into ...
    • A content-based social network study of Evliyâ Çelebi's Seyahatnâme-Bitlis section 

      Karbeyaz, Ceyhun; Can, Ethem F.; Can, Fazlı; Kalpaklı, Mehmet (Springer, London, 2012)
      Evliyâ Çelebi, an Ottoman writer, scholar and world traveler, visited most of the territories and also some of the neighboring countries of the Ottoman Empire in the seventeenth century. He took notes about his trips and ...
    • Cover coefficient-based multi-document summarization 

      Ercan, Gönenç; Can, Fazlı (Springer, 2009-04)
      In this paper we present a generic, language independent multi-document summarization system forming extracts using the cover coefficient concept. Cover Coefficient-based Summarizer (CCS) uses similarity between sentences ...
    • Determining translation invariant characteristics of James Joyce’s Dubliners 

      Patton, J. M.; Can, Fazlı (John Benjamins Publishing Company, 2012)
      We provide a comparative stylometric analysis of the Dubliners stories of James Joyce by using its original and Murat Belge’s Turkish translation. We divide the stories into four categories as suggested by Belge and ...
    • Developing a text categorization template for Turkish news portals 

      Toraman, Çağrı; Can, Fazlı; Koçberber, Seyit (IEEE, 2011)
      In news portals, text category information is needed for news presentation. However, for many news stories the category information is unavailable, incorrectly assigned or too generic. This makes the text categorization a ...
    • Diversity and novelty in information retrieval 

      Santos, R. L. T.; Castells, P.; Altıngövde, I. S.; Can, Fazlı (ACM, 2013-07-08)
      This tutorial aims to provide a unifying account of current research on diversity and novelty in different IR domains, namely, in the context of search engines, recommender systems, and data streams.
    • Diversity and novelty in web search, recommender systems and data streams 

      Santos, R. L. T.; Castells, P.; Altıngövde, I. S.; Can, Fazlı (Association for Computing Machinery, 2014-02)
      This tutorial aims to provide a unifying account of current research on diversity and novelty in the domains of web search, recommender systems, and data stream processing.
    • Efficient processing of category-restricted queries for web directories 

      Altıngövde, İsmail Şengör; Can, Fazlı; Ulusoy, Özgür (Springer, 2008-03-04)
      We show that a cluster-skipping inverted index (CS-IIS) is a practical and efficient file structure to support category-restricted queries for searching Web directories. The query processing strategy with CS-IIS improves ...
    • Ensemble pruning for text categorization based on data partitioning 

      Toraman, Çağrı; Can, Fazlı (Springer, Berlin, Heidelberg, 2011)
      Ensemble methods can improve the effectiveness in text categorization. Due to computation cost of ensemble approaches there is a need for pruning ensembles. In this work we study ensemble pruning based on data partitioning. ...
    • First large-scale information retrieval experiments on Turkish texts 

      Can, Fazlı; Koçberber, Seyit; Balcık, Erman; Kaynak, Cihan; Öcalan, H. Çağdaş; Vursavaş, Onur M. (ACM, 2006-08)
      We present the results of the first large-scale Turkish information retrieval experiments performed on a TREC-like test collection. The test bed, which has been created for this study, contains 95.5 million words, 408,305 ...
    • GOOWE: geometrically optimum and online-weighted ensemble classifier for evolving data streams 

      Bonab, H. R.; Can, Fazlı (Association for Computing Machinery, 2018-01-25)
      Designing adaptive classifiers for an evolving data stream is a challenging task due to the data size and its dynamically changing nature. Combining individual classifiers in an online setting, the ensemble approach, is a ...
    • Large-scale cluster-based retrieval experiments on Turkish texts 

      Altıngövde, İsmail Şengör; Özcan, Rıfat; Öcalan, Hüseyin C.; Can, Fazlı; Ulusoy, Özgür (ACM, 2007)
      We present cluster-based retrieval (CBR) experiments on the largest available Turkish document collection. Our experiments evaluate retrieval effectiveness and efficiency on both an automatically generated clustering ...
    • Less is more: a comprehensive framework for the number of components of ensemble classifiers 

      Bonab, H.; Can, Fazlı (IEEE, 2019)
      The number of component classifiers chosen for an ensemble greatly impacts the prediction ability. In this paper, we use a geometric framework for a priori determining the ensemble size, which is applicable to most of the ...
    • A new approach to search result clustering and labeling 

      Türel, Anıl; Can, Fazlı (Springer, Berlin, Heidelberg, 2011)
      Search engines present query results as a long ordered list of web snippets divided into several pages. Post-processing of retrieval results for easier access of desired information is an important research problem. In ...