    • Automatic categorization of Ottoman literary texts by poet and time period

      Can, E. F.; Can, F.; Duygulu, P.; Kalpakli, M. (2012)
      Millions of manuscripts and printed texts are available in the Ottoman language. The automatic categorization of Ottoman texts would make these documents much more accessible in various applications ranging from historical ...
    • Chat mining for gender prediction 

      Kucukyilmaz, T.; Cambazoglu, B. B.; Aykanat, C.; Can, F. (Springer, 2006)
      The aim of this paper is to investigate the feasibility of predicting the gender of a text document's author using linguistic evidence. For this purpose, term- and style-based classification techniques are evaluated over ...
    • Çağrı merkezi metin madenciliği yaklaşımı [Call center text mining approach]

      Yigit, I. O.; Ates, A. F.; Guvercin, M.; Ferhatosmanoglu, H.; Gedik, B. (Institute of Electrical and Electronics Engineers Inc., 2017)
      Today, the ability to convert call center conversation recordings from speech to text makes it possible to apply text mining methods to the transcripts of these recordings. In this study, conversation ...
    • Developing a text categorization template for Turkish news portals 

      Toraman, C.; Can, F.; Koçberber, S. (2011)
      In news portals, text category information is needed for news presentation. However, for many news stories the category information is unavailable, incorrectly assigned or too generic. This makes the text categorization a ...
    • Effect of inverted index partitioning schemes on performance of query processing in parallel text retrieval systems 

      Cambazoglu, B. B.; Catal, A.; Aykanat, C. (Springer, 2006)
      Shared-nothing, parallel text retrieval systems require an inverted index, representing a document collection, to be partitioned among a number of processors. In general, the index can be partitioned based on either the ...
    • Effective early termination techniques for text similarity join operator 

      Özalp, S. A.; Ulusoy, Ö. (Springer, 2005)
      Text similarity join operator joins two relations if their join attributes are textually similar to each other, and it has a variety of application domains including integration and querying of data from heterogeneous ...
    • Ensemble pruning for text categorization based on data partitioning 

      Toraman, C.; Can, F. (2011)
      Ensemble methods can improve the effectiveness of text categorization. Due to the computational cost of ensemble approaches, there is a need for pruning ensembles. In this work we study ensemble pruning based on data partitioning. ...
    • First large-scale information retrieval experiments on Turkish texts 

      Can, F.; Kocberber, S.; Balcik, E.; Kaynak, C.; Ocalan, H. C.; Vursavas, O. M. (2006)
      We present the results of the first large-scale Turkish information retrieval experiments performed on a TREC-like test collection. The test bed, which has been created for this study, contains 95.5 million words, 408,305 ...
    • Information retrieval on Turkish texts 

      Can, F.; Kocberber, S.; Balcik, E.; Kaynak, C.; Ocalan, H. C.; Vursavas, O. M. (John Wiley & Sons, Inc., 2008-02)
      In this study, we investigate information retrieval (IR) on Turkish texts using a large-scale test collection that contains 408,305 documents and 72 ad hoc queries. We examine the effects of several stemming options and ...
    • Lexical cohesion based topic modeling for summarization 

      Ercan, G.; Cicekli, I. (Springer, 2008)
      In this paper, we attack the problem of forming extracts for text summarization. Forming extracts involves selecting the most representative and significant sentences from the text. Our method takes advantage of the lexical ...
    • Performance of query processing implementations in ranking-based text retrieval systems using inverted indices 

      Cambazoglu, B. B.; Aykanat, C. (Elsevier Ltd, 2006-07)
      Similarity calculations and document ranking form the computationally expensive parts of query processing in ranking-based text retrieval. In this work, for these calculations, 11 alternative implementation techniques are ...
    • Query expansion with terms selected using lexical cohesion analysis of documents 

      Vechtomova, O.; Karamuftuoglu, M. (Elsevier Ltd, 2007-07)
      We present new methods of query expansion using terms that form lexical cohesive links between the contexts of distinct query terms in documents (i.e., words surrounding the query terms in text). The link-forming terms ...
    • Squeezing the ensemble pruning: Faster and more accurate categorization for news portals 

      Toraman, C.; Can, F. (2012)
      Recent studies show that ensemble pruning works as effectively as a traditional ensemble of classifiers (EoC). In this study, we analyze how ensemble pruning can improve text categorization efficiency in time-critical real-life ...
    • Summarization of documentaries 

      Demirtas, K.; Cicekli, I.; Cicekli, N. K. (2010)
      Video summarization algorithms present condensed versions of a full length video by identifying the most significant parts of the video. In this paper, we propose an automatic video summarization method using the subtitles ...
    • Using lexical chains for keyword extraction 

      Ercan, G.; Cicekli, I. (Elsevier Ltd, 2007-11)
      Keywords can be considered as condensed versions of documents and short forms of their summaries. In this paper, the problem of automatic extraction of keywords from documents is treated as a supervised learning task. A ...