Large-scale cluster-based retrieval experiments on Turkish texts

Date
2007
Editor(s)
Advisor
Supervisor
Co-Advisor
Co-Supervisor
Instructor
Source Title
SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
Print ISSN
Electronic ISSN
Publisher
ACM
Volume
Issue
Pages
891 - 892
Language
English
Journal Title
Journal ISSN
Volume Title
Series
Abstract

We present cluster-based retrieval (CBR) experiments on the largest available Turkish document collection. Our experiments evaluate retrieval effectiveness and efficiency on both an automatically generated clustering structure and a manual classification of documents. In particular, we compare CBR effectiveness with full-text search (FS) and evaluate several implementation alternatives for CBR. Our findings reveal that CBR yields comparable effectiveness figures with FS. Furthermore, by using a specifically tailored cluster-skipping inverted index we significantly improve in-memory query processing efficiency of CBR in comparison to other traditional CBR techniques and even FS.

Course
Other identifiers
Book Title
Citation
Published Version (Please cite this version)