Space efficient caching of query results in search engines

Date
2008-10
Advisor
Supervisor
Co-Advisor
Co-Supervisor
Instructor
Source Title
23rd International Symposium on Computer and Information Sciences, ISCIS 2008
Print ISSN
Electronic ISSN
Publisher
IEEE
Volume
Issue
Pages
Language
English
Type
Conference Paper
Journal Title
Journal ISSN
Volume Title
Series
Abstract

Web search engines serve millions of query requests per day. Caching query results is one of the most crucial mechanisms to cope with such a demanding load. In this paper, we propose an efficient storage model to cache document identifiers of query results. Essentially, we first cluster queries that have common result documents. Next, for each cluster, we attempt to store those common document identifiers in a more compact manner. Experimental results reveal that the proposed storage model achieves space reduction of up to 4%. The proposed model is envisioned to improve the cache hit rate and system throughput as it allows storing more query results within a particular cache space, in return to a negligible increase in the cost of preparing the final query result page. © 2008 IEEE.

Course
Other identifiers
Book Title
Keywords
Information retrieval, Information retrieval systems, Information science, Search engines, Cache hit rates, Document identifiers, Experimental results, Query results, Space reductions, Storage model, System throughput, Web searches, World Wide Web
Citation
Published Version (Please cite this version)