Cost-aware strategies for query result caching in Web search engines
Altingovde, I. S.
ACM Transactions on the Web
Association for Computing Machinery
9:1 - 9:25
Search engines and large-scale IR systems need to cache query results for efficiency and scalability. Static and dynamic caching techniques, as well as their combinations, are employed to cache query results effectively. In this study, we propose cost-aware strategies for static and dynamic caching setups. Our research is motivated by two key observations: (i) query processing costs may vary significantly among different queries, and (ii) the processing cost of a query is not proportional to its popularity (i.e., its frequency in the previous logs). The first observation implies that cache misses have different, that is, nonuniform, costs in this context. The second observation implies that typical caching policies, based solely on query popularity, cannot always minimize the total cost. Therefore, we propose to explicitly incorporate the query costs into the caching policies. Simulation results using two large Web crawl datasets and a real query log reveal that the proposed approach improves overall system performance in terms of the average query execution time. © 2011 ACM.
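The two observations can be illustrated with a small static-cache sketch. The abstract does not state the scoring function used in the study, so the frequency-times-cost score below is an assumption chosen for illustration only, as are the function names, the toy log, and the cost values.

```python
from collections import Counter


def select_static_cache(train_log, cost, capacity, cost_aware=True):
    """Choose `capacity` queries to keep in a static result cache.

    Assumption for illustration: the cost-aware score is frequency * cost.
    The frequency-only baseline ranks queries by popularity alone.
    """
    freq = Counter(train_log)
    if cost_aware:
        score = lambda q: freq[q] * cost[q]
    else:
        score = lambda q: freq[q]
    # Sort by descending score; break ties by query string for determinism.
    ranked = sorted(freq, key=lambda q: (-score(q), q))
    return set(ranked[:capacity])


def total_miss_cost(test_log, cost, cached):
    """Sum the processing cost of every query that misses the cache."""
    return sum(cost[q] for q in test_log if q not in cached)


# Hypothetical toy log: popular queries are cheap, a rarer one is expensive.
train_log = ["a"] * 5 + ["b"] * 4 + ["c"] * 2 + ["d"] * 1
cost = {"a": 1.0, "b": 2.0, "c": 30.0, "d": 6.0}  # made-up cost units
test_log = list(train_log)  # assume the future log resembles the past one

freq_cache = select_static_cache(train_log, cost, 2, cost_aware=False)
cost_cache = select_static_cache(train_log, cost, 2, cost_aware=True)

print(sorted(freq_cache), total_miss_cost(test_log, cost, freq_cache))  # ['a', 'b'] 66.0
print(sorted(cost_cache), total_miss_cost(test_log, cost, cost_cache))  # ['b', 'c'] 11.0
```

In this toy setting the frequency-only cache serves more hits (9 of 12 requests versus 6 of 12) yet incurs a higher total miss cost, which is the kind of gap between hit ratio and execution cost that the abstract describes.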
Keywords
Query result caching
Web search engines
Query execution time
Static and dynamic
Published Version (Please cite this version): http://dx.doi.org/10.1145/1961659.1961663