A five-level static cache architecture for web search engines
buir.contributor.author | Ulusoy, Özgür | |
dc.citation.epage | 840 | en_US |
dc.citation.issueNumber | 5 | en_US |
dc.citation.spage | 828 | en_US |
dc.citation.volumeNumber | 48 | en_US |
dc.contributor.author | Ozcan, R. | en_US |
dc.contributor.author | Altingovde, I. S. | en_US |
dc.contributor.author | Cambazoglu, B. B. | en_US |
dc.contributor.author | Junqueira, F. P. | en_US |
dc.contributor.author | Ulusoy, Özgür | en_US |
dc.date.accessioned | 2016-02-08T09:45:01Z | |
dc.date.available | 2016-02-08T09:45:01Z | |
dc.date.issued | 2012 | en_US |
dc.department | Department of Computer Engineering | en_US |
dc.description.abstract | Caching is a crucial performance component of large-scale web search engines, as it greatly helps reducing average query response times and query processing workloads on backend search clusters. In this paper, we describe a multi-level static cache architecture that stores five different item types: query results, precomputed scores, posting lists, precomputed intersections of posting lists, and documents. Moreover, we propose a greedy heuristic to prioritize items for caching, based on gains computed by using items' past access frequencies, estimated computational costs, and storage overheads. This heuristic takes into account the inter-dependency between individual items when making its caching decisions, i.e.; after a particular item is cached, gains of all items that are affected by this decision are updated. Our simulations under realistic assumptions reveal that the proposed heuristic performs better than dividing the entire cache space among particular item types at fixed proportions. © 2010 Elsevier Ltd. All rights reserved. | en_US |
dc.description.provenance | Made available in DSpace on 2016-02-08T09:45:01Z (GMT). No. of bitstreams: 1 bilkent-research-paper.pdf: 70227 bytes, checksum: 26e812c6f5156f83f0e77b261a471b5a (MD5) Previous issue date: 2012 | en |
dc.identifier.doi | 10.1016/j.ipm.2010.12.007 | en_US |
dc.identifier.issn | 0306-4573 | en_US |
dc.identifier.uri | http://hdl.handle.net/11693/21343 | en_US |
dc.language.iso | English | en_US |
dc.publisher | Elsevier Ltd | en_US |
dc.relation.isversionof | http://dx.doi.org/10.1016/j.ipm.20http://dx.doi.org/10.12.007 | en_US |
dc.source.title | Information Processing & Management | en_US |
dc.subject | Query processing | en_US |
dc.subject | Static caching | en_US |
dc.subject | Web search engines | en_US |
dc.subject | Access frequency | en_US |
dc.subject | Cache architecture | en_US |
dc.subject | Caching decisions | en_US |
dc.subject | Computational costs | en_US |
dc.subject | Greedy heuristics | en_US |
dc.subject | Inter - dependencies | en_US |
dc.title | A five-level static cache architecture for web search engines | en_US |
dc.type | Article | en_US |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- A five-level static cache architecture for web search engines.pdf
- Size:
- 548.6 KB
- Format:
- Adobe Portable Document Format
- Description:
- Full printable version