A five-level static cache architecture for web search engines

buir.contributor.author: Ulusoy, Özgür
dc.citation.epage: 840 [en_US]
dc.citation.issueNumber: 5 [en_US]
dc.citation.spage: 828 [en_US]
dc.citation.volumeNumber: 48 [en_US]
dc.contributor.author: Ozcan, R. [en_US]
dc.contributor.author: Altingovde, I. S. [en_US]
dc.contributor.author: Cambazoglu, B. B. [en_US]
dc.contributor.author: Junqueira, F. P. [en_US]
dc.contributor.author: Ulusoy, Özgür [en_US]
dc.date.accessioned: 2016-02-08T09:45:01Z
dc.date.available: 2016-02-08T09:45:01Z
dc.date.issued: 2012 [en_US]
dc.department: Department of Computer Engineering [en_US]
dc.description.abstract: Caching is a crucial performance component of large-scale web search engines, as it greatly helps reduce average query response times and query processing workloads on backend search clusters. In this paper, we describe a multi-level static cache architecture that stores five different item types: query results, precomputed scores, posting lists, precomputed intersections of posting lists, and documents. Moreover, we propose a greedy heuristic to prioritize items for caching, based on gains computed from the items' past access frequencies, estimated computational costs, and storage overheads. This heuristic takes into account the inter-dependency between individual items when making its caching decisions, i.e., after a particular item is cached, the gains of all items affected by this decision are updated. Our simulations under realistic assumptions reveal that the proposed heuristic performs better than dividing the entire cache space among particular item types at fixed proportions. © 2010 Elsevier Ltd. All rights reserved. [en_US]
dc.description.provenance: Made available in DSpace on 2016-02-08T09:45:01Z (GMT). No. of bitstreams: 1 bilkent-research-paper.pdf: 70227 bytes, checksum: 26e812c6f5156f83f0e77b261a471b5a (MD5) Previous issue date: 2012 [en]
dc.identifier.doi: 10.1016/j.ipm.2010.12.007 [en_US]
dc.identifier.issn: 0306-4573 [en_US]
dc.identifier.uri: http://hdl.handle.net/11693/21343 [en_US]
dc.language.iso: English [en_US]
dc.publisher: Elsevier Ltd [en_US]
dc.relation.isversionof: http://dx.doi.org/10.1016/j.ipm.2010.12.007 [en_US]
dc.source.title: Information Processing & Management [en_US]
dc.subject: Query processing [en_US]
dc.subject: Static caching [en_US]
dc.subject: Web search engines [en_US]
dc.subject: Access frequency [en_US]
dc.subject: Cache architecture [en_US]
dc.subject: Caching decisions [en_US]
dc.subject: Computational costs [en_US]
dc.subject: Greedy heuristics [en_US]
dc.subject: Inter-dependencies [en_US]
dc.title: A five-level static cache architecture for web search engines [en_US]
dc.type: Article [en_US]
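
The greedy heuristic summarized in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the gain formula (frequency × cost / size), the `Item` class, the `absorbs` dependency model, and the function name `greedy_fill` are all assumptions made for illustration. The sketch shows only the general idea of picking the highest-gain item that fits and then updating the gains of the items that choice affects.

```python
from dataclasses import dataclass, field, replace


@dataclass
class Item:
    name: str
    freq: float  # past access frequency
    cost: float  # estimated computational cost saved per cache hit
    size: float  # storage overhead in the cache
    # Hypothetical dependency model: once this item is cached, it absorbs
    # this many accesses from each named item.
    absorbs: dict = field(default_factory=dict)

    def gain(self) -> float:
        # Assumed gain formula: expected savings per unit of cache space.
        return self.freq * self.cost / self.size


def greedy_fill(items, capacity):
    """Return the names of cached items, in the order they were selected."""
    pool = {it.name: replace(it) for it in items}  # copies; inputs stay intact
    cached, free = [], capacity
    while True:
        fitting = [it for it in pool.values()
                   if it.size <= free and it.gain() > 0]
        if not fitting:
            break
        best = max(fitting, key=Item.gain)
        cached.append(best.name)
        free -= best.size
        del pool[best.name]
        # Update the gains of every item affected by this caching decision.
        for name, absorbed in best.absorbs.items():
            if name in pool:
                pool[name].freq = max(0.0, pool[name].freq - absorbed)
    return cached


items = [
    Item("result:q1", freq=10, cost=5, size=1, absorbs={"list:t1": 10}),
    Item("list:t1", freq=12, cost=2, size=2),
    Item("list:t2", freq=6, cost=2, size=2),
]
print(greedy_fill(items, capacity=3))  # ['result:q1', 'list:t2']
```

In this toy input, caching the query result removes most accesses to `list:t1`, so its gain drops below that of `list:t2`. A selection that ignored the dependency would have cached `list:t1` instead.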

Files

Original bundle

Name: A five-level static cache architecture for web search engines.pdf
Size: 548.6 KB
Format: Adobe Portable Document Format
Description: Full printable version