A result cache invalidation scheme for web search engines

buir.advisorUlusoy, Özgür
dc.contributor.authorAlıcı, Şadiye
dc.date.accessioned2016-01-08T18:15:29Z
dc.date.available2016-01-08T18:15:29Z
dc.date.issued2011
dc.descriptionAnkara : The Department of Computer Engineering and the Graduate School of Engineering and Science of Bilkent University, 2011.en_US
dc.descriptionThesis (Master's) -- Bilkent University, 2011.en_US
dc.descriptionIncludes bibliographical references leaves 51-55.en_US
dc.description.abstractThe result cache is a vital component for the efficiency of large-scale web search engines, and maintaining the freshness of cached query results is a current research challenge. As a remedy to this problem, our work proposes a new mechanism to identify queries whose cached results are stale. The basic idea behind our mechanism is to maintain and compare the generation time of query results with the update times of posting lists and documents to decide on staleness of query results. The proposed technique is evaluated using a Wikipedia document collection with real update information and a real-life query log. Throughout the experiments, we compare our approach with two baseline strategies from literature together with a detailed evaluation. We show that our technique has good prediction accuracy, relative to the baseline based on the time-to-live (TTL) mechanism. Moreover, it is easy to implement and it incurs less processing overhead on the system relative to a recently proposed, more sophisticated invalidation mechanism.en_US
dc.description.provenanceMade available in DSpace on 2016-01-08T18:15:29Z (GMT). No. of bitstreams: 1 0005088.pdf: 2679437 bytes, checksum: 194dc0591ce1a46effbbec4bc4fc7924 (MD5)en
dc.description.statementofresponsibilityAlıcı, Şadiyeen_US
dc.format.extentxi, 55 leavesen_US
dc.identifier.urihttp://hdl.handle.net/11693/15243
dc.language.isoEnglishen_US
dc.rightsinfo:eu-repo/semantics/openAccessen_US
dc.subjectWeb searchen_US
dc.subjectresult cacheen_US
dc.subjectcache invalidationen_US
dc.subjecttime-to-liveen_US
dc.subjectfreshnessen_US
dc.subjectadaptiveen_US
dc.subject.lccTK5105.884 .A55 2011en_US
dc.subject.lcshSearch engines--Programming.en_US
dc.subject.lcshWeb search engines--Mathematical models.en_US
dc.subject.lcshInformation storage and retrieval systems.en_US
dc.subject.lcshInformation retrieval.en_US
dc.subject.lcshInternet searching.en_US
dc.subject.lcshCache memory.en_US
dc.subject.lcshElectronic data processing--Backup processing alternatives.en_US
dc.titleA result cache invalidation scheme for web search enginesen_US
dc.typeThesisen_US
thesis.degree.disciplineComputer Engineering
thesis.degree.grantorBilkent University
thesis.degree.levelMaster's
thesis.degree.nameMS (Master of Science)

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
0005088.pdf
Size:
2.56 MB
Format:
Adobe Portable Document Format