Efficient result caching mechanisms in search engines

buir.advisorUlusoy, Özgür
dc.contributor.authorSazoğlu, Fethi Burak
dc.date.accessioned2016-01-08T20:01:52Z
dc.date.available2016-01-08T20:01:52Z
dc.date.issued2014
dc.descriptionAnkara : The Department of Computer Engineering and the Graduate School of Engineering and Science of Bilkent University, 2014.en_US
dc.descriptionThesis (Master's) -- Bilkent University, 2014.en_US
dc.descriptionIncludes bibliographical references leaves 60-63.en_US
dc.description.abstractThe performance of a search engine depends on its components such as crawler, indexer and processor. The query latency, accuracy and recency of the results play crucial role in determining the performance. High performance can be provided with powerful hardware in the data center, but keeping the operational costs restrained is mandatory for search engines for commercial durability. This thesis focuses on techniques to boost the performance of search engines by means of reducing both the number of queries issued to the backend and the cost to process a query stream. This can be accomplished by taking advantage of the temporal locality of the queries. Caching the result for a recently issued query removes the need to reprocess this query when it is issued again by the same or different user. Therefore, deploying query result cache decreases the load on the resources of the search engine which increases the processing power. The main objective of this thesis is to improve search engine performance by enhancing productivity of result cache. This is done by endeavoring to maximize the cache hit rate and minimizing the processing cost by using the per query statistics such as frequency, timestamp and cost. While providing high hit rates and low processing costs improves performance, the freshness of the queries in the cache has to be considered as well for user satisfaction. Therefore, a variety of techniques are examined in this thesis to bound the staleness of cache results without blasting the backend with refresh queries. The offered techniques are demonstrated to be efficient by using real query log data from a commercial search engine.en_US
dc.description.provenanceMade available in DSpace on 2016-01-08T20:01:52Z (GMT). No. of bitstreams: 1 0006713.pdf: 585485 bytes, checksum: 14425c159f05ae9317e0d3c381d27095 (MD5)en
dc.description.statementofresponsibilitySazoğlu, Fethi Buraken_US
dc.format.extentxii, 65 leaves, graphicsen_US
dc.identifier.itemidB148326
dc.identifier.urihttp://hdl.handle.net/11693/16859
dc.language.isoEnglishen_US
dc.rightsinfo:eu-repo/semantics/openAccessen_US
dc.subjectQuery result Cachingen_US
dc.subjectWeb search enginesen_US
dc.subjectFinancial costen_US
dc.subjectTime-to-liveen_US
dc.subject.lccTK7895 .S29 2014en_US
dc.subject.lcshCache memory.en_US
dc.subject.lcshSearch engines--Programming.en_US
dc.titleEfficient result caching mechanisms in search enginesen_US
dc.typeThesisen_US
thesis.degree.disciplineComputer Engineering
thesis.degree.grantorBilkent University
thesis.degree.levelMaster's
thesis.degree.nameMS (Master of Science)

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
0006713.pdf
Size:
571.76 KB
Format:
Adobe Portable Document Format