Static index pruning in web search engines: combining term and document popularities with query views
dc.citation.epage | 2-28 | en_US |
dc.citation.issueNumber | 1 | en_US |
dc.citation.spage | 2-1 | en_US |
dc.citation.volumeNumber | 30 | en_US |
dc.contributor.author | Altingovde, I. S. | en_US |
dc.contributor.author | Ozcan, R. | en_US |
dc.contributor.author | Ulusoy, O. | en_US |
dc.date.accessioned | 2016-02-08T09:48:43Z | |
dc.date.available | 2016-02-08T09:48:43Z | |
dc.date.issued | 2012 | en_US |
dc.department | Department of Computer Engineering | en_US |
dc.description.abstract | Static index pruning techniques permanently remove a presumably redundant part of an inverted file to reduce the file size and query processing time. These techniques differ in how they decide which parts of an index can be removed safely; that is, without changing the top-ranked query results. As defined in the literature, the query view of a document is the set of query terms that access this particular document, that is, retrieve this document among their top results. In this paper, we first propose using query views to improve the quality of the top results compared against the original results. We incorporate query views in a number of static pruning strategies, namely term-centric, document-centric, term popularity based, and document access popularity based approaches, and show that the new strategies considerably outperform their counterparts, especially for the higher levels of pruning and for both disjunctive and conjunctive query processing. Additionally, we combine the notions of term and document access popularity to form new pruning strategies, and further extend these strategies with the query views. The new strategies improve the result quality especially for conjunctive query processing, which is the default and most common search mode of a search engine. © 2012 ACM. | en_US |
dc.description.provenance | Made available in DSpace on 2016-02-08T09:48:43Z (GMT). No. of bitstreams: 1 bilkent-research-paper.pdf: 70227 bytes, checksum: 26e812c6f5156f83f0e77b261a471b5a (MD5) Previous issue date: 2012 | en |
dc.identifier.doi | 10.1145/2094072.2094074 | en_US |
dc.identifier.issn | 1046-8188 | |
dc.identifier.uri | http://hdl.handle.net/11693/21609 | |
dc.language.iso | English | en_US |
dc.publisher | Association for Computing Machinery | en_US |
dc.relation.isversionof | http://dx.doi.org/10.1145/2094072.2094074 | en_US |
dc.source.title | ACM Transactions on Information Systems | en_US |
dc.subject | Query view | en_US |
dc.subject | Static inverted index pruning | en_US |
dc.subject | Conjunctive queries | en_US |
dc.subject | Document access | en_US |
dc.subject | File sizes | en_US |
dc.subject | Inverted files | en_US |
dc.subject | Pruning strategy | en_US |
dc.subject | Pruning techniques | en_US |
dc.subject | Query results | en_US |
dc.subject | Query terms | en_US |
dc.subject | Query languages | en_US |
dc.subject | Query processing | en_US |
dc.subject | Search engines | en_US |
dc.subject | Information retrieval systems | en_US |
dc.title | Static index pruning in web search engines: combining term and document popularities with query views | en_US |
dc.type | Article | en_US |
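The query view definition in the abstract can be illustrated with a minimal sketch. This is not the paper's implementation: the function names, the dictionary-based index, and the `top_results` callable are all hypothetical, and the pruning rule shown (keep a posting only if its term is in the document's query view) is one simple assumed way such views could be applied.

```python
from collections import defaultdict


def build_query_views(query_log, top_results):
    """Map each document to the set of query terms that retrieved it.

    query_log: iterable of queries, each a list of terms (assumed format).
    top_results: hypothetical callable, query -> doc ids among its top results.
    """
    views = defaultdict(set)
    for query in query_log:
        for doc_id in top_results(query):
            # Every term of a query that retrieved the document joins its view.
            views[doc_id].update(query)
    return views


def prune_with_query_views(index, views):
    """Keep only postings (term, doc) whose term is in the doc's query view.

    index: dict mapping term -> list of doc ids (a toy inverted file).
    """
    pruned = {}
    for term, postings in index.items():
        kept = [d for d in postings if term in views.get(d, ())]
        if kept:
            pruned[term] = kept
    return pruned


# Toy data, invented for illustration only.
index = {"static": [1, 2, 3], "index": [1, 3], "pruning": [2]}
query_log = [["static", "index"], ["pruning"]]
results = {("static", "index"): [1], ("pruning",): [2]}

views = build_query_views(query_log, lambda q: results[tuple(q)])
pruned = prune_with_query_views(index, views)
```

In this toy run, document 3 was never retrieved by a logged query, so all of its postings are dropped, while document 1 keeps the postings for both terms of the query that retrieved it.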
Files
Original bundle
- Name: Static index pruning in web search engines Combining term and document popularities with query views.pdf
- Size: 2.32 MB
- Format: Adobe Portable Document Format
- Description: Full printable version