XML retrieval using pruned element-index files
Author
Altıngövde, İsmail Şengör
Atılgan, Duygu
Ulusoy, Özgür
Date
2010Source Title
Advances in Information Retrieval
Print ISSN
0302-9743
Publisher
Springer, Berlin, Heidelberg
Volume
5993
Pages
306 - 318
Language
English
Type
Conference PaperItem Usage Stats
107
views
views
60
downloads
downloads
Abstract
An element-index is a crucial mechanism for supporting content-only (CO) queries over XML collections. A full element-index that indexes each element along with the content of its descendants involves a high redundancy and reduces query processing efficiency. A direct index, on the other hand, only indexes the content that is directly under each element and disregards the descendants. This results in a smaller index, but possibly in return to some reduction in system effectiveness. In this paper, we propose using static index pruning techniques for obtaining more compact index files that can still result in comparable retrieval performance to that of a full index. We also compare the retrieval performance of these pruning based approaches to some other strategies that make use of a direct element-index. Our experiments conducted along with the lines of INEX evaluation framework reveal that pruned index files yield comparable to or even better retrieval performance than the full index and direct index, for several tasks in the ad hoc track. © 2010 Springer-Verlag Berlin Heidelberg.
Keywords
Ad hoc tracksEvaluation framework
High redundancy
Index files
Pruning techniques
Retrieval performance
System effectiveness
XML Retrieval
Markup languages
XML
Information retrieval
Permalink
http://hdl.handle.net/11693/28582Published Version (Please cite this version)
http://dx.doi.org/10.1007/978-3-642-12275-0-28https://doi.org/10.1007/978-3-642-12275-0