Effect of inverted index partitioning schemes on performance of query processing in parallel text retrieval systems

Cambazoğlu, B. Barla; Çatal, A.; Aykanat, Cevdet

Effect of inverted index partitioning schemes on performance of query processing in parallel text retrieval systems

Files

Effect of inverted index partitioning schemes on performance of query processing in parallel text retrieval systems.pdf (446.5 KB)

Date

2006-11

Authors

Cambazoğlu, B. Barla

Çatal, A.

Aykanat, Cevdet

BUIR Usage Stats

3
views

23
downloads

Citation Stats

Abstract

Shared-nothing, parallel text retrieval systems require an inverted index, representing a document collection, to be partitioned among a number of processors. In general, the index can be partitioned based on either the terms or documents in the collection, and the way the partitioning is done greatly affects the query processing performance of the parallel system. In this work, we investigate the effect of these two index partitioning schemes on query processing. We conduct experiments on a 32-node PC cluster, considering the case where index is completely stored in disk. Performance results are reported for a large (30 GB) document collection using an MPI-based parallel query processing implementation. © Springer-Verlag Berlin Heidelberg 2006.

Source Title

21th International Symposium on Computer and Information Sciences – ISCIS 2006

Publisher

Springer

Keywords

Data processing, Information retrieval, Magnetic disk storage, Parallel processing systems, Program processors, Query languages, Text processing, Document collection, Index partitioning schemes, Inverted index partitioning, Parallel text retrieval systems, Indexing (of information)

Permalink

http://hdl.handle.net/11693/27280

Published Version (Please cite this version)

https://doi.org/10.1007/11902140_75

Collections

Scholarly Publications - Computer Engineering

Language

English

Type

Conference Paper

Full item page

Effect of inverted index partitioning schemes on performance of query processing in parallel text retrieval systems

Files

Date

Authors

Editor(s)

Advisor

Supervisor

Co-Advisor

Co-Supervisor

Instructor

BUIR Usage Stats

Citation Stats

Series

Abstract

Source Title

Publisher

Course

Other identifiers

Book Title

Keywords

Degree Discipline

Degree Level

Degree Name

Citation

Permalink

Published Version (Please cite this version)

Collections

Language

Type