Efficiency and effectiveness of query processing in cluster-based retrieval

Can, F.; Altingövde, I.S.; Demir, E.

Files

Efficiency and effectiveness of query processing in cluster-based retrieval.pdf (263.84 KB)

Date

2004

Authors

Can, F.

Altingövde, I.S.

Demir, E.

Abstract

Our research shows that, for large databases, cluster-based retrieval (CBR) can compete with the time efficiency and effectiveness of inverted index-based full search (FS) without considerable additional storage overhead. The proposed CBR method employs a storage structure that blends cluster membership information into the inverted file posting lists. This approach significantly reduces the cost of similarity calculations for document ranking during query processing and thereby improves efficiency. For example, in terms of in-memory computations, our new approach can reduce query processing time to 39% of that of FS. The experiments confirm that the approach is scalable and that system performance improves with increasing database size. In the experiments, we use the cover coefficient-based clustering methodology (C3M) and the Financial Times database of TREC, which contains 210,158 documents (564 MB) defined by 229,748 terms, with a total of 29,545,234 inverted index elements. This study provides CBR efficiency and effectiveness experiments using the largest corpus in an environment that employs no user interaction or user behavior assumptions for clustering. © 2003 Elsevier Ltd. All rights reserved.
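
The idea of blending cluster membership into posting lists can be illustrated with a small sketch. This is not the authors' storage structure or code; it is a minimal in-memory approximation in which each term's postings are grouped by cluster, so that ranking touches only the postings of the selected clusters. The names `build_index` and `rank`, the dictionary layout, and the dot-product similarity are all assumptions made for illustration, and the way the best clusters are chosen is left to the caller.

```python
from collections import defaultdict


def build_index(docs, cluster_of):
    """Build a hypothetical cluster-aware inverted index.

    docs:       {doc_id: {term: weight}}
    cluster_of: {doc_id: cluster_id}
    Returns     {term: {cluster_id: [(doc_id, weight), ...]}}
    """
    index = defaultdict(lambda: defaultdict(list))
    for doc_id, terms in docs.items():
        for term, weight in terms.items():
            # Postings of one term are grouped by the cluster of the document.
            index[term][cluster_of[doc_id]].append((doc_id, weight))
    return index


def rank(index, query, best_clusters, top_k=10):
    """Score documents by a dot product, visiting only postings
    that belong to the selected clusters (an illustrative assumption)."""
    scores = defaultdict(float)
    for term, query_weight in query.items():
        groups = index.get(term, {})
        for cluster_id in best_clusters:
            # Postings of all other clusters are never read.
            for doc_id, weight in groups.get(cluster_id, ()):
                scores[doc_id] += query_weight * weight
    # Highest score first; ties broken by document id.
    return sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))[:top_k]
```

Passing every cluster id as `best_clusters` degenerates to a full search over the same index, which makes it easy to compare the two modes on toy data.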

Source Title

Information Systems

Publisher

Elsevier

Keywords

Cluster-based retrieval, Clustering, Information retrieval, Performance, Query processing, Algorithms, Data structures, Indexing (of information), Information science, Optimization, In-memory computations, Query languages

Permalink

http://hdl.handle.net/11693/24168

Published Version (Please cite this version)

http://dx.doi.org/10.1016/S0306-4379(03)00062-0

Collections

Scholarly Publications - Computer Engineering

Language

English

Type

Article
