Automatic ranking of information retrieval systems using data fusion

Nuray, R.; Can, F.

Files

Automatic ranking of information retrieval systems using data fusion.pdf (455 KB)

Date

2006-05

Authors

Nuray, R.

Can, F.

Abstract

Measuring effectiveness of information retrieval (IR) systems is essential for research and development and for monitoring search quality in dynamic environments. In this study, we employ new methods for automatic ranking of retrieval systems. In these methods, we merge the retrieval results of multiple systems using various data fusion algorithms, use the top-ranked documents in the merged result as the "(pseudo) relevant documents," and employ these documents to evaluate and rank the systems. Experiments using Text REtrieval Conference (TREC) data provide statistically significant strong correlations with human-based assessments of the same systems. We hypothesize that the selection of systems that would return documents different from the majority could eliminate the ordinary systems from data fusion and provide better discrimination among the documents and systems. This could improve the effectiveness of automatic ranking. Based on this intuition, we introduce a new method for the selection of systems to be used for data fusion. For this purpose, we use the bias concept that measures the deviation of a system from the norm or majority and employ the systems with higher bias in the data fusion process. This approach provides even higher correlations with the human-based results. We demonstrate that our approach outperforms the previously proposed automatic ranking methods. © 2005 Elsevier Ltd. All rights reserved.
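The pipeline described in the abstract (fuse the runs, treat the top of the fused list as pseudo-relevant, score each system against it) can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the abstract does not name the fusion algorithms, so a simplified Borda count is assumed here, precision at a cutoff stands in for the effectiveness measure, and the bias function assumes "deviation from the norm" can be approximated as one minus the cosine similarity between a system's document vector and the pooled document counts. All function names, parameters, and the toy runs are hypothetical.

```python
import math
from collections import Counter, defaultdict


def borda_fuse(runs):
    """Merge ranked lists with a simplified Borda count (assumed fusion method).

    A document at position p (0-based) in one run earns n - p points, where n
    is the size of the pooled document set. Documents a system did not
    return earn nothing from that system (a simplification).
    """
    pool = {doc for ranking in runs.values() for doc in ranking}
    n = len(pool)
    scores = defaultdict(float)
    for ranking in runs.values():
        for pos, doc in enumerate(ranking):
            scores[doc] += n - pos
    # Highest score first; ties broken by document id for determinism.
    return sorted(scores, key=lambda d: (-scores[d], d))


def precision_at_k(ranking, relevant, k):
    """Fraction of the top-k documents that are in the relevant set."""
    return sum(1 for d in ranking[:k] if d in relevant) / k


def rank_systems(runs, pseudo_size, k):
    """Rank systems by precision@k against pseudo-relevant documents."""
    pseudo_relevant = set(borda_fuse(runs)[:pseudo_size])
    scores = {name: precision_at_k(r, pseudo_relevant, k)
              for name, r in runs.items()}
    return sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))


def bias(name, runs):
    """Assumed bias measure: 1 - cosine(system vector, norm vector).

    The system vector is binary over returned documents; the norm vector
    counts how many systems returned each document.
    """
    norm = Counter(doc for ranking in runs.values() for doc in ranking)
    own = set(runs[name])
    dot = sum(norm[d] for d in own)
    own_len = math.sqrt(len(own))
    norm_len = math.sqrt(sum(c * c for c in norm.values()))
    return 1.0 - dot / (own_len * norm_len)


# Toy runs for a single topic (hypothetical data, for illustration only).
example_runs = {
    "sysA": ["d1", "d2", "d3", "d4"],
    "sysB": ["d2", "d1", "d5", "d3"],
    "sysC": ["d7", "d8", "d9", "d1"],
}

if __name__ == "__main__":
    print(rank_systems(example_runs, pseudo_size=3, k=4))
    print({name: round(bias(name, example_runs), 3) for name in example_runs})
```

In this sketch, a system-selection step in the spirit of the abstract would keep only the runs with the highest `bias` values before calling `borda_fuse`; how many systems to keep is left open here because the abstract does not specify it.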

Source Title

Information Processing and Management

Publisher

Elsevier Ltd

Keywords

Data fusion, Experimentation, Information retrieval, Performance evaluation, Rank aggregation, Research and development management

Permalink

http://hdl.handle.net/11693/23803

Published Version (Please cite this version)

http://dx.doi.org/10.1016/j.ipm.2005.03.023

Collections

Scholarly Publications - Computer Engineering

Language

English

Type

Article
