On document relevance and lexical cohesion between query terms

Date

2006

Authors

Vechtomova, O.
Karamuftuoglu, M.
Robertson, S. E.

Editor(s)

Advisor

Supervisor

Co-Advisor

Co-Supervisor

Instructor

Source Title

Information Processing and Management

Print ISSN

0306-4573

Electronic ISSN

Publisher

Elsevier

Volume

42

Issue

5

Pages

1230 - 1247

Language

English

Journal Title

Journal ISSN

Volume Title

Series

Abstract

Lexical cohesion is a property of text, achieved through lexical-semantic relations between words in text. Most information retrieval systems make use of lexical relations in text only to a limited extent. In this paper we empirically investigate whether the degree of lexical cohesion between the contexts of query terms' occurrences in a document is related to its relevance to the query. Lexical cohesion between distinct query terms in a document is estimated on the basis of the lexical-semantic relations (repetition, synonymy, hyponymy and sibling) that exist between there collocates - words that co-occur with them in the same windows of text. Experiments suggest significant differences between the lexical cohesion in relevant and non-relevant document sets exist. A document ranking method based on lexical cohesion shows some performance improvements. © 2006 Elsevier Ltd. All rights reserved.

Course

Other identifiers

Book Title

Citation