On document relevance and lexical cohesion between query terms

Date
2006
Authors
Vechtomova, O.
Karamuftuoglu, M.
Robertson, S. E.
Editor(s)
Advisor
Supervisor
Co-Advisor
Co-Supervisor
Instructor
Source Title
Information Processing and Management
Print ISSN
0306-4573
Electronic ISSN
Publisher
Elsevier
Volume
42
Issue
5
Pages
1230 - 1247
Language
English
Journal Title
Journal ISSN
Volume Title
Series
Abstract

Lexical cohesion is a property of text, achieved through lexical-semantic relations between words in text. Most information retrieval systems make use of lexical relations in text only to a limited extent. In this paper we empirically investigate whether the degree of lexical cohesion between the contexts of query terms' occurrences in a document is related to its relevance to the query. Lexical cohesion between distinct query terms in a document is estimated on the basis of the lexical-semantic relations (repetition, synonymy, hyponymy and sibling) that exist between there collocates - words that co-occur with them in the same windows of text. Experiments suggest significant differences between the lexical cohesion in relevant and non-relevant document sets exist. A document ranking method based on lexical cohesion shows some performance improvements. © 2006 Elsevier Ltd. All rights reserved.

Course
Other identifiers
Book Title
Citation
Published Version (Please cite this version)