A new approach to search result clustering and labeling

Türel, Anıl; Can, Fazlı

A new approach to search result clustering and labeling

Files

A new approach to search result clustering and labeling.pdf (165.77 KB)

Date

2011

Authors

Türel, Anıl

Can, Fazlı

BUIR Usage Stats

5
views

17
downloads

Citation Stats

Abstract

Search engines present query results as a long ordered list of web snippets divided into several pages. Post-processing of retrieval results for easier access of desired information is an important research problem. In this paper, we present a novel search result clustering approach to split the long list of documents returned by search engines into meaningfully grouped and labeled clusters. Our method emphasizes clustering quality by using cover coefficient-based and sequential k-means clustering algorithms. A cluster labeling method based on term weighting is also introduced for reflecting cluster contents. In addition, we present a new metric that employs precision and recall to assess the success of cluster labeling. We adopt a comparative strategy to derive the relative performance of the proposed method with respect to two prominent search result clustering methods: Suffix Tree Clustering and Lingo. Experimental results in the publicly available AMBIENT and ODP-239 datasets show that our method can successfully achieve both clustering and labeling tasks. © 2011 Springer-Verlag Berlin Heidelberg.

Source Title

Information Retrieval Technology

Publisher

Springer, Berlin, Heidelberg

Permalink

http://hdl.handle.net/11693/28246

Published Version (Please cite this version)

http://dx.doi.org/10.1007/978-3-642-25631-8_26
https://doi.org/10.1007/978-3-642-25631-8

Collections

Scholarly Publications - Computer Engineering

Language

English

Type

Conference Paper

Full item page

A new approach to search result clustering and labeling

Files

Date

Authors

Editor(s)

Advisor

Supervisor

Co-Advisor

Co-Supervisor

Instructor

BUIR Usage Stats

Citation Stats

Series

Abstract

Source Title

Publisher

Course

Other identifiers

Book Title

Keywords

Degree Discipline

Degree Level

Degree Name

Citation

Permalink

Published Version (Please cite this version)

Collections

Language

Type

A new approach to search result clustering and labeling

Files

Date

Authors

Editor(s)

Advisor

Supervisor

Co-Advisor

Co-Supervisor

Instructor

BUIR Usage Stats

Citation Stats

Share

Series

Abstract

Source Title

Publisher

Course

Other identifiers

Book Title

Keywords

Degree Discipline

Degree Level

Degree Name

Citation

Permalink

Published Version (Please cite this version)

Collections

Language

Type