Exploiting index pruning methods for clustering XML collections

Altıngövde, İsmail Şengör; Atılgan, Duygu; Ulusoy, Özgür

Exploiting index pruning methods for clustering XML collections

Files

Exploiting index pruning methods for clustering XML collections.pdf (212.24 KB)

Date

2010

Authors

Altıngövde, İsmail Şengör

Atılgan, Duygu

Ulusoy, Özgür

BUIR Usage Stats

1
views

12
downloads

Citation Stats

Abstract

In this paper, we first employ the well known Cover-Coefficient Based Clustering Methodology (C3M) for clustering XML documents. Next, we apply index pruning techniques from the literature to reduce the size of the document vectors. Our experiments show that for certain cases, it is possible to prune up to 70% of the collection (or, more specifically, underlying document vectors) and still generate a clustering structure that yields the same quality with that of the original collection, in terms of a set of evaluation metrics. © 2010 Springer-Verlag Berlin Heidelberg.

Source Title

Focused Retrieval and Evaluation

Publisher

Springer, Berlin, Heidelberg

Keywords

Cover-coefficient based clustering, Index pruning, Clustering index, Cover-coefficient based clustering, Document vectors, Evaluation metrics, Pruning methods, Pruning techniques, Based clustering, Document vectors, Evaluation metrics, Pruning methods, Pruning techniques, Query languages, XML, Markup languages, Quality control

Permalink

http://hdl.handle.net/11693/28561

Published Version (Please cite this version)

http://dx.doi.org/10.1007/978-3-642-14556-8_37
https://doi.org/10.1007/978-3-642-14556-8

Collections

Scholarly Publications - Computer Engineering

Language

English

Type

Conference Paper

Full item page

Exploiting index pruning methods for clustering XML collections

Files

Date

Authors

Editor(s)

Advisor

Supervisor

Co-Advisor

Co-Supervisor

Instructor

BUIR Usage Stats

Citation Stats

Series

Abstract

Source Title

Publisher

Course

Other identifiers

Book Title

Keywords

Degree Discipline

Degree Level

Degree Name

Citation

Permalink

Published Version (Please cite this version)

Collections

Language

Type

Exploiting index pruning methods for clustering XML collections

Files

Date

Authors

Editor(s)

Advisor

Supervisor

Co-Advisor

Co-Supervisor

Instructor

BUIR Usage Stats

Citation Stats

Share

Series

Abstract

Source Title

Publisher

Course

Other identifiers

Book Title

Keywords

Degree Discipline

Degree Level

Degree Name

Citation

Permalink

Published Version (Please cite this version)

Collections

Language

Type