Exploiting index pruning methods for clustering XML collections

dc.citation.epage: 386 [en_US]
dc.citation.spage: 379 [en_US]
dc.citation.volumeNumber: 6203 [en_US]
dc.contributor.author: Altıngövde, İsmail Şengör [en_US]
dc.contributor.author: Atılgan, Duygu [en_US]
dc.contributor.author: Ulusoy, Özgür [en_US]
dc.coverage.spatial: Brisbane, Australia [en_US]
dc.date.accessioned: 2016-02-08T12:23:53Z
dc.date.available: 2016-02-08T12:23:53Z
dc.date.issued: 2010 [en_US]
dc.department: Department of Computer Engineering [en_US]
dc.description: Conference name: 8th International Workshop of the Initiative for the Evaluation of XML Retrieval, INEX 2009 [en_US]
dc.description: Date of Conference: December 7-9, 2009 [en_US]
dc.description.abstract: In this paper, we first employ the well-known Cover-Coefficient Based Clustering Methodology (C3M) for clustering XML documents. Next, we apply index pruning techniques from the literature to reduce the size of the document vectors. Our experiments show that for certain cases, it is possible to prune up to 70% of the collection (or, more specifically, underlying document vectors) and still generate a clustering structure that yields the same quality as that of the original collection, in terms of a set of evaluation metrics. © 2010 Springer-Verlag Berlin Heidelberg. [en_US]
dc.description.provenance: Made available in DSpace on 2016-02-08T12:23:53Z (GMT). No. of bitstreams: 1 bilkent-research-paper.pdf: 70227 bytes, checksum: 26e812c6f5156f83f0e77b261a471b5a (MD5) Previous issue date: 2010 [en]
dc.identifier.doi: 10.1007/978-3-642-14556-8_37 [en_US]
dc.identifier.doi: 10.1007/978-3-642-14556-8 [en_US]
dc.identifier.issn: 0302-9743 [en_US]
dc.identifier.uri: http://hdl.handle.net/11693/28561 [en_US]
dc.language.iso: English [en_US]
dc.publisher: Springer, Berlin, Heidelberg [en_US]
dc.relation.isversionof: http://dx.doi.org/10.1007/978-3-642-14556-8_37 [en_US]
dc.relation.isversionof: https://doi.org/10.1007/978-3-642-14556-8 [en_US]
dc.source.title: Focused Retrieval and Evaluation [en_US]
dc.subject: Cover-coefficient based clustering [en_US]
dc.subject: Index pruning [en_US]
dc.subject: Clustering index [en_US]
dc.subject: Document vectors [en_US]
dc.subject: Evaluation metrics [en_US]
dc.subject: Pruning methods [en_US]
dc.subject: Pruning techniques [en_US]
dc.subject: Based clustering [en_US]
dc.subject: Query languages [en_US]
dc.subject: XML [en_US]
dc.subject: Markup languages [en_US]
dc.subject: Quality control [en_US]
dc.title: Exploiting index pruning methods for clustering XML collections [en_US]
dc.type: Conference Paper [en_US]

Files

Original bundle

Name: Exploiting index pruning methods for clustering XML collections.pdf
Size: 212.24 KB
Format: Adobe Portable Document Format
Description: Full printable version