Altıngövde, İsmail ŞengörAtılgan, DuyguUlusoy, Özgür2016-02-082016-02-0820100302-9743http://hdl.handle.net/11693/28561Conference name: 8th International Workshop of the Initiative for the Evaluation of XML Retrieval, INEX 2009Date of Conference: December 7-9, 2009In this paper, we first employ the well known Cover-Coefficient Based Clustering Methodology (C3M) for clustering XML documents. Next, we apply index pruning techniques from the literature to reduce the size of the document vectors. Our experiments show that for certain cases, it is possible to prune up to 70% of the collection (or, more specifically, underlying document vectors) and still generate a clustering structure that yields the same quality with that of the original collection, in terms of a set of evaluation metrics. © 2010 Springer-Verlag Berlin Heidelberg.EnglishCover-coefficient based clusteringIndex pruningClustering indexCover-coefficient based clusteringDocument vectorsEvaluation metricsPruning methodsPruning techniquesBased clusteringDocument vectorsEvaluation metricsPruning methodsPruning techniquesQuery languagesXMLMarkup languagesQuality controlExploiting index pruning methods for clustering XML collectionsConference Paper10.1007/978-3-642-14556-8_3710.1007/978-3-642-14556-8