Hypergraph models and algorithms for data-pattern-based clustering
buir.contributor.author | Aykanat, Cevdet | |
dc.citation.epage | 57 | en_US |
dc.citation.issueNumber | 1 | en_US |
dc.citation.spage | 29 | en_US |
dc.citation.volumeNumber | 9 | en_US |
dc.contributor.author | Ozdal, M. M. | en_US |
dc.contributor.author | Aykanat, Cevdet | en_US |
dc.date.accessioned | 2016-02-08T10:26:39Z | |
dc.date.available | 2016-02-08T10:26:39Z | en_US |
dc.date.issued | 2004 | en_US |
dc.department | Department of Computer Engineering | en_US |
dc.description.abstract | In traditional approaches for clustering market basket type data, relations among transactions are modeled according to the items occurring in these transactions. However, an individual item might induce different relations in different contexts. Since such contexts might be captured by interesting patterns in the overall data, we represent each transaction as a set of patterns through modifying the conventional pattern semantics. By clustering the patterns in the dataset, we infer a clustering of the transactions represented this way. For this, we propose a novel hypergraph model to represent the relations among the patterns. Instead of a local measure that depends only on common items among patterns, we propose a global measure that is based on the cooccurences of these patterns in the overall data. The success of existing hypergraph partitioning based algorithms in other domains depends on sparsity of the hypergraph and explicit objective metrics. For this, we propose a two-phase clustering approach for the above hypergraph, which is expected to be dense. In the first phase, the vertices of the hypergraph are merged in a multilevel algorithm to obtain large number of high quality clusters. Here, we propose new quality metrics for merging decisions in hypergraph clustering specifically for this domain. In order to enable the use of existing metrics in the second phase, we introduce a vertex-to-cluster affinity concept to devise a method for constructing a sparse hypergraph based on the obtained clustering. The experiments we have performed show the effectiveness of the proposed framework. | en_US |
dc.description.provenance | Made available in DSpace on 2016-02-08T10:26:39Z (GMT). No. of bitstreams: 1 bilkent-research-paper.pdf: 70227 bytes, checksum: 26e812c6f5156f83f0e77b261a471b5a (MD5) Previous issue date: 2004 | en_US |
dc.identifier.doi | 10.1023/B:DAMI.0000026903.59233.2a | en_US |
dc.identifier.issn | 1384-5810 | en_US |
dc.identifier.issn | 1573-756X | en_US |
dc.identifier.uri | http://hdl.handle.net/11693/24268 | en_US |
dc.language.iso | English | en_US |
dc.publisher | Springer | en_US |
dc.relation.isversionof | http://dx.doi.org/10.1023/B:DAMI.0000026903.59233.2a | en_US |
dc.source.title | Data Mining and Knowledge Discovery | en_US |
dc.subject | Clustering | en_US |
dc.subject | Data Mining | en_US |
dc.subject | Data Patterns | en_US |
dc.subject | Hypergraph Models | en_US |
dc.subject | Pattern Semantics | en_US |
dc.subject | Algorithms | en_US |
dc.subject | Computer Simulation | en_US |
dc.subject | Digital Storage | en_US |
dc.subject | Graph Theory | en_US |
dc.subject | Group Technology | en_US |
dc.subject | Mathematical Models | en_US |
dc.subject | Metric System | en_US |
dc.subject | Semantics | en_US |
dc.subject | Set Theory | en_US |
dc.subject | Database Systems | en_US |
dc.title | Hypergraph models and algorithms for data-pattern-based clustering | en_US |
dc.type | Article | en_US |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- Hypergraph Models and Algorithms for Data-Pattern-Based Clustering.pdf
- Size:
- 426.81 KB
- Format:
- Adobe Portable Document Format
- Description:
- Full printable version