Parallel frequent item set mining with selective item replication

Özkural E.; Uçar, B.; Aykanat, Cevdet

Parallel frequent item set mining with selective item replication

Files

Parallel frequent item set mining with selective item replication.pdf (625.03 KB)

Date

2011

Authors

Özkural E.

Uçar, B.

Aykanat, Cevdet

BUIR Usage Stats

4
views

15
downloads

Citation Stats

Abstract

We introduce a transaction database distribution scheme that divides the frequent item set mining task in a top-down fashion. Our method operates on a graph where vertices correspond to frequent items and edges correspond to frequent item sets of size two. We show that partitioning this graph by a vertex separator is sufficient to decide a distribution of the items such that the subdatabases determined by the item distribution can be mined independently. This distribution entails an amount of data replication, which may be reduced by setting appropriate weights to vertices. The data distribution scheme is used in the design of two new parallel frequent item set mining algorithms. Both algorithms replicate the items that correspond to the separator. NoClique replicates the work induced by the separator and NoClique2 computes the same work collectively. Computational load balancing and minimization of redundant or collective work may be achieved by assigning appropriate load estimates to vertices. The experiments show favorable speedups on a system with small-to-medium number of processors for synthetic and real-world databases. © 2011 IEEE.

Source Title

IEEE Transactions on Parallel and Distributed Systems

Publisher

Institute of Electrical and Electronics Engineers

Keywords

Frequent item set mining, Parallel data mining, Mining methods and algorithms, Selective data replication, Graph partitioning by vertex separato

Permalink

http://hdl.handle.net/11693/21884

Published Version (Please cite this version)

http://dx.doi.org/10.1109/TPDS.2011.32

Collections

Scholarly Publications - Computer Engineering

Language

English

Type

Article

Full item page

Parallel frequent item set mining with selective item replication

Files

Date

Authors

Editor(s)

Advisor

Supervisor

Co-Advisor

Co-Supervisor

Instructor

BUIR Usage Stats

Citation Stats

Series

Abstract

Source Title

Publisher

Course

Other identifiers

Book Title

Keywords

Degree Discipline

Degree Level

Degree Name

Citation

Permalink

Published Version (Please cite this version)

Collections

Language

Type

Parallel frequent item set mining with selective item replication

Files

Date

Authors

Editor(s)

Advisor

Supervisor

Co-Advisor

Co-Supervisor

Instructor

BUIR Usage Stats

Citation Stats

Share

Series

Abstract

Source Title

Publisher

Course

Other identifiers

Book Title

Keywords

Degree Discipline

Degree Level

Degree Name

Citation

Permalink

Published Version (Please cite this version)

Collections

Language

Type