Partitioning models for scaling parallel sparse matrix-matrix multiplication

Akbudak, Kadir; Selvitopi, Oğuz; Aykanat, Cevdet

Partitioning models for scaling parallel sparse matrix-matrix multiplication

Files

Partitioning_models_for_scaling_parallel_sparse_matrix-matrix_multiplicatio.pdf (1.9 MB)

Date

2018

Authors

Akbudak, Kadir

Selvitopi, Oğuz

Aykanat, Cevdet

BUIR Usage Stats

3
views

74
downloads

Citation Stats

Abstract

We investigate outer-product--parallel, inner-product--parallel, and row-by-row-product--parallel formulations of sparse matrix-matrix multiplication (SpGEMM) on distributed memory architectures. For each of these three formulations, we propose a hypergraph model and a bipartite graph model for distributing SpGEMM computations based on one-dimensional (1D) partitioning of input matrices. We also propose a communication hypergraph model for each formulation for distributing communication operations. The computational graph and hypergraph models adopted in the first phase aim at minimizing the total message volume and balancing the computational loads of processors, whereas the communication hypergraph models adopted in the second phase aim at minimizing the total message count and balancing the message volume loads of processors. That is, the computational partitioning models reduce the bandwidth cost and the communication hypergraph models reduce the latency cost. Our extensive parallel experiments on up to 2048 processors for a wide range of realistic SpGEMM instances show that although the outer-product--parallel formulation scales better, the row-by-row-product--parallel formulation is more viable due to its significantly lower partitioning overhead and competitive scalability. For computational partitioning models, our experimental findings indicate that the proposed bipartite graph models are attractive alternatives to their hypergraph counterparts because of their lower partitioning overhead. Finally, we show that by reducing the latency cost besides the bandwidth cost through using the communication hypergraph models, the parallel SpGEMM time can be further improved up to 32%.

Source Title

ACM Transactions on Parallel Computing

Publisher

Association for Computing Machinery

Keywords

Sparse matrix-matrix multiplication, SpGEMM, Hypergraph partitioning, Graph partitioning, Communication cost, Bandwidth, Latency

Permalink

http://hdl.handle.net/11693/49306

Published Version (Please cite this version)

http://doi.org/10.1145/3155292

Collections

Scholarly Publications - Computer Engineering

Language

English

Type

Article

Full item page

Partitioning models for scaling parallel sparse matrix-matrix multiplication

Files

Date

Authors

Editor(s)

Advisor

Supervisor

Co-Advisor

Co-Supervisor

Instructor

BUIR Usage Stats

Citation Stats

Series

Abstract

Source Title

Publisher

Course

Other identifiers

Book Title

Keywords

Degree Discipline

Degree Level

Degree Name

Citation

Permalink

Published Version (Please cite this version)

Collections

Language

Type

Partitioning models for scaling parallel sparse matrix-matrix multiplication

Files

Date

Authors

Editor(s)

Advisor

Supervisor

Co-Advisor

Co-Supervisor

Instructor

BUIR Usage Stats

Citation Stats

Share

Series

Abstract

Source Title

Publisher

Course

Other identifiers

Book Title

Keywords

Degree Discipline

Degree Level

Degree Name

Citation

Permalink

Published Version (Please cite this version)

Collections

Language

Type