Reducing latency cost in 2D sparse matrix partitioning
models

Selvitopi, O.; Aykanat, Cevdet

Reducing latency cost in 2D sparse matrix partitioning models

buir.contributor.author	Aykanat, Cevdet
dc.citation.epage	24	en_US
dc.citation.spage	1	en_US
dc.citation.volumeNumber	57	en_US
dc.contributor.author	Selvitopi, O.	en_US
dc.contributor.author	Aykanat, Cevdet	en_US
dc.date.accessioned	2018-04-12T10:53:30Z
dc.date.available	2018-04-12T10:53:30Z
dc.date.issued	2016	en_US
dc.department	Department of Computer Engineering	en_US
dc.description.abstract	Sparse matrix partitioning is a common technique used for improving performance of parallel linear iterative solvers. Compared to solvers used for symmetric linear systems, solvers for nonsymmetric systems offer more potential for addressing different multiple communication metrics due to the flexibility of adopting different partitions on the input and output vectors of sparse matrix-vector multiplication operations. In this regard, there exist works based on one-dimensional (1D) and two-dimensional (2D) fine-grain partitioning models that effectively address both bandwidth and latency costs in nonsymmetric solvers. In this work, we propose two new models based on 2D checkerboard and jagged partitioning. These models aim at minimizing total message count while maintaining a balance on communication volume loads of processors; hence, they address both bandwidth and latency costs. We evaluate all partitioning models on two nonsymmetric system solvers implemented using the widely adopted PETSc toolkit and conduct extensive experiments using these solvers on a modern system (a BlueGene/Q machine) successfully scaling them up to 8K processors. Along with the proposed models, we put practical aspects of eight evaluated models (two 1D- and six 2D-based) under thorough analysis. To the best of our knowledge, this is the first work that analyzes practical performance of 2D models on this scale. Among evaluated models, the models that rely on 2D jagged partitioning obtain the most promising results by striking a balance between minimizing bandwidth and latency costs.	en_US
dc.identifier.doi	10.1016/j.parco.2016.04.004	en_US
dc.identifier.issn	0167-8191
dc.identifier.uri	http://hdl.handle.net/11693/36792
dc.language.iso	English	en_US
dc.publisher	Elsevier BV	en_US
dc.relation.isversionof	http://dx.doi.org/10.1016/j.parco.2016.04.004	en_US
dc.source.title	Parallel Computing	en_US
dc.subject	Bandwidth overhead	en_US
dc.subject	Latency overhead	en_US
dc.subject	Nonsymmetric linear systems	en_US
dc.subject	Parallel iterative solvers	en_US
dc.subject	Sparse matrix partitioning	en_US
dc.subject	Sparse matrix-vector multiplication	en_US
dc.subject	Bandwidth	en_US
dc.subject	Costs	en_US
dc.subject	Iterative methods	en_US
dc.subject	Linear systems	en_US
dc.subject	Parallel processing systems	en_US
dc.subject	Bandwidth overheads	en_US
dc.subject	Latency overhead	en_US
dc.subject	Nonsymmetric linear systems	en_US
dc.subject	Parallel iterative solvers	en_US
dc.subject	Sparse matrices	en_US
dc.subject	Sparse matrix-vector multiplication	en_US
dc.subject	Matrix algebra	en_US
dc.title	Reducing latency cost in 2D sparse matrix partitioning models	en_US
dc.type	Article	en_US

Files

Original bundle

Now showing 1 - 1 of 1

Name:: Reducing latency cost in 2D sparse matrix partitioning models.pdf
Size:: 1.66 MB
Format:: Adobe Portable Document Format
Description:: Full Printable Version

Download

Collections

Department of Computer Engineering