A novel method for scaling iterative solvers: avoiding latency overhead of parallel sparse-matrix vector multiplies

buir.contributor.author: Aykanat, Cevdet
dc.citation.epage: 645
dc.citation.issueNumber: 3
dc.citation.spage: 632
dc.citation.volumeNumber: 26
dc.contributor.author: Selvitopi, R. O.
dc.contributor.author: Ozdal, M. M.
dc.contributor.author: Aykanat, Cevdet
dc.date.accessioned: 2016-02-08T09:59:04Z
dc.date.available: 2016-02-08T09:59:04Z
dc.date.issued: 2015
dc.department: Department of Computer Engineering
dc.description.abstract: In parallel linear iterative solvers, sparse matrix-vector multiplication (SpMxV) incurs irregular point-to-point (P2P) communications, whereas inner-product computations incur regular collective communications. The P2P communications introduce an additional synchronization point with relatively high message latency costs due to small message sizes. In these solvers, each SpMxV is usually followed by an inner-product computation that involves the output vector of the SpMxV. We exploit this property to propose a novel parallelization method that avoids the latency costs and synchronization overhead of P2P communications. Our method consists of a computational and a communication rearrangement scheme. The computational rearrangement provides an alternative way of forming the input vector of SpMxV and allows P2P and collective communications to be performed in a single phase. The communication rearrangement realizes this opportunity by embedding the P2P communications into global collective communication operations. The proposed method guarantees an upper bound on the maximum number of messages communicated, regardless of the sparsity pattern of the matrix. The downside is increased message volume and a negligible amount of redundant computation; we favor reducing message latency costs at the expense of message volume. Nevertheless, we propose two iterative-improvement-based heuristics that alleviate the increase in volume through one-to-one task-to-processor mapping. Our experiments on two supercomputers, a Cray XE6 and an IBM BlueGene/Q, on up to 2,048 processors show that the proposed parallelization method exhibits superior scalability compared to the conventional parallelization method.
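To make the fused communication phase described in the abstract concrete, below is a minimal MPI sketch in C: each process packs its partial inner-product operand together with the vector entry it would otherwise send point-to-point, so that a single collective carries both. The toy per-process sizes, the variable names, and the use of MPI_Allgather in place of the paper's actual embedding scheme are illustrative assumptions, not the authors' implementation.

/* Hypothetical sketch of embedding P2P data into a collective,
 * in the spirit of the paper's communication rearrangement.
 * Sizes and names are illustrative, not the authors' code. */
#include <mpi.h>
#include <stdio.h>
#include <stdlib.h>

int main(int argc, char **argv) {
    MPI_Init(&argc, &argv);
    int rank, nprocs;
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    MPI_Comm_size(MPI_COMM_WORLD, &nprocs);

    /* Each process owns one x-vector entry and one partial inner
     * product (toy sizes; a real solver would own a block). */
    double x_local = (double)(rank + 1);   /* local piece of x     */
    double partial = x_local * x_local;    /* local piece of <x,x> */

    /* Conventional scheme: two synchronization points, i.e. an
     * irregular P2P exchange of x entries for the SpMxV, then an
     * MPI_Allreduce for the inner product. Rearranged scheme
     * (below): pack the inner-product operand together with the
     * x entry and use ONE collective, trading extra volume for
     * one fewer latency/synchronization point. */
    double sendbuf[2] = { partial, x_local };
    double *recvbuf = malloc(2 * nprocs * sizeof(double));
    MPI_Allgather(sendbuf, 2, MPI_DOUBLE,
                  recvbuf, 2, MPI_DOUBLE, MPI_COMM_WORLD);

    /* Unpack: sum the embedded partials to finish the reduction;
     * recvbuf[2*p + 1] holds process p's x entry for the SpMxV. */
    double inner = 0.0;
    for (int p = 0; p < nprocs; ++p)
        inner += recvbuf[2 * p];

    if (rank == 0)
        printf("<x,x> = %g (one collective, no separate P2P)\n", inner);

    free(recvbuf);
    MPI_Finalize();
    return 0;
}

The single collective replaces two synchronization points (the irregular P2P exchange and the inner-product reduction) with one, which is exactly the latency-for-volume trade the abstract describes.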
dc.identifier.doi: 10.1109/TPDS.2014.2311804
dc.identifier.issn: 1045-9219
dc.identifier.uri: http://hdl.handle.net/11693/22358
dc.language.iso: English
dc.publisher: Institute of Electrical and Electronics Engineers
dc.relation.isversionof: http://dx.doi.org/10.1109/TPDS.2014.2311804
dc.source.title: IEEE Transactions on Parallel and Distributed Systems
dc.subject: Avoiding latency
dc.subject: Conjugate gradient
dc.subject: Inner product computation
dc.subject: Iterative improvement heuristic
dc.subject: Message latency overhead
dc.subject: Conjugate gradient method
dc.subject: Costs
dc.subject: Matrix algebra
dc.subject: Parallel processing systems
dc.subject: Supercomputers
dc.subject: Vectors
dc.subject: Collective communications
dc.subject: Hiding latency
dc.subject: Inner product
dc.subject: Iterative improvements
dc.subject: Iterative solvers
dc.subject: Message latency
dc.subject: Point-to-point communication
dc.subject: Sparse matrix-vector multiplication
dc.subject: Iterative methods
dc.title: A novel method for scaling iterative solvers: avoiding latency overhead of parallel sparse-matrix vector multiplies
dc.type: Article

Files

Original bundle

Name: A novel method for scaling iterative solvers Avoiding latency overhead of parallel sparse-matrix vector multiplies.pdf
Size: 1.73 MB
Format: Adobe Portable Document Format
Description: Full printable version