1.5D parallel sparse matrix-vector multiply
buir.contributor.author | Aykanat, Cevdet | |
dc.citation.epage | C46 | en_US |
dc.citation.issueNumber | 1 | en_US |
dc.citation.spage | C25 | en_US |
dc.citation.volumeNumber | 40 | en_US |
dc.contributor.author | Kayaaslan, E. | en_US |
dc.contributor.author | Aykanat, Cevdet | en_US |
dc.contributor.author | Uçar, B. | en_US |
dc.date.accessioned | 2019-01-23T12:25:29Z | |
dc.date.available | 2019-01-23T12:25:29Z | |
dc.date.issued | 2018 | en_US |
dc.department | Department of Computer Engineering | en_US |
dc.description.abstract | There are three common parallel sparse matrix-vector multiply algorithms: 1D row-parallel, 1D column-parallel, and 2D row-column-parallel. The 1D parallel algorithms offer the advantage of having only one communication phase. On the other hand, the 2D parallel algorithm is more scalable, but it suffers from two communication phases. Here, we introduce a novel concept of heterogeneous messages where a heterogeneous message may contain both input-vector entries and partially computed output-vector entries. This concept not only leads to a decreased number of messages but also enables fusing the input- and output-communication phases into a single phase. These findings are exploited to propose a 1.5D parallel sparse matrix-vector multiply algorithm which is called local row-column-parallel. This proposed algorithm requires a constrained fine-grain partitioning in which each fine-grain task is assigned to the processor that contains either its input-vector entry, its output-vector entry, or both. We propose two methods to carry out the constrained fine-grain partitioning. We conduct our experiments on a large set of test matrices to evaluate the partitioning qualities and partitioning times of these proposed 1.5D methods. | en_US |
dc.description.provenance | Submitted by Elsa Bitri (elsabitri@bilkent.edu.tr) on 2019-01-23T12:25:29Z No. of bitstreams: 1 1.5D_Parallel_Sparse_Matrix-Vector_Multiply.pdf: 6721300 bytes, checksum: 17e14f188b8acd36c236adc8517fdd9a (MD5) | en |
dc.description.provenance | Made available in DSpace on 2019-01-23T12:25:29Z (GMT). No. of bitstreams: 1 1.5D_Parallel_Sparse_Matrix-Vector_Multiply.pdf: 6721300 bytes, checksum: 17e14f188b8acd36c236adc8517fdd9a (MD5) Previous issue date: 2018 | en |
dc.identifier.doi | 10.1137/16M1105591 | en_US |
dc.identifier.eissn | 1095-7197 | en_US |
dc.identifier.issn | 1064-8275 | en_US |
dc.identifier.uri | http://hdl.handle.net/11693/48269 | en_US |
dc.language.iso | English | en_US |
dc.publisher | Society for Industrial and Applied Mathematics | en_US |
dc.relation.isversionof | https://doi.org/10.1137/16M1105591 | en_US |
dc.source.title | SIAM Journal on Scientific Computing | en_US |
dc.subject | Sparse matrix partitioning | en_US |
dc.subject | Parallel sparse matrix-vector multiplication | en_US |
dc.subject | Directed hypergraph model | en_US |
dc.subject | Bipartite vertex cover | en_US |
dc.subject | Combinatorial scientific computing | en_US |
dc.title | 1.5D parallel sparse matrix-vector multiply | en_US |
dc.type | Article | en_US |
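
The abstract's constraint (each fine-grain task is placed on the processor that holds its input-vector entry, its output-vector entry, or both) and its single fused communication phase can be illustrated with a small simulation. This is a minimal sketch under stated assumptions, not the authors' implementation: the names `satisfies_constraint`, `spmv_15d`, `task_owner`, `x_owner`, and `y_owner` are hypothetical, processors are simulated as plain integers in one process, and the 3x3 matrix and its partition are made up for illustration.

```python
from collections import defaultdict


def satisfies_constraint(nonzeros, task_owner, x_owner, y_owner):
    """True if every task a_ij sits with the owner of x_j or the owner of y_i."""
    return all(
        task_owner[(i, j)] in (x_owner[j], y_owner[i]) for i, j, _ in nonzeros
    )


def spmv_15d(nonzeros, task_owner, x_owner, y_owner, x):
    """Simulate y = A x with one communication phase (hypothetical sketch).

    A message from processor p to processor q may carry both x entries
    (needed by tasks held at q) and partial y entries (computed at p).
    """
    if not satisfies_constraint(nonzeros, task_owner, x_owner, y_owner):
        raise ValueError("a task is held by neither its x owner nor its y owner")

    y = defaultdict(float)
    messages = defaultdict(lambda: {"x": {}, "y_partial": defaultdict(float)})

    # Before communication: tasks held by the x owner are computed at once.
    for i, j, a in nonzeros:
        src, dst = x_owner[j], y_owner[i]
        if task_owner[(i, j)] == src:
            if src == dst:
                y[i] += a * x[j]  # fully local task
            else:
                messages[(src, dst)]["y_partial"][i] += a * x[j]
        else:
            # Task is held by the y owner, which needs x_j from its owner.
            messages[(src, dst)]["x"][j] = x[j]

    # After the single communication phase: finish tasks that waited for x_j.
    for i, j, a in nonzeros:
        p = task_owner[(i, j)]
        if p != x_owner[j]:
            y[i] += a * messages[(x_owner[j], p)]["x"][j]

    # Add the received partial results into the output vector.
    for msg in messages.values():
        for i, partial in msg["y_partial"].items():
            y[i] += partial

    return dict(y), messages


# Made-up 3x3 example on two simulated processors (0 and 1).
nonzeros = [(0, 0, 2.0), (0, 1, 1.0), (1, 1, 3.0),
            (1, 2, 4.0), (2, 0, 5.0), (2, 2, 6.0)]
x = [1.0, 2.0, 3.0]
x_owner = {0: 0, 1: 0, 2: 1}
y_owner = {0: 0, 1: 1, 2: 1}
task_owner = {(0, 0): 0, (0, 1): 0, (1, 1): 0,
              (1, 2): 1, (2, 0): 1, (2, 2): 1}

y, messages = spmv_15d(nonzeros, task_owner, x_owner, y_owner, x)
```

In this made-up partition, processor 0 sends processor 1 exactly one message, and that message holds both the input entry `x[0]` and a partial result for `y[1]`, which is the kind of heterogeneous message the abstract describes.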
Files
Original bundle
- Name: 1.5D_Parallel_Sparse_Matrix-Vector_Multiply.pdf
- Size: 6.41 MB
- Format: Adobe Portable Document Format
- Description: Full printable version
License bundle
- Name: license.txt
- Size: 1.71 KB
- Format: Item-specific license agreed upon to submission