Increasing data reuse in parallel sparse matrix-vector and matrix-transpose-vector multiply on shared-memory architectures
buir.advisor | Aykanat, Cevdet | |
dc.contributor.author | Karsavuran, Mustafa Ozan | |
dc.date.accessioned | 2016-01-08T20:18:19Z | |
dc.date.available | 2016-01-08T20:18:19Z | |
dc.date.issued | 2014 | |
dc.description | Ankara : The Department of Computer Engineering and the Graduate School of Engineering and Science of Bilkent University, 2014. | en_US |
dc.description | Thesis (Master's) -- Bilkent University, 2014. | en_US |
dc.description | Includes bibliographical references leaves 44-48. | en_US |
dc.description.abstract | Sparse matrix-vector and matrix-transpose-vector multiplications (Sparse AAT x) are the kernel operations used in iterative solvers. Sparsity pattern of the input matrix A, as well as its transpose, remains the same throughout the iterations. CPU cache could not be used properly during these Sparse AAT x operations due to irregular sparsity pattern of the matrix. We propose two parallelization strategies for Sparse AAT x. Our methods partition A matrix in order to exploit cache locality for matrix nonzeros and vector entries. We conduct experiments on the recently-released Intel R Xeon PhiTM coprocessor involving large variety of sparse matrices. Experimental results show that proposed methods achieve higher performance improvement than the state-of-the-art methods in the literature. | en_US |
dc.description.provenance | Made available in DSpace on 2016-01-08T20:18:19Z (GMT). No. of bitstreams: 1 1.pdf: 78510 bytes, checksum: d85492f20c2362aa2bcf4aad49380397 (MD5) | en |
dc.description.statementofresponsibility | Karsavuran, Mustafa Ozan | en_US |
dc.embargo.release | 2016-09-05 | |
dc.format.extent | x, 48 leaves, graphics | en_US |
dc.identifier.itemid | B148325 | |
dc.identifier.uri | http://hdl.handle.net/11693/18330 | |
dc.language.iso | English | en_US |
dc.rights | info:eu-repo/semantics/openAccess | en_US |
dc.subject | Intel Many Integrated Core Architecture (Intel MIC) | en_US |
dc.subject | Intel Xeon Phi | en_US |
dc.subject | Cache Locality | en_US |
dc.subject | Sparse Matrix | en_US |
dc.subject | Sparse Matrix-Vector Multiplication | en_US |
dc.subject | Sparse Matrix-Vector and Matrix-Transpose-Vector Multiplication | en_US |
dc.subject | Hypergraph Model | en_US |
dc.subject | Hypergraph Partitioning | en_US |
dc.subject.lcc | QA76.88 .K37 2014 | en_US |
dc.subject.lcsh | Computer architecture. | en_US |
dc.subject.lcsh | High performance computing. | en_US |
dc.subject.lcsh | Distributed shared memory. | en_US |
dc.subject.lcsh | Computer programming. | en_US |
dc.title | Increasing data reuse in parallel sparse matrix-vector and matrix-transpose-vector multiply on shared-memory architectures | en_US |
dc.title.alternative | Paylaşılan bellek mimarisinde gerçekleştirilen paralel seyrek matris-vektör ve devrik-matris-vektör çarpımında veri yeniden kullanımını arttırmak | en_US |
dc.type | Thesis | en_US |
thesis.degree.discipline | Computer Engineering | |
thesis.degree.grantor | Bilkent University | |
thesis.degree.level | Master's | |
thesis.degree.name | MS (Master of Science) |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- 10049370.pdf.pdf
- Size:
- 2.01 MB
- Format:
- Adobe Portable Document Format
- Description:
- Full printable version