Increasing data reuse in parallel sparse matrix-vector and matrix-transpose-vector multiply on shared-memory architectures

Karsavuran, Mustafa Ozan

Increasing data reuse in parallel sparse matrix-vector and matrix-transpose-vector multiply on shared-memory architectures

buir.advisor	Aykanat, Cevdet
dc.contributor.author	Karsavuran, Mustafa Ozan
dc.date.accessioned	2016-01-08T20:18:19Z
dc.date.available	2016-01-08T20:18:19Z
dc.date.issued	2014
dc.description	Cataloged from PDF version of article.	en_US
dc.description	Includes bibliographical references leaves 44-48.	en_US
dc.description.abstract	Sparse matrix-vector and matrix-transpose-vector multiplications (Sparse AAT x) are the kernel operations used in iterative solvers. Sparsity pattern of the input matrix A, as well as its transpose, remains the same throughout the iterations. CPU cache could not be used properly during these Sparse AAT x operations due to irregular sparsity pattern of the matrix. We propose two parallelization strategies for Sparse AAT x. Our methods partition A matrix in order to exploit cache locality for matrix nonzeros and vector entries. We conduct experiments on the recently-released Intel R Xeon PhiTM coprocessor involving large variety of sparse matrices. Experimental results show that proposed methods achieve higher performance improvement than the state-of-the-art methods in the literature.	en_US
dc.description.statementofresponsibility	Karsavuran, Mustafa Ozan	en_US
dc.embargo.release	2016-09-05
dc.format.extent	x, 48 leaves, graphics	en_US
dc.identifier.itemid	B148325
dc.identifier.uri	http://hdl.handle.net/11693/18330
dc.language.iso	English	en_US
dc.rights	info:eu-repo/semantics/openAccess	en_US
dc.subject	Intel Many Integrated Core Architecture (Intel MIC)	en_US
dc.subject	Intel Xeon Phi	en_US
dc.subject	Cache Locality	en_US
dc.subject	Sparse Matrix	en_US
dc.subject	Sparse Matrix-Vector Multiplication	en_US
dc.subject	Sparse Matrix-Vector and Matrix-Transpose-Vector Multiplication	en_US
dc.subject	Hypergraph Model	en_US
dc.subject	Hypergraph Partitioning	en_US
dc.subject.lcc	QA76.88 .K37 2014	en_US
dc.subject.lcsh	Computer architecture.	en_US
dc.subject.lcsh	High performance computing.	en_US
dc.subject.lcsh	Distributed shared memory.	en_US
dc.subject.lcsh	Computer programming.	en_US
dc.title	Increasing data reuse in parallel sparse matrix-vector and matrix-transpose-vector multiply on shared-memory architectures	en_US
dc.title.alternative	Paylaşılan bellek mimarisinde gerçekleştirilen paralel seyrek matris-vektör ve devrik-matris-vektör çarpımında veri yeniden kullanımını arttırmak	en_US
dc.type	Thesis	en_US
thesis.degree.discipline	Computer Engineering
thesis.degree.grantor	Bilkent University
thesis.degree.level	Master's
thesis.degree.name	MS (Master of Science)

Files

Original bundle

Now showing 1 - 1 of 1

Name:: 10049370.pdf.pdf
Size:: 2.01 MB
Format:: Adobe Portable Document Format
Description:: Full printable version

Download

Collections

Graduate School of Engineering and Science