Parallel sparse matrix vector multiplication techniques for shared memory architectures

buir.advisorAykanat, Cevdet
dc.contributor.authorBaşaran, Mehmet
dc.date.accessioned2016-01-08T20:02:01Z
dc.date.available2016-01-08T20:02:01Z
dc.date.issued2014
dc.descriptionAnkara : The Department of Computer Engineering and the Graduate School of Engineering and Science of Bilkent University, 2014.en_US
dc.descriptionThesis (Master's) -- Bilkent University, 2014.en_US
dc.descriptionIncludes bibliographical references leaves 97-99.en_US
dc.description.abstractSpMxV (Sparse matrix vector multiplication) is a kernel operation in linear solvers in which a sparse matrix is multiplied with a dense vector repeatedly. Due to random memory access patterns exhibited by SpMxV operation, hardware components such as prefetchers, CPU caches, and built in SIMD units are under-utilized. Consequently, limiting parallelization efficieny. In this study we developed; • an adaptive runtime scheduling and load balancing algorithms for shared memory systems, • a hybrid storage format to help effectively vectorize sub-matrices, • an algorithm to extract proposed hybrid sub-matrix storage format. Implemented techniques are designed to be used by both hypergraph partitioning powered and spontaneous SpMxV operations. Tests are carried out on Knights Corner (KNC) coprocessor which is an x86 based many-core architecture employing NoC (network on chip) communication subsystem. However, proposed techniques can also be implemented for GPUs (graphical processing units).en_US
dc.description.provenanceMade available in DSpace on 2016-01-08T20:02:01Z (GMT). No. of bitstreams: 1 0006724.pdf: 569118 bytes, checksum: a19dc45790045107874b24e960d428ba (MD5)en
dc.description.statementofresponsibilityBaşaran, Mehmeten_US
dc.format.extentxvi, 99 leaves, graphicsen_US
dc.identifier.itemidB148387
dc.identifier.urihttp://hdl.handle.net/11693/16868
dc.language.isoEnglishen_US
dc.rightsinfo:eu-repo/semantics/openAccessen_US
dc.subjectSpMxVen_US
dc.subjectParallelizationen_US
dc.subjectKNCen_US
dc.subjectIntel Xeon Phien_US
dc.subjectMany-coreen_US
dc.subjectGPUen_US
dc.subjectVectorizationen_US
dc.subjectSIMDen_US
dc.subjectAdaptive scheduling and load balancingen_US
dc.subjectWork stealingen_US
dc.subjectDistributed Systemsen_US
dc.subjectData Localityen_US
dc.subject.lccQA188 .B37 2014en_US
dc.subject.lcshSparse matrices.en_US
dc.subject.lcshSparse matrices--Data processing.en_US
dc.titleParallel sparse matrix vector multiplication techniques for shared memory architecturesen_US
dc.typeThesisen_US
thesis.degree.disciplineComputer Engineering
thesis.degree.grantorBilkent University
thesis.degree.levelMaster's
thesis.degree.nameMS (Master of Science)

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
0006724.pdf
Size:
555.78 KB
Format:
Adobe Portable Document Format