Performance improvement on latency-bound parallel HPC applications by message sharing between processors
buir.advisor | Aykanat, Cevdet | |
dc.contributor.author | Duymuş, Mustafa | |
dc.date.accessioned | 2021-02-15T08:28:50Z | |
dc.date.available | 2021-02-15T08:28:50Z | |
dc.date.copyright | 2021-02 | |
dc.date.issued | 2021-02 | |
dc.date.submitted | 2021-02-10 | |
dc.description | Cataloged from PDF version of article. | en_US |
dc.description | Thesis (M.S.): Bilkent University, Department of Computer Engineering, İhsan Doğramacı Bilkent University, 2021. | en_US |
dc.description | Includes bibliographical references (leaves 45-47). | en_US |
dc.description.abstract | The performance of parallelized High Performance Computing (HPC) applications is tied to the efficiency of the underlying processor-to-processor communication. In latency-bound applications, performance is bottlenecked by the processor that sends the maximum number of messages to the other processors. To reduce the latency overhead, we propose a two-phase message-sharing-based algorithm, in which the bottleneck processor (the processor sending the maximum number of messages) is paired with another processor. In the first phase, the bottleneck processor is paired with the processor with which it has the maximum number of common outgoing messages. In the second phase, the bottleneck processor is paired with the processor that has the minimum number of outgoing messages. In both phases, the paired processors share their common outgoing messages, reducing their total number of outgoing messages and, in particular, the number of outgoing messages of the bottleneck processor. We use Sparse Matrix-Vector Multiplication as the kernel application and a 512-processor setting for the experiments. The proposed message-sharing algorithm achieves a reduction of 84% in the number of messages sent by the bottleneck processor and a reduction of 60% in the total number of messages in the system. | en_US |
dc.description.provenance | Submitted by Betül Özen (ozen@bilkent.edu.tr) on 2021-02-15T08:28:50Z No. of bitstreams: 1 10379795_.pdf: 756128 bytes, checksum: e98f60fec6f7586b5f82e47865af09b6 (MD5) | en |
dc.description.provenance | Made available in DSpace on 2021-02-15T08:28:50Z (GMT). No. of bitstreams: 1 10379795_.pdf: 756128 bytes, checksum: e98f60fec6f7586b5f82e47865af09b6 (MD5) Previous issue date: 2021-02 | en |
dc.description.statementofresponsibility | by Mustafa Duymuş | en_US |
dc.format.extent | xi, 47 leaves : charts ; 30 cm. | en_US |
dc.identifier.itemid | B150678 | |
dc.identifier.uri | http://hdl.handle.net/11693/55132 | |
dc.language.iso | English | en_US |
dc.rights | info:eu-repo/semantics/openAccess | en_US |
dc.subject | High performance computing | en_US |
dc.subject | Parallel applications | en_US |
dc.subject | MPI | en_US |
dc.subject | Store-and-forward algorithms | en_US |
dc.title | Performance improvement on latency-bound parallel HPC applications by message sharing between processors | en_US |
dc.title.alternative | Gecikim-limitli paralel uygulamalarda işlemciler arası mesaj paylaşım yöntemiyle performans iyileştirme | en_US |
dc.type | Thesis | en_US |
thesis.degree.discipline | Computer Engineering | |
thesis.degree.grantor | Bilkent University | |
thesis.degree.level | Master's | |
thesis.degree.name | MS (Master of Science) |