Fast optimal load balancing algorithms for 1D partitioning
buir.contributor.author | Aykanat, Cevdet | |
dc.citation.epage | 996 | en_US |
dc.citation.issueNumber | 8 | en_US |
dc.citation.spage | 974 | en_US |
dc.citation.volumeNumber | 64 | en_US |
dc.contributor.author | Pınar, A. | en_US |
dc.contributor.author | Aykanat, Cevdet | en_US |
dc.date.accessioned | 2016-02-08T10:26:22Z | |
dc.date.available | 2016-02-08T10:26:22Z | en_US |
dc.date.issued | 2004 | en_US |
dc.department | Department of Computer Engineering | en_US |
dc.description.abstract | The one-dimensional decomposition of nonuniform workload arrays with optimal load balancing is investigated. The problem has been studied in the literature as the "chains-on-chains partitioning" problem. Despite the rich literature on exact algorithms, heuristics are still used in parallel computing community with the "hope" of good decompositions and the "myth" of exact algorithms being hard to implement and not runtime efficient. We show that exact algorithms yield significant improvements in load balance over heuristics with negligible overhead. Detailed pseudocodes of the proposed algorithms are provided for reproducibility. We start with a literature review and propose improvements and efficient implementation tips for these algorithms. We also introduce novel algorithms that are asymptotically and runtime efficient. Our experiments on sparse matrix and direct volume rendering datasets verify that balance can be significantly improved by using exact algorithms. The proposed exact algorithms are 100 times faster than a single sparse-matrix vector multiplication for 64-way decompositions on the average. We conclude that exact algorithms with proposed efficient implementations can effectively replace heuristics. © 2004 Elsevier Inc. All rights reserved. | en_US |
dc.description.provenance | Made available in DSpace on 2016-02-08T10:26:22Z (GMT). No. of bitstreams: 1 bilkent-research-paper.pdf: 70227 bytes, checksum: 26e812c6f5156f83f0e77b261a471b5a (MD5) Previous issue date: 2004 | en_US |
dc.identifier.doi | 10.1016/j.jpdc.2004.05.003 | en_US |
dc.identifier.issn | 0743-7315 | |
dc.identifier.issn | 1096-0848 | |
dc.identifier.uri | http://hdl.handle.net/11693/24250 | en_US |
dc.language.iso | English | en_US |
dc.publisher | Academic Press | en_US |
dc.relation.isversionof | http://dx.doi.org/10.1016/j.jpdc.2004.05.003 | en_US |
dc.source.title | Journal of Parallel and Distributed Computing | en_US |
dc.subject | Chains-on-chains partitioning | en_US |
dc.subject | Dynamic programming | en_US |
dc.subject | Image-space parallel volume rendering | en_US |
dc.subject | Iterative refinement | en_US |
dc.subject | One-dimensional partitioning | en_US |
dc.subject | Optimal load balancing | en_US |
dc.subject | Parallel sparse matrix vector multiplication | en_US |
dc.subject | Parametric search | en_US |
dc.subject | Algorithms | en_US |
dc.subject | Dynamic programming | en_US |
dc.subject | Graph theory | en_US |
dc.subject | Heuristic methods | en_US |
dc.subject | Matrix algebra | en_US |
dc.subject | Vectors | en_US |
dc.subject | One-dimensional partitioning | en_US |
dc.subject | Optimal load balancing | en_US |
dc.subject | Parallel sparse matrix vector multiplication | en_US |
dc.subject | Parallel processing systems | en_US |
dc.title | Fast optimal load balancing algorithms for 1D partitioning | en_US |
dc.type | Article | en_US |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- Fast optimal load balancing algorithms for 1D partitioning.pdf
- Size:
- 564.22 KB
- Format:
- Adobe Portable Document Format
- Description:
- Full printable version