Code scheduling for optimizing parallelism and data locality
dc.citation.epage | 216 | en_US |
dc.citation.spage | 204 | en_US |
dc.contributor.author | Yemliha, T. | en_US |
dc.contributor.author | Kandemir, M. | en_US |
dc.contributor.author | Öztürk, Özcan | en_US |
dc.contributor.author | Kultursay, E. | en_US |
dc.contributor.author | Muralidhara, S. P. | en_US |
dc.coverage.spatial | Ischia, Italy | |
dc.date.accessioned | 2016-02-08T12:22:29Z | |
dc.date.available | 2016-02-08T12:22:29Z | |
dc.date.issued | 2010-08-09 | en_US |
dc.department | Department of Computer Engineering | en_US |
dc.description | Date of Conference: 31 August - 3 September, 2010 | |
dc.description | Conference name: Euro-Par: 16th European Conference on Parallel Processing - International Euro-Par Conference | |
dc.description.abstract | As chip multiprocessors proliferate, programming support for these devices is likely to receive a lot of attention in the near future. Parallelism and data locality are two critical issues in a chip multiprocessor environment. Unfortunately, most of the published work in the literature focuses only on one of these problems, and this can prevent one from achieving the best possible performance. The main goal of this paper is to propose and evaluate a compiler-directed code parallelization scheme, which considers both parallelism and data locality at the same time. Our compiler captures the inherent parallelism and data reuse in the application code being analyzed using a novel representation called the locality-parallelism graph (LPG). Our partitioning/scheduling algorithm assigns the nodes of this graph to the processors in the architecture and schedules them for execution. We implemented this algorithm and evaluated its effectiveness using a set of benchmark codes. The results collected so far indicate that our approach improves overall execution latency significantly. In this paper, we also introduce an ILP (Integer Linear Programming) based formulation of the problem, and implement the schedule obtained by the ILP solver. The results indicate that our approach gets within 4% of the ILP solution. © 2010 Springer-Verlag. | en_US |
dc.description.provenance | Made available in DSpace on 2016-02-08T12:22:29Z (GMT). No. of bitstreams: 1 bilkent-research-paper.pdf: 70227 bytes, checksum: 26e812c6f5156f83f0e77b261a471b5a (MD5) Previous issue date: 2010 | en |
dc.identifier.doi | 10.1007/978-3-642-15277-1_20 | en_US |
dc.identifier.issn | 0302-9743 | en_US |
dc.identifier.uri | http://hdl.handle.net/11693/28506 | en_US |
dc.language.iso | English | en_US |
dc.publisher | Springer | en_US |
dc.relation.isversionof | https://doi.org/10.1007/978-3-642-15277-1_20 | en_US |
dc.source.title | Euro-Par 2010 - Parallel Processing -16th International Euro-Par Conference | en_US |
dc.subject | Application codes | en_US |
dc.subject | Benchmark codes | en_US |
dc.subject | Chip multiprocessor | en_US |
dc.subject | Code scheduling | en_US |
dc.subject | Critical issues | en_US |
dc.subject | Data locality | en_US |
dc.subject | Data reuse | en_US |
dc.subject | Inherent parallelism | en_US |
dc.subject | Integer linear programming | en_US |
dc.subject | Overall execution | en_US |
dc.subject | Parallelizations | en_US |
dc.subject | Programming support | en_US |
dc.subject | Computer architecture | en_US |
dc.subject | Integer programming | en_US |
dc.subject | Microprocessor chips | en_US |
dc.subject | Optimization | en_US |
dc.subject | Program compilers | en_US |
dc.subject | Systems analysis | en_US |
dc.subject | Multiprocessing systems | en_US |
dc.title | Code scheduling for optimizing parallelism and data locality | en_US |
dc.type | Conference Paper | en_US |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- Code scheduling for optimizing parallelism and data locality.pdf
- Size:
- 279.9 KB
- Format:
- Adobe Portable Document Format
- Description:
- Full printable version