Process variation aware thread mapping for chip multiprocessors
dc.citation.epage | 826 | en_US |
dc.citation.spage | 821 | en_US |
dc.contributor.author | Hong, S. | en_US |
dc.contributor.author | Narayanan, S. H. K. | en_US |
dc.contributor.author | Kandemir, M. | en_US |
dc.contributor.author | Özturk, Özcan | en_US |
dc.coverage.spatial | Nice, France | |
dc.date.accessioned | 2016-02-08T12:28:08Z | |
dc.date.available | 2016-02-08T12:28:08Z | |
dc.date.issued | 2009-04 | en_US |
dc.department | Department of Computer Engineering | en_US |
dc.description | Date of Conference: 20-24 April, 2009 | |
dc.description | Conference name: DATE '09 Proceedings of the Conference on Design, Automation and Test in Europe | |
dc.description.abstract | With the increasing scaling of manufacturing technology, process variation is a phenomenon that has become more prevalent. As a result, in the context of Chip Multiprocessors (CMPs) for example, it is possible that identically-designed processor cores on the chip have non-identical peak frequencies and power consumptions. To cope with such a design, each processor can be assumed to run at the frequency of the slowest processor, resulting in wasted computational capability. This paper considers an alternate approach and proposes an algorithm that intelligently maps (and remaps) computations onto available processors so that each processor runs at its peak frequency. In other words, by dynamically changing the thread-to-processor mapping at runtime, our approach allows each processor to maximize its performance, rather than simply using chip-wide lowest frequency amongst all cores and highest cache latency. Experimental evidence shows that, as compared to a process variation agnostic thread mapping strategy, our proposed scheme achieves as much as 29% improvement in overall execution latency, average improvement being 13% over the benchmarks tested. We also demonstrate in this paper that our savings are consistent across different processor counts, latency maps, and latency distributions.With the increasing scaling of manufacturing technology, process variation is a phenomenon that has become more prevalent. As a result, in the context of Chip Multiprocessors (CMPs) for example, it is possible that identically-designed processor cores on the chip have non-identical peak frequencies and power consumptions. To cope with such a design, each processor can be assumed to run at the frequency of the slowest processor, resulting in wasted computational capability. This paper considers an alternate approach and proposes an algorithm that intelligently maps (and remaps) computations onto available processors so that each processor runs at its peak frequency. In other words, by dynamically changing the thread-to-processor mapping at runtime, our approach allows each processor to maximize its performance, rather than simply using chip-wide lowest frequency amongst all cores and highest cache latency. Experimental evidence shows that, as compared to a process variation agnostic thread mapping strategy, our proposed scheme achieves as much as 29% improvement in overall execution latency, average improvement being 13% over the benchmarks tested. We also demonstrate in this paper that our savings are consistent across different processor counts, latency maps, and latency distributions. © 2009 EDAA. | en_US |
dc.description.provenance | Made available in DSpace on 2016-02-08T12:28:08Z (GMT). No. of bitstreams: 1 bilkent-research-paper.pdf: 70227 bytes, checksum: 26e812c6f5156f83f0e77b261a471b5a (MD5) Previous issue date: 2009 | en |
dc.identifier.doi | 10.1109/DATE.2009.5090776 | en_US |
dc.identifier.uri | http://hdl.handle.net/11693/28728 | en_US |
dc.language.iso | English | en_US |
dc.publisher | IEEE | en_US |
dc.relation.isversionof | https://doi.org/10.1109/DATE.2009.5090776 | |
dc.source.title | DATE '09 Proceedings of the Conference on Design, Automation and Test in Europe | en_US |
dc.subject | Cache latency | en_US |
dc.subject | Chip Multiprocessor | en_US |
dc.subject | Computational capability | en_US |
dc.subject | Experimental evidence | en_US |
dc.subject | Manufacturing technologies | en_US |
dc.subject | Mapping strategy | en_US |
dc.subject | Overall execution | en_US |
dc.subject | Peak frequencies | en_US |
dc.subject | Process variation | en_US |
dc.subject | Processor cores | en_US |
dc.subject | Runtimes | en_US |
dc.subject | Design | en_US |
dc.subject | Electric power utilization | en_US |
dc.subject | Microprocessor chips | en_US |
dc.subject | Multiprocessing systems | en_US |
dc.subject | Systems analysis | en_US |
dc.subject | Mapping | en_US |
dc.title | Process variation aware thread mapping for chip multiprocessors | en_US |
dc.type | Conference Paper | en_US |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- Process_variation_aware_thread_mapping_for_Chip_Multiprocessors.pdf
- Size:
- 567.39 KB
- Format:
- Adobe Portable Document Format
- Description: