Adaptive prefetching for shared cache based chip multiprocessors

Kandemir, M.; Zhang, Y.; Öztürk, Özcan

Adaptive prefetching for shared cache based chip multiprocessors

dc.citation.epage	778	en_US
dc.citation.spage	773	en_US
dc.contributor.author	Kandemir, M.	en_US
dc.contributor.author	Zhang, Y.	en_US
dc.contributor.author	Öztürk, Özcan	en_US
dc.coverage.spatial	Nice, France
dc.date.accessioned	2016-02-08T12:28:11Z
dc.date.available	2016-02-08T12:28:11Z
dc.date.issued	2009-04	en_US
dc.department	Department of Computer Engineering	en_US
dc.description	Date of Conference: 20-24 April, 2009
dc.description	Conference name: DATE '09 Proceedings of the Conference on Design, Automation and Test in Europe
dc.description.abstract	Chip multiprocessors (CMPs) present a unique scenario for software data prefetching with subtle tradeoffs between memory bandwidth and performance. In a shared L2 based CMP, multiple cores compete for the shared on-chip cache space and limited off-chip pin bandwidth. Purely software based prefetching techniques tend to increase this contention, leading to degradation in performance. In some cases, prefetches can become harmful by kicking out useful data from the shared cache whose next usage is earlier than the prefetched data, and the fraction of such harmful prefetches usually increases when we increase the number of cores used for executing a multi-threaded application code. In this paper, we propose two complementary techniques to address the problem of harmful prefetches in the context of shared L2 based CMPs. These techniques, namely, suppressing select data prefetches (if they are found to be harmful) and pinning select data in the L2 cache (if they are found to be frequent victim of harmful prefetches), are evaluated in this paper using two embedded application codes. Our experiments demonstrate that these two techniques are very effective in mitigating the impact of harmful prefetches, and as a result, we extract significant benefits from software prefetching even with large core counts. © 2009 EDAA.	en_US
dc.identifier.doi	10.1109/DATE.2009.5090768	en_US
dc.identifier.uri	http://hdl.handle.net/11693/28731	en_US
dc.language.iso	English	en_US
dc.publisher	IEEE	en_US
dc.relation.isversionof	https://doi.org/10.1109/DATE.2009.5090768
dc.source.title	Proceedings -Design, Automation and Test in Europe, DATE'09	en_US
dc.subject	Chip Multiprocessor	en_US
dc.subject	Embedded application	en_US
dc.subject	L2 Cache	en_US
dc.subject	Large core	en_US
dc.subject	Memory bandwidths	en_US
dc.subject	Multi-threaded application	en_US
dc.subject	Off-chip	en_US
dc.subject	On-chip cache	en_US
dc.subject	Prefetches	en_US
dc.subject	Prefetching	en_US
dc.subject	Prefetching techniques	en_US
dc.subject	Shared cache	en_US
dc.subject	Software data	en_US
dc.subject	Software-based	en_US
dc.subject	Computer software	en_US
dc.subject	Microprocessor chips	en_US
dc.subject	Systems analysis	en_US
dc.subject	Multiprocessing systems	en_US
dc.title	Adaptive prefetching for shared cache based chip multiprocessors	en_US
dc.type	Conference Paper	en_US

Files

Original bundle

Now showing 1 - 1 of 1

Name:: Adaptive prefetching for shared cache based chip multiprocessors.pdf
Size:: 282.92 KB
Format:: Adobe Portable Document Format
Description:: Full printable version

Download

Collections

Scholarly Publications - Computer Engineering