Adaptive prefetching for shared cache based chip multiprocessors

Kandemir, M.; Zhang, Y.; Öztürk, Özcan

Files

Adaptive prefetching for shared cache based chip multiprocessors.pdf (282.92 KB)

Date

2009-04

Authors

Kandemir, M.

Zhang, Y.

Öztürk, Özcan

Abstract

Chip multiprocessors (CMPs) present a unique scenario for software data prefetching, with subtle tradeoffs between memory bandwidth and performance. In a shared-L2-based CMP, multiple cores compete for the shared on-chip cache space and the limited off-chip pin bandwidth. Purely software-based prefetching techniques tend to increase this contention, degrading performance. In some cases, a prefetch becomes harmful by evicting from the shared cache useful data whose next use comes earlier than that of the prefetched data, and the fraction of such harmful prefetches usually increases with the number of cores used to execute a multi-threaded application. In this paper, we propose two complementary techniques to address the problem of harmful prefetches in shared-L2-based CMPs: suppressing select data prefetches (if they are found to be harmful) and pinning select data in the L2 cache (if they are found to be frequent victims of harmful prefetches). Both techniques are evaluated using two embedded application codes. Our experiments demonstrate that these two techniques are very effective in mitigating the impact of harmful prefetches and, as a result, we extract significant benefits from software prefetching even with large core counts. © 2009 EDAA.
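
The abstract's definition of a harmful prefetch, and the two responses it names, can be illustrated with a minimal sketch. This is not the paper's implementation, which this record does not describe. The event format, the names `PrefetchEvent`, `is_harmful`, and `plan_actions`, and both thresholds are assumptions made for illustration only.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass(frozen=True)
class PrefetchEvent:
    """One prefetch observed in a (hypothetical) shared-L2 access trace."""
    prefetched: str           # block brought into the shared L2 by the prefetch
    victim: str               # block evicted to make room for it
    prefetched_next_use: int  # time of the next reference to the prefetched block
    victim_next_use: int      # time of the next reference to the evicted block


def is_harmful(event: PrefetchEvent) -> bool:
    # Harmful in the abstract's sense: the evicted data is needed
    # earlier than the data the prefetch brought in.
    return event.victim_next_use < event.prefetched_next_use


def plan_actions(events, suppress_threshold=2, pin_threshold=2):
    """Return (blocks whose prefetches to suppress, blocks to pin in L2).

    The thresholds are illustrative assumptions, not values from the paper.
    """
    harmful = [e for e in events if is_harmful(e)]
    harmful_by_source = Counter(e.prefetched for e in harmful)
    harmful_by_victim = Counter(e.victim for e in harmful)
    # Suppress prefetches that were repeatedly found to be harmful.
    suppress = {b for b, n in harmful_by_source.items() if n >= suppress_threshold}
    # Pin blocks that were frequent victims of harmful prefetches.
    pin = {b for b, n in harmful_by_victim.items() if n >= pin_threshold}
    return suppress, pin
```

Counting harmful events per prefetched block and per victim block keeps the two decisions separate, which mirrors the abstract's description of the techniques as complementary.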

Source Title

Proceedings - Design, Automation and Test in Europe, DATE'09

Publisher

IEEE

Keywords

Chip Multiprocessor, Embedded application, L2 Cache, Large core, Memory bandwidths, Multi-threaded application, Off-chip, On-chip cache, Prefetches, Prefetching, Prefetching techniques, Shared cache, Software data, Software-based, Computer software, Microprocessor chips, Systems analysis, Multiprocessing systems

Permalink

http://hdl.handle.net/11693/28731

Published Version (Please cite this version)

https://doi.org/10.1109/DATE.2009.5090768

Collections

Scholarly Publications - Computer Engineering

Language

English

Type

Conference Paper
