Boosting performance of directory-based cache coherence protocols with coherence bypass at subpage granularity and a novel on-chip page table

Soltaniyeh, M.; Kadayıf, I.; Öztürk, Özcan

Boosting performance of directory-based cache coherence protocols with coherence bypass at subpage granularity and a novel on-chip page table

dc.citation.epage	187	en_US
dc.citation.spage	180	en_US
dc.contributor.author	Soltaniyeh, M.	en_US
dc.contributor.author	Kadayıf, I.	en_US
dc.contributor.author	Öztürk, Özcan	en_US
dc.coverage.spatial	Como, Italy
dc.date.accessioned	2018-04-12T11:44:01Z
dc.date.available	2018-04-12T11:44:01Z
dc.date.issued	2016- 05	en_US
dc.department	Department of Computer Engineering	en_US
dc.description	Date of Conference: 16-19 May, 2016
dc.description	Conference name: CF '16 Proceedings of the ACM International Conference on Computing Frontiers
dc.description.abstract	Chip multiprocessors (CMPs) require effective cache coher-ence protocols as well as fast virtual-To-physical address trans-lation mechanisms for high performance. Directory-based cache coherence protocols are the state-of-The-Art approaches in many-core CMPs to keep the data blocks coherent at the last level private caches. However, the area overhead and high associativity requirement of the directory structures may not scale well with increasingly higher number of cores. As shown in some prior studies, a significant percentage of data blocks are accessed by only one core, therefore, it is not necessary to keep track of these in the directory struc-ture. In this study, we have two major contributions. First, we show that compared to the classification of cache blocks at page granularity as done in some previous studies, data block classification at subpage level helps to detect consid-erably more private data blocks. Consequently, it reduces the percentage of blocks required to be tracked in the di-rectory significantly compared to similar page level classification approaches. This, in turn, enables smaller directory caches with lower associativity to be used in CMPs without hurting performance, thereby helping the directory struc-ture to scale gracefully with the increasing number of cores. Memory block classification at subpage level, however, may increase the frequency of the Operating System's (OS) in-volvement in updating the maintenance bits belonging to subpages stored in page table entries, nullifying some por-tion of performance benefits of subpage level data classification. To overcome this, we propose a distributed on-chip page table as a our second contribution. © 2016 Copyright held by the owner/author(s).	en_US
dc.identifier.doi	10.1145/2903150.2903175	en_US
dc.identifier.uri	http://hdl.handle.net/11693/37563	en_US
dc.language.iso	English	en_US
dc.publisher	ACM	en_US
dc.relation.isversionof	https://doi.org/10.1145/2903150.2903175	en_US
dc.source.title	CF '16 Proceedings of the ACM International Conference on Computing Frontiers	en_US
dc.subject	Cache coherence	en_US
dc.subject	Directory cache	en_US
dc.subject	Many-core system	en_US
dc.subject	Page table	en_US
dc.subject	Virtual memory	en_US
dc.subject	Classification (of information)	en_US
dc.subject	Multiprocessing systems	en_US
dc.subject	Physical addresses	en_US
dc.subject	Virtual addresses	en_US
dc.subject	Cache Coherence	en_US
dc.subject	Directory caches	en_US
dc.subject	Many core	en_US
dc.subject	Page table	en_US
dc.subject	Virtual memory	en_US
dc.subject	Cache memory	en_US
dc.title	Boosting performance of directory-based cache coherence protocols with coherence bypass at subpage granularity and a novel on-chip page table	en_US
dc.type	Conference Paper	en_US

Files

Original bundle

Now showing 1 - 1 of 1

Name:: Boosting performance of directory-based cache coherence protocols with coherence bypass at subpage granularity and a novel on-chip page table.pdf
Size:: 1.78 MB
Format:: Adobe Portable Document Format
Description:: Full Printable Version

Download

Collections

Scholarly Publications - Computer Engineering