Code scheduling for optimizing parallelism and data locality

Yemliha, T.; Kandemir, M.; Öztürk, Özcan; Kultursay, E.; Muralidhara, S. P.

Code scheduling for optimizing parallelism and data locality

dc.citation.epage	216	en_US
dc.citation.spage	204	en_US
dc.contributor.author	Yemliha, T.	en_US
dc.contributor.author	Kandemir, M.	en_US
dc.contributor.author	Öztürk, Özcan	en_US
dc.contributor.author	Kultursay, E.	en_US
dc.contributor.author	Muralidhara, S. P.	en_US
dc.coverage.spatial	Ischia, Italy
dc.date.accessioned	2016-02-08T12:22:29Z
dc.date.available	2016-02-08T12:22:29Z
dc.date.issued	2010-08-09	en_US
dc.department	Department of Computer Engineering	en_US
dc.description	Date of Conference: 31 August - 3 September, 2010
dc.description	Conference name: Euro-Par: 16th European Conference on Parallel Processing - International Euro-Par Conference
dc.description.abstract	As chip multiprocessors proliferate, programming support for these devices is likely to receive a lot of attention in the near future. Parallelism and data locality are two critical issues in a chip multiprocessor environment. Unfortunately, most of the published work in the literature focuses only on one of these problems, and this can prevent one from achieving the best possible performance. The main goal of this paper is to propose and evaluate a compiler-directed code parallelization scheme, which considers both parallelism and data locality at the same time. Our compiler captures the inherent parallelism and data reuse in the application code being analyzed using a novel representation called the locality-parallelism graph (LPG). Our partitioning/scheduling algorithm assigns the nodes of this graph to the processors in the architecture and schedules them for execution. We implemented this algorithm and evaluated its effectiveness using a set of benchmark codes. The results collected so far indicate that our approach improves overall execution latency significantly. In this paper, we also introduce an ILP (Integer Linear Programming) based formulation of the problem, and implement the schedule obtained by the ILP solver. The results indicate that our approach gets within 4% of the ILP solution. © 2010 Springer-Verlag.	en_US
dc.identifier.doi	10.1007/978-3-642-15277-1_20	en_US
dc.identifier.issn	0302-9743	en_US
dc.identifier.uri	http://hdl.handle.net/11693/28506	en_US
dc.language.iso	English	en_US
dc.publisher	Springer	en_US
dc.relation.isversionof	https://doi.org/10.1007/978-3-642-15277-1_20	en_US
dc.source.title	Euro-Par 2010 - Parallel Processing -16th International Euro-Par Conference	en_US
dc.subject	Application codes	en_US
dc.subject	Benchmark codes	en_US
dc.subject	Chip multiprocessor	en_US
dc.subject	Code scheduling	en_US
dc.subject	Critical issues	en_US
dc.subject	Data locality	en_US
dc.subject	Data reuse	en_US
dc.subject	Inherent parallelism	en_US
dc.subject	Integer linear programming	en_US
dc.subject	Overall execution	en_US
dc.subject	Parallelizations	en_US
dc.subject	Programming support	en_US
dc.subject	Computer architecture	en_US
dc.subject	Integer programming	en_US
dc.subject	Microprocessor chips	en_US
dc.subject	Optimization	en_US
dc.subject	Program compilers	en_US
dc.subject	Systems analysis	en_US
dc.subject	Multiprocessing systems	en_US
dc.title	Code scheduling for optimizing parallelism and data locality	en_US
dc.type	Conference Paper	en_US

Files

Original bundle

Now showing 1 - 1 of 1

Name:: Code scheduling for optimizing parallelism and data locality.pdf
Size:: 279.9 KB
Format:: Adobe Portable Document Format
Description:: Full printable version

Download

Collections

Scholarly Publications - Computer Engineering