Selective replicated declustering for arbitrary queries

Oktay, K. Yasin; Türk, Ata; Aykanat, Cevdet

Selective replicated declustering for arbitrary queries

buir.contributor.author	Aykanat, Cevdet
dc.citation.epage	386	en_US
dc.citation.spage	375	en_US
dc.contributor.author	Oktay, K. Yasin	en_US
dc.contributor.author	Türk, Ata	en_US
dc.contributor.author	Aykanat, Cevdet	en_US
dc.coverage.spatial	Delft, The Netherlands
dc.date.accessioned	2016-02-08T12:27:25Z
dc.date.available	2016-02-08T12:27:25Z
dc.date.issued	2009-08	en_US
dc.department	Department of Computer Engineering	en_US
dc.description	Date of Conference: 25-28 August, 2009
dc.description	Conference name: European Conference on Parallel Processing. Euro-Par 2009: Euro-Par 2009 Parallel Processing
dc.description.abstract	Data declustering is used to minimize query response times in data intensive applications. In this technique, query retrieval process is parallelized by distributing the data among several disks and it is useful in applications such as geographic information systems that access huge amounts of data. Declustering with replication is an extension of declustering with possible data replicas in the system. Many replicated declustering schemes have been proposed. Most of these schemes generate two or more copies of all data items. However, some applications have very large data sizes and even having two copies of all data items may not be feasible. In such systems selective replication is a necessity. Furthermore, existing replication schemes are not designed to utilize query distribution information if such information is available. In this study we propose a replicated declustering scheme that decides both on the data items to be replicated and the assignment of all data items to disks when there is limited replication capacity. We make use of available query information in order to decide replication and partitioning of the data and try to optimize aggregate parallel response time. We propose and implement a Fiduccia-Mattheyses-like iterative improvement algorithm to obtain a two-way replicated declustering and use this algorithm in a recursive framework to generate a multi-way replicated declustering. Experiments conducted with arbitrary queries on real datasets show that, especially for low replication constraints, the proposed scheme yields better performance results compared to existing replicated declustering schemes. © 2009 Springer.	en_US
dc.identifier.doi	10.1007/978-3-642-03869-3_37	en_US
dc.identifier.uri	http://hdl.handle.net/11693/28697	en_US
dc.language.iso	English	en_US
dc.publisher	Springer	en_US
dc.relation.isversionof	http://dx.doi.org/10.1007/978-3-642-03869-3_37	en_US
dc.source.title	European Conference on Parallel Processing. Euro-Par 2009: Euro-Par 2009 Parallel Processing	en_US
dc.subject	Data declustering	en_US
dc.subject	Data items	en_US
dc.subject	Data replica	en_US
dc.subject	Data-intensive application	en_US
dc.subject	Declustering	en_US
dc.subject	Declustering scheme	en_US
dc.subject	Iterative improvements	en_US
dc.subject	Query distributions	en_US
dc.subject	Query information	en_US
dc.subject	Query response	en_US
dc.subject	Query retrieval	en_US
dc.subject	Real data sets	en_US
dc.subject	Response time	en_US
dc.subject	Selective replication	en_US
dc.subject	Very large datum	en_US
dc.subject	Artificial intelligence	en_US
dc.subject	Bioinformatics	en_US
dc.subject	Disks (machine components)	en_US
dc.subject	Disks (structural components)	en_US
dc.subject	Distributed computer systems	en_US
dc.subject	Geographic information systems	en_US
dc.subject	Response time (computer systems)	en_US
dc.title	Selective replicated declustering for arbitrary queries	en_US
dc.type	Conference Paper	en_US

Files

Original bundle

Now showing 1 - 1 of 1

Name:: Selective replicated declustering for arbitrary queries.pdf
Size:: 411.74 KB
Format:: Adobe Portable Document Format
Description:: Full printable version

Download

Collections

Scholarly Publications - Computer Engineering