Selective replicated declustering for arbitrary queries
buir.contributor.author | Aykanat, Cevdet | |
dc.citation.epage | 386 | en_US |
dc.citation.spage | 375 | en_US |
dc.contributor.author | Oktay, K. Yasin | en_US |
dc.contributor.author | Türk, Ata | en_US |
dc.contributor.author | Aykanat, Cevdet | en_US |
dc.coverage.spatial | Delft, The Netherlands | |
dc.date.accessioned | 2016-02-08T12:27:25Z | |
dc.date.available | 2016-02-08T12:27:25Z | |
dc.date.issued | 2009-08 | en_US |
dc.department | Department of Computer Engineering | en_US |
dc.description | Date of Conference: 25-28 August, 2009 | |
dc.description | Conference name: European Conference on Parallel Processing. Euro-Par 2009: Euro-Par 2009 Parallel Processing | |
dc.description.abstract | Data declustering is used to minimize query response times in data-intensive applications. In this technique, the query retrieval process is parallelized by distributing the data among several disks; it is useful in applications, such as geographic information systems, that access huge amounts of data. Declustering with replication is an extension of declustering with possible data replicas in the system. Many replicated declustering schemes have been proposed. Most of these schemes generate two or more copies of all data items. However, some applications have very large data sizes, and even having two copies of all data items may not be feasible. In such systems, selective replication is a necessity. Furthermore, existing replication schemes are not designed to utilize query distribution information when such information is available. In this study, we propose a replicated declustering scheme that decides both on the data items to be replicated and on the assignment of all data items to disks when replication capacity is limited. We make use of available query information to decide the replication and partitioning of the data, and we try to optimize the aggregate parallel response time. We propose and implement a Fiduccia-Mattheyses-like iterative improvement algorithm to obtain a two-way replicated declustering, and we use this algorithm in a recursive framework to generate a multi-way replicated declustering. Experiments conducted with arbitrary queries on real datasets show that, especially for low replication constraints, the proposed scheme yields better performance results than existing replicated declustering schemes. © 2009 Springer. | en_US |
dc.description.provenance | Made available in DSpace on 2016-02-08T12:27:25Z (GMT). No. of bitstreams: 1 bilkent-research-paper.pdf: 70227 bytes, checksum: 26e812c6f5156f83f0e77b261a471b5a (MD5) Previous issue date: 2009 | en |
dc.identifier.doi | 10.1007/978-3-642-03869-3_37 | en_US |
dc.identifier.uri | http://hdl.handle.net/11693/28697 | en_US |
dc.language.iso | English | en_US |
dc.publisher | Springer | en_US |
dc.relation.isversionof | http://dx.doi.org/10.1007/978-3-642-03869-3_37 | en_US |
dc.source.title | European Conference on Parallel Processing. Euro-Par 2009: Euro-Par 2009 Parallel Processing | en_US |
dc.subject | Data declustering | en_US |
dc.subject | Data items | en_US |
dc.subject | Data replica | en_US |
dc.subject | Data-intensive application | en_US |
dc.subject | Declustering | en_US |
dc.subject | Declustering scheme | en_US |
dc.subject | Iterative improvements | en_US |
dc.subject | Query distributions | en_US |
dc.subject | Query information | en_US |
dc.subject | Query response | en_US |
dc.subject | Query retrieval | en_US |
dc.subject | Real data sets | en_US |
dc.subject | Response time | en_US |
dc.subject | Selective replication | en_US |
dc.subject | Very large datum | en_US |
dc.subject | Artificial intelligence | en_US |
dc.subject | Bioinformatics | en_US |
dc.subject | Disks (machine components) | en_US |
dc.subject | Disks (structural components) | en_US |
dc.subject | Distributed computer systems | en_US |
dc.subject | Geographic information systems | en_US |
dc.subject | Response time (computer systems) | en_US |
dc.title | Selective replicated declustering for arbitrary queries | en_US |
dc.type | Conference Paper | en_US |
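The abstract's objective, aggregate parallel response time, can be illustrated with a small sketch. This is not the paper's Fiduccia-Mattheyses-like algorithm; it only evaluates a given placement. It assumes a unit-cost model that the record itself does not spell out: every data item takes one time unit to read, a query's response time is the largest number of its items served by any single disk, and each replicated item may be read from whichever copy gives the lowest response time. The names `response_time`, `aggregate_response_time`, and the toy data are hypothetical.

```python
from collections import Counter
from itertools import product


def response_time(query, placement):
    """Smallest achievable maximum per-disk load for one query.

    Assumed cost model: each item costs one time unit, and a replicated
    item can be read from any disk that holds a copy of it.
    """
    items = sorted(query)
    if not items:
        return 0
    best = len(items)
    # Brute force over every choice of one replica per requested item.
    # Exponential in the query size, so suitable for tiny examples only.
    for choice in product(*(sorted(placement[item]) for item in items)):
        best = min(best, max(Counter(choice).values()))
    return best


def aggregate_response_time(queries, placement):
    """Sum of the per-query parallel response times."""
    return sum(response_time(q, placement) for q in queries)


# Toy instance: four items, two disks, three queries (all hypothetical).
queries = [{"a", "b"}, {"c", "d"}, {"a", "c"}]
no_replication = {"a": {0}, "b": {0}, "c": {1}, "d": {1}}
one_replica = {"a": {0}, "b": {0, 1}, "c": {1}, "d": {1}}  # b copied to disk 1

print(aggregate_response_time(queries, no_replication))
print(aggregate_response_time(queries, one_replica))
```

In this toy instance, copying the single item `b` to disk 1 lowers the total from 5 to 4, which is the kind of gain selective replication aims for when full replication is not affordable.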
Files
Original bundle
- Name: Selective replicated declustering for arbitrary queries.pdf
- Size: 411.74 KB
- Format: Adobe Portable Document Format
- Description: Full printable version