Graph aware caching policy for distributed graph stores
dc.citation.epage | 15 | en_US |
dc.citation.spage | 6 | en_US |
dc.contributor.author | Aksu, Hidayet | en_US |
dc.contributor.author | Canım, M. | en_US |
dc.contributor.author | Chang, Y.-C. | en_US |
dc.contributor.author | Körpeoğlu, İbrahim | en_US |
dc.contributor.author | Ulusoy, Özgür | en_US |
dc.coverage.spatial | Tempe, AZ, USA | |
dc.date.accessioned | 2016-02-08T12:24:11Z | |
dc.date.available | 2016-02-08T12:24:11Z | |
dc.date.issued | 2015-03 | en_US |
dc.department | Department of Computer Engineering | en_US |
dc.description | Conference name: IEEE International Conference on Cloud Engineering, 2015 | |
dc.description | Date of Conference: 9-13 March 2015 | |
dc.description.abstract | Graph stores are becoming increasingly popular among NOSQL applications seeking flexibility and heterogeneity in managing linked data. Conceptually and in practice, applications ranging from social networks, knowledge representations to Internet of things benefit from graph data stores built on a combination of relational and non-relational technologies aimed at desired performance characteristics. The most common data access pattern in querying graph stores is to traverse from a node to its neighboring nodes. This paper studies the impact of such traversal pattern to common data caching policies in a partitioned data environment where a big graph is distributed across servers in a cluster. We propose and evaluate a new graph aware caching policy designed to keep and evict nodes, edges and their metadata optimized for query traversal pattern. The algorithm distinguishes the topology of the graph as well as the latency of access to the graph nodes and neighbors. We implemented graph aware caching on a distributed data store Apache HBase in the Hadoop family. Performance evaluations showed up to 15x speedup on the benchmark datasets preferring our new graph aware policy over non-aware policies. We also show how to improve the performance of existing caching algorithms for distributed graphs by exploiting the topology information. © 2015 IEEE. | en_US |
dc.description.provenance | Made available in DSpace on 2016-02-08T12:24:11Z (GMT). No. of bitstreams: 1 bilkent-research-paper.pdf: 70227 bytes, checksum: 26e812c6f5156f83f0e77b261a471b5a (MD5) Previous issue date: 2015 | en |
dc.identifier.doi | 10.1109/IC2E.2015.39 | en_US |
dc.identifier.uri | http://hdl.handle.net/11693/28574 | en_US |
dc.language.iso | English | en_US |
dc.publisher | IEEE | en_US |
dc.relation.isversionof | http://dx.doi.org/10.1109/IC2E.2015.39 | en_US |
dc.source.title | Proceedings - EEE International Conference on Cloud Engineering, IC2E 2015 | en_US |
dc.subject | Apache HBase | en_US |
dc.subject | Big data analytics | en_US |
dc.subject | Cache | en_US |
dc.subject | Distributed computing | en_US |
dc.subject | Graph aware | en_US |
dc.subject | Benchmarking | en_US |
dc.subject | Big data | en_US |
dc.subject | Data handling | en_US |
dc.subject | Distributed computer systems | en_US |
dc.subject | Knowledge representation | en_US |
dc.subject | Social sciences computing | en_US |
dc.subject | Topology | en_US |
dc.subject | Data analytics | en_US |
dc.subject | Distributed data stores | en_US |
dc.subject | Performance characteristics | en_US |
dc.subject | Topology information | en_US |
dc.subject | Traversal patterns | en_US |
dc.subject | Graph theory | en_US |
dc.title | Graph aware caching policy for distributed graph stores | en_US |
dc.type | Conference Paper | en_US |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- Graph aware caching policy for distributed graph stores.pdf
- Size:
- 578.73 KB
- Format:
- Adobe Portable Document Format
- Description:
- Full printable version