Scaling sparse matrix-matrix multiplication in the accumulo database
buir.contributor.author | Demirci, Gündüz Vehbi | |
buir.contributor.author | Aykanat, Cevdet | |
dc.citation.epage | 62 | en_US |
dc.citation.issueNumber | 1 | |
dc.citation.spage | 31 | en_US |
dc.citation.volumeNumber | 38 | |
dc.contributor.author | Demirci, Gündüz Vehbi | en_US |
dc.contributor.author | Aykanat, Cevdet | en_US |
dc.date.accessioned | 2020-02-03T12:48:40Z | |
dc.date.available | 2020-02-03T12:48:40Z | |
dc.date.issued | 2020 | |
dc.department | Department of Computer Engineering | en_US |
dc.description.abstract | We propose and implement a sparse matrix-matrix multiplication (SpGEMM) algorithm running on top of Accumulo’s iterator framework which enables high performance distributed parallelism. The proposed algorithm provides write-locality while ingesting the output matrix back to database via utilizing row-by-row parallel SpGEMM. The proposed solution also alleviates scanning of input matrices multiple times by making use of Accumulo’s batch scanning capability which is used for accessing multiple ranges of key-value pairs in parallel. Even though the use of batch-scanning introduces some latency overheads, these overheads are alleviated by the proposed solution and by using node-level parallelism structures. We also propose a matrix partitioning scheme which reduces the total communication volume and provides a balance of workload among servers. The results of extensive experiments performed on both real-world and synthetic sparse matrices show that the proposed algorithm scales significantly better than the outer-product parallel SpGEMM algorithm available in the Graphulo library. By applying the proposed matrix partitioning, the performance of the proposed algorithm is further improved considerably. | en_US |
dc.description.provenance | Submitted by Zeynep Aykut (zeynepay@bilkent.edu.tr) on 2020-02-03T12:48:39Z No. of bitstreams: 1 Scaling_sparse_matrix-matrix_multiplication_in_the_accumulo_database.pdf: 672671 bytes, checksum: 414b7442709efb5d7906ac883b8bda9d (MD5) | en |
dc.description.provenance | Made available in DSpace on 2020-02-03T12:48:40Z (GMT). No. of bitstreams: 1 Scaling_sparse_matrix-matrix_multiplication_in_the_accumulo_database.pdf: 672671 bytes, checksum: 414b7442709efb5d7906ac883b8bda9d (MD5) Previous issue date: 2019 | en |
dc.identifier.doi | 10.1007/s10619-019-07257-y | en_US |
dc.identifier.issn | 0926-8782 | en_US |
dc.identifier.uri | http://hdl.handle.net/11693/53002 | en_US |
dc.language.iso | English | en_US |
dc.publisher | Springer | en_US |
dc.relation.isversionof | https://dx.doi.org/10.1007/s10619-019-07257-y | en_US |
dc.source.title | Distributed and Parallel Databases | en_US |
dc.subject | Accumulo | en_US |
dc.subject | Data locality | en_US |
dc.subject | Databases | en_US |
dc.subject | Graph partitioning | en_US |
dc.subject | Graphulo | en_US |
dc.subject | Matrix partitioning | en_US |
dc.subject | NoSQL | en_US |
dc.subject | Parallel and distributed computing | en_US |
dc.subject | Sparse matrices | en_US |
dc.subject | Sparse matrix–matrix multiplication | en_US |
dc.subject | SpGEMM | en_US |
dc.title | Scaling sparse matrix-matrix multiplication in the accumulo database | en_US |
dc.type | Article | en_US |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- Scaling_sparse_matrix_matrix_multiplication_in_the_accumulo_database.pdf
- Size:
- 646.85 KB
- Format:
- Adobe Portable Document Format
- Description:
- View / Download