Scaling stratified stochastic gradient descent for distributed matrix completion

Abubaker, Nabil; Karsavuran, M. O.; Aykanat, Cevdet

Scaling stratified stochastic gradient descent for distributed matrix completion

Files

Scaling_stratified_stochastic_gradient_descent_for_distributed_matrix_completion.pdf (1.58 MB)

Date

2023-10-01

Authors

Abubaker, Nabil

Karsavuran, M. O.

Aykanat, Cevdet

BUIR Usage Stats

30
views

39
downloads

Citation Stats

Abstract

Stratified SGD (SSGD) is the primary approach for achieving serializable parallel SGD for matrix completion. State-of-the-art parallelizations of SSGD fail to scale due to large communication overhead. During an SGD epoch, these methods send data proportional to one of the dimensions of the rating matrix. We propose a framework for scalable SSGD through significantly reducing the communication overhead via exchanging point-to-point messages utilizing the sparsity of the rating matrix. We provide formulas to represent the essential communication for correctly performing parallel SSGD and we propose a dynamic programming algorithm for efficiently computing them to establish the point-to-point message schedules. This scheme, however, significantly increases the number of messages sent by a processor per epoch from O(K) to (K2) for a K-processor system which might limit the scalability. To remedy this, we propose a Hold-and-Combine strategy to limit the upper-bound on the number of messages sent per processor to O(KlgK). We also propose a hypergraph partitioning model that correctly encapsulates reducing the communication volume. Experimental results show that the framework successfully achieves a scalable distributed SSGD through significantly reducing the communication overhead. Our code is publicly available at: github.com/nfabubaker/CESSGD

Source Title

IEEE Transactions on Knowledge and Data Engineering

Publisher

Institute of Electrical and Electronics Engineers

Keywords

Bandwidth cost, Combinatorial algorithms, Communication cost minimization, Collaborative filtering, HPC, Hypergraph partitioning, Latency cost, Matrix completion, Recommender systems, SGD

Permalink

https://hdl.handle.net/11693/114918

Published Version (Please cite this version)

https://dx.doi.org/10.1109/TKDE.2023.3253791

Collections

Scholarly Publications - Computer Engineering

Language

English

Type

Article

Full item page

Scaling stratified stochastic gradient descent for distributed matrix completion

Files

Date

Authors

Editor(s)

Advisor

Supervisor

Co-Advisor

Co-Supervisor

Instructor

BUIR Usage Stats

Citation Stats

Series

Abstract

Source Title

Publisher

Course

Other identifiers

Book Title

Keywords

Degree Discipline

Degree Level

Degree Name

Citation

Permalink

Published Version (Please cite this version)

Collections

Language

Type

Scaling stratified stochastic gradient descent for distributed matrix completion

Files

Date

Authors

Editor(s)

Advisor

Supervisor

Co-Advisor

Co-Supervisor

Instructor

BUIR Usage Stats

Citation Stats

Share

Series

Abstract

Source Title

Publisher

Course

Other identifiers

Book Title

Keywords

Degree Discipline

Degree Level

Degree Name

Citation

Permalink

Published Version (Please cite this version)

Collections

Language

Type