Online training of LSTM networks in distributed systems for variable length data sequences

Ergen, T.; Kozat, Serdar

Files

Online_training-of_LSTM_networks_in_distributed_systems_for_variable_lenght_data_sequences.pdf (683.84 KB)

Date

2018

Authors

Ergen, T.

Kozat, Serdar

Abstract

In this brief, we investigate online training of long short-term memory (LSTM) architectures in a distributed network of nodes, where each node employs an LSTM-based structure for online regression. In particular, each node sequentially receives a variable length data sequence with its label and can only exchange information with its neighbors to train the LSTM architecture. We first provide a generic LSTM-based regression structure for each node. In order to train this structure, we put the LSTM equations in a nonlinear state-space form for each node and then introduce a highly effective and efficient distributed particle filtering (DPF)-based training algorithm. We also introduce a distributed extended Kalman filtering-based training algorithm for comparison. Here, our DPF-based training algorithm guarantees convergence to the performance of the optimal LSTM coefficients in the mean square error sense under certain conditions. We achieve this performance with communication and computational complexity on the order of first-order gradient-based methods. Through both simulated and real-life examples, we illustrate significant performance improvements with respect to the state-of-the-art methods.

Source Title

IEEE Transactions on Neural Networks and Learning Systems

Publisher

Institute of Electrical and Electronics Engineers

Keywords

Distributed learning, Extended Kalman filtering (EKF), Long short term memory (LSTM) networks, Online learning, Particle filtering

Permalink

http://hdl.handle.net/11693/50274

Published Version (Please cite this version)

https://doi.org/10.1109/TNNLS.2017.2770179

Collections

Scholarly Publications - Electrical and Electronics Engineering

Language

English

Type

Article
