Efficient online learning algorithms based on LSTM neural networks

Ergen, Tolga; Kozat, Süleyman Serdar

Efficient online learning algorithms based on LSTM neural networks

Files

Efficient-Online-Learning-Algorithms-Based-on-LSTM-Neural-Networks.pdf (1.34 MB)

Date

2018

Authors

Ergen, Tolga

Kozat, Süleyman Serdar

BUIR Usage Stats

1
views

250
downloads

Citation Stats

Abstract

We investigate online nonlinear regression and introduce novel regression structures based on the long short term memory (LSTM) networks. For the introduced structures, we also provide highly efficient and effective online training methods. To train these novel LSTM-based structures, we put the underlying architecture in a state space form and introduce highly efficient and effective particle filtering (PF)-based updates. We also provide stochastic gradient descent and extended Kalman filter-based updates. Our PF-based training method guarantees convergence to the optimal parameter estimation in the mean square error sense provided that we have a sufficient number of particles and satisfy certain technical conditions. More importantly, we achieve this performance with a computational complexity in the order of the first-order gradient-based methods by controlling the number of particles. Since our approach is generic, we also introduce a gated recurrent unit (GRU)-based approach by directly replacing the LSTM architecture with the GRU architecture, where we demonstrate the superiority of our LSTM-based approach in the sequential prediction task via different real life data sets. In addition, the experimental results illustrate significant performance improvements achieved by the introduced algorithms with respect to the conventional methods over several different benchmark real life data sets.

Source Title

IEEE Transactions on Neural Networks and Learning Systems

Publisher

Institute of Electrical and Electronics Engineers

Keywords

Gated recurrent unit (GRU), Kalman filtering, Long short term memory (LSTM), Online learning, Particle filtering (PF), Regression, Stochastic gradient descent (SGD)

Permalink

http://hdl.handle.net/11693/50272

Published Version (Please cite this version)

https://doi.org/10.1109/TNNLS.2017.2741598

Collections

Scholarly Publications - Electrical and Electronics Engineering

Language

English

Type

Article

Full item page

Efficient online learning algorithms based on LSTM neural networks

Files

Date

Authors

Editor(s)

Advisor

Supervisor

Co-Advisor

Co-Supervisor

Instructor

BUIR Usage Stats

Citation Stats

Series

Abstract

Source Title

Publisher

Course

Other identifiers

Book Title

Keywords

Degree Discipline

Degree Level

Degree Name

Citation

Permalink

Published Version (Please cite this version)

Collections

Language

Type

Efficient online learning algorithms based on LSTM neural networks

Files

Date

Authors

Editor(s)

Advisor

Supervisor

Co-Advisor

Co-Supervisor

Instructor

BUIR Usage Stats

Citation Stats

Share

Series

Abstract

Source Title

Publisher

Course

Other identifiers

Book Title

Keywords

Degree Discipline

Degree Level

Degree Name

Citation

Permalink

Published Version (Please cite this version)

Collections

Language

Type