A highly efficient recurrent neural network architecture for data regression

Date

2018

Editor(s)

Advisor

Supervisor

Co-Advisor

Co-Supervisor

Instructor

Source Title

2018 26th Signal Processing and Communications Applications Conference (SIU)

Print ISSN

Electronic ISSN

Publisher

IEEE

Volume

Issue

Pages

Language

Turkish

Journal Title

Journal ISSN

Volume Title

Series

Abstract

In this paper, we study online nonlinear data regression and propose a highly efficient long short term memory (LSTM) network based architecture. Here, we also introduce on-line training algorithms to learn the parameters of the introduced architecture. We first propose an LSTM based architecture for data regression. To diminish the complexity of this architecture, we use an energy efficient operator (ef-operator) instead of the multiplication operation. We then factorize the matrices of the LSTM network to reduce the total number of parameters to be learned. In order to train the parameters of this structure, we introduce online learning methods based on the exponentiated gradient (EG) and stochastic gradient descent (SGD) algorithms. Experimental results demonstrate considerable performance and efficiency improvements provided by the introduced architecture.

Course

Other identifiers

Book Title

Citation