Efficient online learning with improved LSTM neural networks

Mirza, Ali H.; Kerpiçci, Mine; Kozat, Süleyman S.

Efficient online learning with improved LSTM neural networks

buir.contributor.author	Mirza, Ali H.
buir.contributor.author	Kerpiçci, Mine
buir.contributor.author	Kozat, Süleyman S.
dc.citation.volumeNumber	102	en_US
dc.contributor.author	Mirza, Ali H.
dc.contributor.author	Kerpiçci, Mine
dc.contributor.author	Kozat, Süleyman S.
dc.date.accessioned	2021-02-20T18:01:11Z
dc.date.available	2021-02-20T18:01:11Z
dc.date.issued	2020-04-14
dc.department	Department of Electrical and Electronics Engineering	en_US
dc.description.abstract	We introduce efficient online learning algorithms based on the Long Short Term Memory (LSTM) networks that employ the covariance information. In particular, we introduce the covariance of the present and one-time step past input vectors into the gating structure of the LSTM networks. Additionally, we include the covariance of the output vector, and we learn their weight matrices to improve the learning performance of the LSTM networks where we also provide their updates. We reduce the number of system parameters through the weight matrix factorization where we convert the LSTM weight matrices into two smaller matrices in order to achieve high learning performance with low computational complexity. Moreover, we apply the introduced approach to the Gated Recurrent Unit (GRU) architecture. In our experiments, we illustrate significant performance improvements achieved by our methods on real-life datasets with respect to the vanilla LSTM and vanilla GRU networks.	en_US
dc.embargo.release	2022-04-14
dc.identifier.doi	10.1016/j.dsp.2020.102742	en_US
dc.identifier.issn	1051-2004
dc.identifier.uri	http://hdl.handle.net/11693/75516
dc.language.iso	English	en_US
dc.publisher	Elsevier	en_US
dc.relation.isversionof	https://doi.org/10.1016/j.dsp.2020.102742	en_US
dc.source.title	Digital Signal Processing: A Review Journal	en_US
dc.subject	Online learning	en_US
dc.subject	LSTM	en_US
dc.subject	Covariance	en_US
dc.subject	Weight matrix factorization	en_US
dc.title	Efficient online learning with improved LSTM neural networks	en_US
dc.type	Article	en_US

Files

Original bundle

Now showing 1 - 1 of 1

Name:: Efficient_online_learning_with_improved_LSTM_neural_networks.pdf
Size:: 1 MB
Format:: Adobe Portable Document Format
Description:

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 1.71 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Scholarly Publications - Electrical and Electronics Engineering