An efficient and effective second-order training algorithm for LSTM-based adaptive learning

Vural, N. Mert; Ergüt, S.; Kozat, Süleyman S.

An efficient and effective second-order training algorithm for LSTM-based adaptive learning

buir.contributor.author	Vural, N. Mert
buir.contributor.author	Kozat, Süleyman S.
buir.contributor.orcid	Vural, N. Mert\|0000-0002-2820-2806
buir.contributor.orcid	Kozat, Süleyman S.\|000-0002-6488-3848
dc.citation.epage	2554	en_US
dc.citation.spage	2541	en_US
dc.citation.volumeNumber	69	en_US
dc.contributor.author	Vural, N. Mert
dc.contributor.author	Ergüt, S.
dc.contributor.author	Kozat, Süleyman S.
dc.date.accessioned	2022-01-31T13:50:07Z
dc.date.available	2022-01-31T13:50:07Z
dc.date.issued	2021-04-07
dc.department	Department of Electrical and Electronics Engineering	en_US
dc.description.abstract	We study adaptive (or online) nonlinear regression with Long-Short-Term-Memory (LSTM) based networks, i.e., LSTM-based adaptive learning. In this context, we introduce an efficient Extended Kalman filter (EKF) based second-order training algorithm. Our algorithm is truly online, i.e., it does not assume any underlying data generating process and future information, except that the target sequence is bounded. Through an extensive set of experiments, we demonstrate significant performance gains achieved by our algorithm with respect to the state-of-the-art methods. Here, we mainly show that our algorithm consistently provides 10 to 45% improvement in the accuracy compared to the widely-used adaptive methods Adam, RMSprop, and DEKF, and comparable performance to EKF with a 10 to 15 times reduction in the run-time.	en_US
dc.identifier.doi	10.1109/TSP.2021.3071566	en_US
dc.identifier.eissn	1941-0476
dc.identifier.issn	1053-587X
dc.identifier.uri	http://hdl.handle.net/11693/76922
dc.language.iso	English	en_US
dc.publisher	IEEE	en_US
dc.relation.isversionof	https://doi.org/10.1109/TSP.2021.3071566	en_US
dc.source.title	IEEE Transactions on Signal Processing	en_US
dc.subject	Adaptive learning	en_US
dc.subject	Online learning	en_US
dc.subject	Truly online	en_US
dc.subject	Long short term memory (LSTM)	en_US
dc.subject	Kalman filtering	en_US
dc.subject	Regression	en_US
dc.subject	Stochastic gradient descent (SGD) EDICS Category: MLR-SLER	en_US
dc.subject	MLR-DEEP	en_US
dc.title	An efficient and effective second-order training algorithm for LSTM-based adaptive learning	en_US
dc.type	Article	en_US

Files

Original bundle

Now showing 1 - 1 of 1

Name:: An_efficient_and_effective_second-order_training_algorithm_for_LSTM-based_adaptive_learning.pdf
Size:: 2.5 MB
Format:: Adobe Portable Document Format
Description:

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 1.69 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Scholarly Publications - Electrical and Electronics Engineering