Energy-Efficient LSTM networks for online learning

Ergen, T.; Mirza, Ali H.; Kozat, Süleyman Serdar

Energy-Efficient LSTM networks for online learning

buir.contributor.author	Mirza, Ali H.
buir.contributor.author	Kozat, Süleyman Serdar
dc.citation.epage	3126	en_US
dc.citation.issueNumber	8	en_US
dc.citation.spage	3114	en_US
dc.citation.volumeNumber	31	en_US
dc.contributor.author	Ergen, T.
dc.contributor.author	Mirza, Ali H.
dc.contributor.author	Kozat, Süleyman Serdar
dc.date.accessioned	2021-02-18T11:12:44Z
dc.date.available	2021-02-18T11:12:44Z
dc.date.issued	2020
dc.department	Department of Electrical and Electronics Engineering	en_US
dc.description.abstract	We investigate variable-length data regression in an online setting and introduce an energy-efficient regression structure build on long short-term memory (LSTM) networks. For this structure, we also introduce highly effective online training algorithms. We first provide a generic LSTM-based regression structure for variable-length input sequences. To reduce the complexity of this structure, we then replace the regular multiplication operations with an energy-efficient operator, i.e., the ef-operator. To further reduce the complexity, we apply factorizations to the weight matrices in the LSTM network so that the total number of parameters to be trained is significantly reduced. We then introduce online training algorithms based on the stochastic gradient descent (SGD) and exponentiated gradient (EG) algorithms to learn the parameters of the introduced network. Thus, we obtain highly efficient and effective online learning algorithms based on the LSTM network. Thanks to our generic approach, we also provide and simulate an energy-efficient gated recurrent unit (GRU) network in our experiments. Through an extensive set of experiments, we illustrate significant performance gains and complexity reductions achieved by the introduced algorithms with respect to the conventional methods.	en_US
dc.description.sponsorship	This work was supported in part by the Tubitak Project under Grant 117E153.	en_US
dc.identifier.doi	10.1109/TNNLS.2019.2935796	en_US
dc.identifier.issn	2162-237X
dc.identifier.uri	http://hdl.handle.net/11693/75451
dc.language.iso	English	en_US
dc.publisher	IEEE	en_US
dc.relation.isversionof	https://dx.doi.org/10.1109/TNNLS.2019.2935796	en_US
dc.source.title	IEEE Transactions on Neural Networks and Learning Systems	en_US
dc.subject	Ef-operator	en_US
dc.subject	Exponentiated gradient (EG)	en_US
dc.subject	Gradient descent	en_US
dc.subject	Long short-term memory (LSTM)	en_US
dc.subject	Matrix factorization	en_US
dc.title	Energy-Efficient LSTM networks for online learning	en_US
dc.type	Article	en_US

Files

Original bundle

Now showing 1 - 1 of 1

Name:: Energy-Efficient_LSTM_Networks_for_Online_Learning.pdf
Size:: 1.89 MB
Format:: Adobe Portable Document Format
Description:

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 1.71 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Department of Electrical and Electronics Engineering