Energy-Efficient LSTM networks for online learning
buir.contributor.author | Mirza, Ali H. | |
buir.contributor.author | Kozat, Süleyman Serdar | |
dc.citation.epage | 3126 | en_US |
dc.citation.issueNumber | 8 | en_US |
dc.citation.spage | 3114 | en_US |
dc.citation.volumeNumber | 31 | en_US |
dc.contributor.author | Ergen, T. | |
dc.contributor.author | Mirza, Ali H. | |
dc.contributor.author | Kozat, Süleyman Serdar | |
dc.date.accessioned | 2021-02-18T11:12:44Z | |
dc.date.available | 2021-02-18T11:12:44Z | |
dc.date.issued | 2020 | |
dc.department | Department of Electrical and Electronics Engineering | en_US |
dc.description.abstract | We investigate variable-length data regression in an online setting and introduce an energy-efficient regression structure build on long short-term memory (LSTM) networks. For this structure, we also introduce highly effective online training algorithms. We first provide a generic LSTM-based regression structure for variable-length input sequences. To reduce the complexity of this structure, we then replace the regular multiplication operations with an energy-efficient operator, i.e., the ef-operator. To further reduce the complexity, we apply factorizations to the weight matrices in the LSTM network so that the total number of parameters to be trained is significantly reduced. We then introduce online training algorithms based on the stochastic gradient descent (SGD) and exponentiated gradient (EG) algorithms to learn the parameters of the introduced network. Thus, we obtain highly efficient and effective online learning algorithms based on the LSTM network. Thanks to our generic approach, we also provide and simulate an energy-efficient gated recurrent unit (GRU) network in our experiments. Through an extensive set of experiments, we illustrate significant performance gains and complexity reductions achieved by the introduced algorithms with respect to the conventional methods. | en_US |
dc.description.sponsorship | This work was supported in part by the Tubitak Project under Grant 117E153. | en_US |
dc.identifier.doi | 10.1109/TNNLS.2019.2935796 | en_US |
dc.identifier.issn | 2162-237X | |
dc.identifier.uri | http://hdl.handle.net/11693/75451 | |
dc.language.iso | English | en_US |
dc.publisher | IEEE | en_US |
dc.relation.isversionof | https://dx.doi.org/10.1109/TNNLS.2019.2935796 | en_US |
dc.source.title | IEEE Transactions on Neural Networks and Learning Systems | en_US |
dc.subject | Ef-operator | en_US |
dc.subject | Exponentiated gradient (EG) | en_US |
dc.subject | Gradient descent | en_US |
dc.subject | Long short-term memory (LSTM) | en_US |
dc.subject | Matrix factorization | en_US |
dc.title | Energy-Efficient LSTM networks for online learning | en_US |
dc.type | Article | en_US |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- Energy-Efficient_LSTM_Networks_for_Online_Learning.pdf
- Size:
- 1.89 MB
- Format:
- Adobe Portable Document Format
- Description:
License bundle
1 - 1 of 1
No Thumbnail Available
- Name:
- license.txt
- Size:
- 1.71 KB
- Format:
- Item-specific license agreed upon to submission
- Description: