Efficient online learning with improved LSTM neural networks

buir.contributor.author: Mirza, Ali H.
buir.contributor.author: Kerpiçci, Mine
buir.contributor.author: Kozat, Süleyman S.
dc.citation.volumeNumber: 102 (en_US)
dc.contributor.author: Mirza, Ali H.
dc.contributor.author: Kerpiçci, Mine
dc.contributor.author: Kozat, Süleyman S.
dc.date.accessioned: 2021-02-20T18:01:11Z
dc.date.available: 2021-02-20T18:01:11Z
dc.date.issued: 2020-04-14
dc.department: Department of Electrical and Electronics Engineering (en_US)
dc.description.abstract: We introduce efficient online learning algorithms based on the Long Short Term Memory (LSTM) networks that employ the covariance information. In particular, we introduce the covariance of the present and one-time step past input vectors into the gating structure of the LSTM networks. Additionally, we include the covariance of the output vector, and we learn their weight matrices to improve the learning performance of the LSTM networks where we also provide their updates. We reduce the number of system parameters through the weight matrix factorization where we convert the LSTM weight matrices into two smaller matrices in order to achieve high learning performance with low computational complexity. Moreover, we apply the introduced approach to the Gated Recurrent Unit (GRU) architecture. In our experiments, we illustrate significant performance improvements achieved by our methods on real-life datasets with respect to the vanilla LSTM and vanilla GRU networks. (en_US)
dc.embargo.release: 2022-04-14
dc.identifier.doi: 10.1016/j.dsp.2020.102742 (en_US)
dc.identifier.issn: 1051-2004
dc.identifier.uri: http://hdl.handle.net/11693/75516
dc.language.iso: English (en_US)
dc.publisher: Elsevier (en_US)
dc.relation.isversionof: https://doi.org/10.1016/j.dsp.2020.102742 (en_US)
dc.source.title: Digital Signal Processing: A Review Journal (en_US)
dc.subject: Online learning (en_US)
dc.subject: LSTM (en_US)
dc.subject: Covariance (en_US)
dc.subject: Weight matrix factorization (en_US)
dc.title: Efficient online learning with improved LSTM neural networks (en_US)
dc.type: Article (en_US)
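
Note on the abstract's weight matrix factorization: the record does not give the paper's exact formulation, so the following is only a minimal sketch of the general idea, assuming a plain low-rank product W = A B in which an m x n weight matrix is replaced by an m x r and an r x n matrix. The names `matvec`, `matmul`, `param_counts` and the rank `r` are illustrative and do not come from the paper. The sketch shows that applying the two small matrices in sequence gives the same result as applying their product, while storing r(m + n) parameters instead of mn.

```python
def matvec(M, v):
    """Multiply a matrix (list of rows) by a vector (list of floats)."""
    return [sum(m_ij * v_j for m_ij, v_j in zip(row, v)) for row in M]


def matmul(A, B):
    """Multiply two matrices given as lists of rows."""
    cols = list(zip(*B))
    return [[sum(a * b for a, b in zip(row, col)) for col in cols] for row in A]


def param_counts(m, n, r):
    """Return (full, factored) parameter counts for an m x n matrix
    versus an assumed rank-r factorization into m x r and r x n matrices."""
    return m * n, r * (m + n)


# Hypothetical small factors: A is 4 x 2, B is 2 x 6, so W = A B is 4 x 6.
A = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [2.0, -1.0]]
B = [[1.0, 2.0, 0.0, -1.0, 0.5, 0.0],
     [0.0, 1.0, 3.0, 0.0, 0.0, 2.0]]
x = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]

full = matvec(matmul(A, B), x)       # apply the full 4 x 6 matrix
factored = matvec(A, matvec(B, x))   # apply B first, then A

full_params, factored_params = param_counts(64, 64, 8)
```

The saving only materializes when r is well below mn / (m + n); for the 4 x 6 toy factors above the two counts are nearly equal, whereas for a 64 x 64 matrix at rank 8 the factored form stores a quarter of the parameters.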

Files

Original bundle

Name: Efficient_online_learning_with_improved_LSTM_neural_networks.pdf
Size: 1 MB
Format: Adobe Portable Document Format
