Online learning with recurrent neural networks

buir.advisor: Kozat, Süleyman Serdar
dc.contributor.author: Ergen, Tolga
dc.date.accessioned: 2018-07-30T10:58:16Z
dc.date.available: 2018-07-30T10:58:16Z
dc.date.copyright: 2018-07
dc.date.issued: 2018-07
dc.date.submitted: 2018-07-17
dc.description: Cataloged from PDF version of article.
dc.description: Thesis (M.S.): İhsan Doğramacı Bilkent University, Department of Electrical and Electronics Engineering, 2018.
dc.description: Includes bibliographical references (leaves 80-87).
dc.description.abstract: In this thesis, we study online learning with Recurrent Neural Networks (RNNs). In particular, in Chapter 2, we investigate online nonlinear regression and introduce novel regression structures based on the Long Short Term Memory (LSTM) network, an advanced RNN architecture. To train these novel LSTM based structures, we introduce highly efficient and effective Particle Filtering (PF) based updates. We also provide Stochastic Gradient Descent (SGD) and Extended Kalman Filter (EKF) based updates. Our PF based training method guarantees convergence to the optimal parameter estimation in the Mean Square Error (MSE) sense. In Chapter 3, we investigate online training of LSTM architectures in a distributed network of nodes, where each node employs an LSTM based structure for online regression. We first provide a generic LSTM based regression structure for each node. In order to train this structure, we introduce a highly effective and efficient Distributed PF (DPF) based training algorithm. We also introduce a Distributed EKF (DEKF) based training algorithm. Here, our DPF based training algorithm guarantees convergence to the performance of the optimal centralized LSTM parameters in the MSE sense. In Chapter 4, we investigate variable length data regression in an online setting and introduce an energy efficient regression structure built on LSTM networks. To reduce the complexity of this structure, we first replace the regular multiplication operations with an energy efficient operator. We then apply factorizations to the weight matrices so that the total number of parameters to be trained is significantly reduced. We then introduce online training algorithms. Through a set of experiments, we illustrate significant performance gains and complexity reductions achieved by the introduced algorithms with respect to the state of the art methods.
dc.description.provenance: Submitted by Betül Özen (ozen@bilkent.edu.tr) on 2018-07-30T10:58:16Z. No. of bitstreams: 1; my_thesis.pdf: 2259158 bytes, checksum: e6ebf2a440d035de934e94f77801c503 (MD5)
dc.description.provenance: Made available in DSpace on 2018-07-30T10:58:16Z (GMT). No. of bitstreams: 1; my_thesis.pdf: 2259158 bytes, checksum: e6ebf2a440d035de934e94f77801c503 (MD5). Previous issue date: 2018-07
dc.description.statementofresponsibility: by Tolga Ergen.
dc.embargo.release: 2021-07-17
dc.format.extent: xii, 87 leaves : graphics (some color) ; 30 cm.
dc.identifier.itemid: B158690
dc.identifier.uri: http://hdl.handle.net/11693/47693
dc.language.iso: English
dc.rights: info:eu-repo/semantics/openAccess
dc.subject: Online Learning
dc.subject: Recurrent Neural Network (RNN)
dc.subject: Extended Kalman Filtering (EKF)
dc.subject: Particle Filtering (PF)
dc.subject: Stochastic Gradient Descent (SGD)
dc.title: Online learning with recurrent neural networks
dc.title.alternative: Yinelenen sinir ağları ile çevrimiçi öğrenim [English: Online learning with recurrent neural networks]
dc.type: Thesis
thesis.degree.discipline: Electrical and Electronic Engineering
thesis.degree.grantor: Bilkent University
thesis.degree.level: Master's
thesis.degree.name: MS (Master of Science)
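
The abstract above describes, at a high level, online (sample-by-sample) regression with an LSTM whose parameters are updated as data arrives. As a purely illustrative aid, and not the thesis's actual algorithm, the following minimal NumPy sketch shows the generic setup the abstract assumes: an LSTM cell maintains a state, a linear readout produces the prediction, and each new observation triggers one SGD step on the squared error. All names here (lstm_step, hidden_size, the toy target signal) are this sketch's assumptions; the thesis's PF and EKF based updates are not reproduced.

# A minimal sketch (not the thesis's exact method): online regression with
# a single LSTM cell and a linear readout, updated by SGD after each sample.
import numpy as np

rng = np.random.default_rng(0)
input_size, hidden_size = 4, 8

# Stacked LSTM weights for the input, forget, cell, and output gates.
W = rng.normal(0.0, 0.1, (4 * hidden_size, input_size + hidden_size))
b = np.zeros(4 * hidden_size)
w_out = rng.normal(0.0, 0.1, hidden_size)  # linear readout

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def lstm_step(x, h, c):
    z = W @ np.concatenate([x, h]) + b
    i, f, g, o = np.split(z, 4)
    c_new = sigmoid(f) * c + sigmoid(i) * np.tanh(g)
    h_new = sigmoid(o) * np.tanh(c_new)
    return h_new, c_new

# Online loop: predict, observe the desired value, take one SGD step on the
# squared error. For brevity only the readout is trained here; the thesis
# also trains the LSTM weights (via SGD, EKF, or PF based updates).
h, c = np.zeros(hidden_size), np.zeros(hidden_size)
lr = 0.05
for t in range(1000):
    x_t = rng.normal(size=input_size)   # incoming feature vector
    d_t = np.sin(x_t.sum())             # stand-in for the desired signal
    h, c = lstm_step(x_t, h, c)
    y_t = w_out @ h                     # prediction
    err = d_t - y_t
    w_out += lr * err * h               # gradient step on 0.5 * err**2

For Chapter 4's complexity reduction, the abstract mentions factorizing the weight matrices to shrink the number of trainable parameters. A sketch of that general idea, assuming a plain rank-r factorization (the thesis's specific factorization may differ):

# Replace a dense m-by-n weight matrix with W ~ A @ B of rank r, cutting the
# trainable parameter count from m*n down to r*(m + n).
m, n, r = 32, 40, 4
A = rng.normal(0.0, 0.1, (m, r))
B = rng.normal(0.0, 0.1, (r, n))
x = rng.normal(size=n)
y = A @ (B @ x)  # plays the role of W @ x with far fewer parameters
print(m * n, "dense parameters vs", r * (m + n), "factorized")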

Files

Original bundle
Name: my_thesis.pdf
Size: 2.15 MB
Format: Adobe Portable Document Format
Description: Full printable version
License bundle
Name: license.txt
Size: 1.71 KB
Format: Item-specific license agreed upon to submission