Low complexity efficient online learning algorithms using LSTM networks

buir.advisorKozat, Süleyman Serdar
dc.contributor.authorMirza, Ali Hassan
dc.date.accessioned2018-12-13T13:14:54Z
dc.date.available2018-12-13T13:14:54Z
dc.date.copyright2018-12
dc.date.issued2018-12
dc.date.submitted2018-12-12
dc.departmentDepartment of Electrical and Electronics Engineeringen_US
dc.descriptionCataloged from PDF version of article.en_US
dc.descriptionThesis (M.S.): Bilkent University, Department of Electrical and Electronics Engineering, İhsan Doğramacı Bilkent University, 2018.en_US
dc.descriptionIncludes bibliographical references (leaves 84-88).en_US
dc.description.abstractIn this thesis, we implement efficient online learning algorithms using the Long Short Term Memory (LSTM) networks with low time and computational complexity. In Chapter 2, we investigate efficient covariance information-based online learning using the LSTM networks known as Co-LSTM networks. We utilize the covariance information into the LSTM gating structure and propose various effi- cient models. We reduce the computational complexity by applying the Weight Matrix Factorization (WMF) trick and derive the additive gradient based updates. In Chapter 3, we give a practical application of the network intrusion detection using the Co-LSTM networks. In Chapter 4, we propose a boosted binary version of Tree-LSTM networks which we call BBT-LSTM networks. We introduce the depth and windowing factor into the N-ary Tree-LSTM networks where each LSTM node is binarily split and the whole tree architecture grows in a balanced manner. In order to reduce the computational complexity of the BBT-LSTM networks, we apply WMF trick, replace the regular multiplication operator with the energy efficient operator and finally introduce the slicing operation on the BBT-LSTM network weight matrices. In Chapter 5, we propose another low complexity LSTM network based on a minimum number of hopping over the input data sequence. We study two methods to select the appropriate value of the hopping distance. Through an extensive set of experiments using the real-life data sets, we demonstrate the significant increase in the performance of the proposed algorithms at the end of each chapter.en_US
dc.description.degreeM.S.en_US
dc.description.provenanceSubmitted by Betül Özen (ozen@bilkent.edu.tr) on 2018-12-13T13:14:54Z No. of bitstreams: 1 thesis .pdf: 2009201 bytes, checksum: 7477e2a043d1263c420fb9ce6f2a8eae (MD5)en
dc.description.provenanceMade available in DSpace on 2018-12-13T13:14:54Z (GMT). No. of bitstreams: 1 thesis .pdf: 2009201 bytes, checksum: 7477e2a043d1263c420fb9ce6f2a8eae (MD5) Previous issue date: 2018-12en
dc.description.statementofresponsibilityby Ali Hassan Mirza.en_US
dc.embargo.release2020-12-10
dc.format.extentxvi, 97 leaves : charts (some color) ; 30 cm.en_US
dc.identifier.itemidB159503
dc.identifier.urihttp://hdl.handle.net/11693/48199
dc.language.isoEnglishen_US
dc.publisherBilkent Universityen_US
dc.rightsinfo:eu-repo/semantics/openAccessen_US
dc.subjectOnline Learningen_US
dc.subjectLSTMen_US
dc.subjectCovarianceen_US
dc.subjectTree-LSTMen_US
dc.subjectBoostingen_US
dc.subjectRegressionen_US
dc.subjectWMFen_US
dc.titleLow complexity efficient online learning algorithms using LSTM networksen_US
dc.title.alternativeUKSB ağları ile düşük karmaşıklığa sahip verimli çevrimiçi öğrenme algoritmalarıen_US
dc.typeThesisen_US

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
thesis .pdf
Size:
1.92 MB
Format:
Adobe Portable Document Format
Description:
Full printable version
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: