End-to-end hybrid architectures for effective sequential data prediction

buir.advisorKozat, Süleyman Serdar
dc.contributor.authorAydın, Mustafa Enes
dc.date.accessioned2023-09-07T08:35:13Z
dc.date.available2023-09-07T08:35:13Z
dc.date.copyright2023-08
dc.date.issued2023-08
dc.date.submitted2023-09-04
dc.descriptionCataloged from PDF version of article.
dc.descriptionThesis (Master's): Bilkent University, Department of Electrical and Electronics Engineering, İhsan Doğramacı Bilkent University, 2023.
dc.descriptionIncludes bibliographical references (leaves 57-64).
dc.description.abstractWe investigate nonlinear prediction in an online setting and introduce two hybrid models that effectively mitigate, via end-to-end architectures, the need for hand-designed features and manual model selection issues of conventional nonlinear prediction/regression methods. Particularly, we first use an enhanced recurrent neural network (LSTM) to extract features from sequential signals, while pre-serving the state information, i.e., the history, and soft gradient boosted decision trees (sGBDT) to produce the final output. The connection is in an end-to-end fashion and we jointly optimize the whole architecture using stochastic gradient descent. Secondly, we again use recursive structures (LSTM) for automatic fea-ture extraction out of raw data but accompany it with a traditional linear time series model (SARIMAX) to deal with the intricacies of the sequential data, e.g., seasonality. The unification of the models is again in a joint manner; it is through a single state space and we optimize the entire architecture using particle filter-ing. The proposed frameworks are generic so that one can use other recurrent architectures, e.g., GRUs, and differentiable machine learning algorithms as well as time series models that have state space representations in lieu of the specific models presented. We demonstrate the learning behavior of the models on syn-thetic data and the significant performance improvements over the conventional methods and the disjoint counterparts over various real life datasets, with which we also show the generic nature of the frameworks. Furthermore, we openly share the source code of the proposed methods to facilitate further research.
dc.description.provenanceMade available in DSpace on 2023-09-07T08:35:13Z (GMT). No. of bitstreams: 1 B162492.pdf: 858336 bytes, checksum: 513cd3fc4cdf0fbe5067bc96eebd118a (MD5) Previous issue date: 2023-08en
dc.description.statementofresponsibilityby Mustafa Enes Aydın
dc.format.extentxi, 64 leaves ; 30 cm.
dc.identifier.itemidB162492
dc.identifier.urihttps://hdl.handle.net/11693/113838
dc.language.isoEnglish
dc.rightsinfo:eu-repo/semantics/openAccess
dc.subjectOnline learning
dc.subjectPrediction
dc.subjectTime series
dc.subjectEnd-to-end learning
dc.subjectLong short-term memory (LSTM)
dc.subjectSoft gradient boosting decision tree (sGBDT)
dc.subjectSea-sonal auto-regressive integrated moving average with exogenous regressors (SARI-MAX)
dc.titleEnd-to-end hybrid architectures for effective sequential data prediction
dc.title.alternativeEtkili ardışık veri tahmini için uçtan uca melez mimariler
dc.typeThesis
thesis.degree.disciplineElectrical and Electronic Engineering
thesis.degree.grantorBilkent University
thesis.degree.levelMaster's
thesis.degree.nameMS (Master of Science)

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
B162492.pdf
Size:
838.22 KB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
2.01 KB
Format:
Item-specific license agreed upon to submission
Description: