End-to-end hybrid architectures for effective sequential data prediction

Aydın, Mustafa Enes

buir.advisor	Kozat, Süleyman Serdar
dc.contributor.author	Aydın, Mustafa Enes
dc.date.accessioned	2023-09-07T08:35:13Z
dc.date.available	2023-09-07T08:35:13Z
dc.date.copyright	2023-08
dc.date.issued	2023-08
dc.date.submitted	2023-09-04
dc.description	Cataloged from PDF version of article.	en_US
dc.description	Includes bibliographical references (leaves 57-64).	en_US
dc.description.abstract	We investigate nonlinear prediction in an online setting and introduce two hybrid models that effectively mitigate, via end-to-end architectures, the need for hand-designed features and manual model selection issues of conventional nonlinear prediction/regression methods. Particularly, we first use an enhanced recurrent neural network (LSTM) to extract features from sequential signals, while preserving the state information, i.e., the history, and soft gradient boosted decision trees (sGBDT) to produce the final output. The connection is in an end-to-end fashion and we jointly optimize the whole architecture using stochastic gradient descent. Secondly, we again use recursive structures (LSTM) for automatic feature extraction out of raw data but accompany it with a traditional linear time series model (SARIMAX) to deal with the intricacies of the sequential data, e.g., seasonality. The unification of the models is again in a joint manner; it is through a single state space and we optimize the entire architecture using particle filtering. The proposed frameworks are generic so that one can use other recurrent architectures, e.g., GRUs, and differentiable machine learning algorithms as well as time series models that have state space representations in lieu of the specific models presented. We demonstrate the learning behavior of the models on synthetic data and the significant performance improvements over the conventional methods and the disjoint counterparts over various real life datasets, with which we also show the generic nature of the frameworks. Furthermore, we openly share the source code of the proposed methods to facilitate further research.
dc.description.statementofresponsibility	by Mustafa Enes Aydın
dc.format.extent	xi, 64 leaves ; 30 cm.
dc.identifier.itemid	B162492
dc.identifier.uri	https://hdl.handle.net/11693/113838
dc.language.iso	English
dc.rights	info:eu-repo/semantics/openAccess
dc.subject	Online learning
dc.subject	Prediction
dc.subject	Time series
dc.subject	End-to-end learning
dc.subject	Long short-term memory (LSTM)
dc.subject	Soft gradient boosting decision tree (sGBDT)
dc.subject	Seasonal auto-regressive integrated moving average with exogenous regressors (SARIMAX)
dc.title	End-to-end hybrid architectures for effective sequential data prediction
dc.title.alternative	Etkili ardışık veri tahmini için uçtan uca melez mimariler
dc.type	Thesis
thesis.degree.discipline	Electrical and Electronic Engineering
thesis.degree.grantor	Bilkent University
thesis.degree.level	Master's
thesis.degree.name	MS (Master of Science)

Files

Original bundle

Name: B162492.pdf
Size: 838.22 KB
Format: Adobe Portable Document Format

License bundle

Name: license.txt
Size: 2.01 KB
Format: Item-specific license agreed upon to submission

Collections

Graduate School of Engineering and Science