End-to-end hybrid architectures for effective sequential data prediction

Aydın, Mustafa Enes2023-09-072023-09-072023-082023-082023-09-04https://hdl.handle.net/11693/113838Cataloged from PDF version of article.Includes bibliographical references (leaves 57-64).We investigate nonlinear prediction in an online setting and introduce two hybrid models that effectively mitigate, via end-to-end architectures, the need for hand-designed features and manual model selection issues of conventional nonlinear prediction/regression methods. Particularly, we first use an enhanced recurrent neural network (LSTM) to extract features from sequential signals, while pre-serving the state information, i.e., the history, and soft gradient boosted decision trees (sGBDT) to produce the final output. The connection is in an end-to-end fashion and we jointly optimize the whole architecture using stochastic gradient descent. Secondly, we again use recursive structures (LSTM) for automatic fea-ture extraction out of raw data but accompany it with a traditional linear time series model (SARIMAX) to deal with the intricacies of the sequential data, e.g., seasonality. The unification of the models is again in a joint manner; it is through a single state space and we optimize the entire architecture using particle filter-ing. The proposed frameworks are generic so that one can use other recurrent architectures, e.g., GRUs, and differentiable machine learning algorithms as well as time series models that have state space representations in lieu of the specific models presented. We demonstrate the learning behavior of the models on syn-thetic data and the significant performance improvements over the conventional methods and the disjoint counterparts over various real life datasets, with which we also show the generic nature of the frameworks. Furthermore, we openly share the source code of the proposed methods to facilitate further research.xi, 64 leaves ; 30 cm.Englishinfo:eu-repo/semantics/openAccessOnline learningPredictionTime seriesEnd-to-end learningLong short-term memory (LSTM)Soft gradient boosting decision tree (sGBDT)Sea-sonal auto-regressive integrated moving average with exogenous regressors (SARI-MAX)End-to-end hybrid architectures for effective sequential data predictionEtkili ardışık veri tahmini için uçtan uca melez mimarilerThesisB162492