Time and context sensitive optimization of machine learning models for sequential data prediction

Fazla, Arda

Time and context sensitive optimization of machine learning models for sequential data prediction

buir.advisor	Kozat, Süleyman Serdar
dc.contributor.author	Fazla, Arda
dc.date.accessioned	2024-07-17T11:25:35Z
dc.date.available	2024-07-17T11:25:35Z
dc.date.copyright	2024-07
dc.date.issued	2024-07
dc.date.submitted	2024-07-16
dc.description	Cataloged from PDF version of article.	en_US
dc.description	Includes bibliographical references (leaves 63-70).	en_US
dc.description.abstract	We investigate the nonlinear prediction of sequential time series data through the mixture/combination of machine learning models. First, we introduce a novel ensemble learning approach that effectively combines multiple base learners in a time-aware and context-sensitive manner. This process involves a weight optimization problem targeting a specific loss function while considering (non)convex constraints on the linear combination of base learners. These constraints are theoretically analyzed under known statistics and are automatically incorporated into the meta-learner as part of the optimization process during training. Next, we introduce a direct two-stage approach based on the combination of linear and nonlinear models, where we jointly optimize the parameters of both models to minimize the final regression error. By this joint optimization, we alleviate the well-known underfitting and overfitting problems in modeling sequential data. As the linear model, we use a traditional linear time series forecasting model (SARIMAX) and as the nonlinear model, we use boosted soft decision trees (Soft GBDT). For both of our approaches, we illustrate notable performance improvements on real-life data and well-known competition datasets compared to traditional ensemble/mixture techniques and state-of-the-art forecasting models in the machine learning literature. Additionally, we make the source code of both of our approaches publicly available to facilitate further research and comparison.
dc.description.statementofresponsibility	by Arda Fazla
dc.format.extent	xiii, 82 leaves : charts ; 30 cm.
dc.identifier.itemid	B018558
dc.identifier.uri	https://hdl.handle.net/11693/115442
dc.language.iso	English
dc.rights	info:eu-repo/semantics/openAccess
dc.subject	Ensemble learning
dc.subject	Prediction / regression
dc.subject	Time series
dc.subject	Stochastic gradient descent (SGD)
dc.subject	Online learning
dc.subject	Artificial neural network (ANN)
dc.subject	Light gradient boosting machine (light GBM)
dc.subject	Seasonal auto-regressive integrated moving average with exogenous factors (SARIMAX)
dc.subject	Soft gradient boosting decision tree (soft GBDT)
dc.title	Time and context sensitive optimization of machine learning models for sequential data prediction
dc.title.alternative	Makine öğrenimi modellerinin sıralı veri tahmini için zaman ve bağlam duyarlı optimizasyonu
dc.type	Thesis
thesis.degree.discipline	Electrical and Electronic Engineering
thesis.degree.grantor	Bilkent University
thesis.degree.level	Master's
thesis.degree.name	MS (Master of Science)

Files

Original bundle

Now showing 1 - 1 of 1

Name:: B018558.pdf
Size:: 5.34 MB
Format:: Adobe Portable Document Format

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 2.1 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Graduate School of Engineering and Science