Time and context sensitive optimization of machine learning models for sequential data prediction

buir.advisorKozat, Süleyman Serdar
dc.contributor.authorFazla, Arda
dc.date.accessioned2024-07-17T11:25:35Z
dc.date.available2024-07-17T11:25:35Z
dc.date.copyright2024-07
dc.date.issued2024-07
dc.date.submitted2024-07-16
dc.descriptionCataloged from PDF version of article.en_US
dc.descriptionIncludes bibliographical references (leaves 63-70).en_US
dc.description.abstractWe investigate the nonlinear prediction of sequential time series data through the mixture/combination of machine learning models. First, we introduce a novel ensemble learning approach that effectively combines multiple base learners in a time-aware and context-sensitive manner. This process involves a weight optimization problem targeting a specific loss function while considering (non)convex constraints on the linear combination of base learners. These constraints are theoretically analyzed under known statistics and are automatically incorporated into the meta-learner as part of the optimization process during training. Next, we introduce a direct two-stage approach based on the combination of linear and nonlinear models, where we jointly optimize the parameters of both models to minimize the final regression error. By this joint optimization, we alleviate the well-known underfitting and overfitting problems in modeling sequential data. As the linear model, we use a traditional linear time series forecasting model (SARIMAX) and as the nonlinear model, we use boosted soft decision trees (Soft GBDT). For both of our approaches, we illustrate notable performance improvements on real-life data and well-known competition datasets compared to traditional ensemble/mixture techniques and state-of-the-art forecasting models in the machine learning literature. Additionally, we make the source code of both of our approaches publicly available to facilitate further research and comparison.
dc.description.statementofresponsibilityby Arda Fazla
dc.format.extentxiii, 82 leaves : charts ; 30 cm.
dc.identifier.itemidB018558
dc.identifier.urihttps://hdl.handle.net/11693/115442
dc.language.isoEnglish
dc.rightsinfo:eu-repo/semantics/openAccess
dc.subjectEnsemble learning
dc.subjectPrediction / regression
dc.subjectTime series
dc.subjectStochastic gradient descent (SGD)
dc.subjectOnline learning
dc.subjectArtificial neural network (ANN)
dc.subjectLight gradient boosting machine (light GBM)
dc.subjectSeasonal auto-regressive integrated moving average with exogenous factors (SARIMAX)
dc.subjectSoft gradient boosting decision tree (soft GBDT)
dc.titleTime and context sensitive optimization of machine learning models for sequential data prediction
dc.title.alternativeMakine öğrenimi modellerinin sıralı veri tahmini için zaman ve bağlam duyarlı optimizasyonu
dc.typeThesis
thesis.degree.disciplineElectrical and Electronic Engineering
thesis.degree.grantorBilkent University
thesis.degree.levelMaster's
thesis.degree.nameMS (Master of Science)

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
B018558.pdf
Size:
5.34 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
2.1 KB
Format:
Item-specific license agreed upon to submission
Description: