Time and context sensitive optimization of machine learning models for sequential data prediction

Fazla, Arda

Files

B018558.pdf (5.34 MB)

Date

2024-07

Authors

Fazla, Arda

Advisor

Kozat, Süleyman Serdar

Abstract

We investigate the nonlinear prediction of sequential time series data through the mixture (combination) of machine learning models. First, we introduce a novel ensemble learning approach that combines multiple base learners in a time-aware and context-sensitive manner. The approach solves a weight optimization problem that targets a specific loss function under convex or nonconvex constraints on the linear combination of base learners. These constraints are analyzed theoretically under known statistics and are incorporated automatically into the meta-learner as part of the optimization during training. Next, we introduce a direct two-stage approach that combines a linear and a nonlinear model, jointly optimizing the parameters of both models to minimize the final regression error. This joint optimization alleviates the well-known underfitting and overfitting problems in modeling sequential data. As the linear model, we use a traditional linear time series forecasting model (SARIMAX); as the nonlinear model, we use boosted soft decision trees (Soft GBDT). For both approaches, we demonstrate notable performance improvements on real-life data and well-known competition datasets compared to traditional ensemble/mixture techniques and state-of-the-art forecasting models in the machine learning literature. Additionally, we make the source code of both approaches publicly available to facilitate further research and comparison.
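
To make the idea of constrained weight optimization over base learners concrete, the following is a minimal illustrative sketch and not the thesis's implementation. It assumes a squared-error loss, a single static weight vector, and one particular convex constraint set (the probability simplex: nonnegative weights that sum to one), solved by projected gradient descent. The thesis's meta-learner is time-aware and context-sensitive, which this sketch does not model. The function names `project_to_simplex` and `fit_simplex_weights` are hypothetical.

```python
import numpy as np

def project_to_simplex(v):
    """Euclidean projection of v onto {w : w >= 0, sum(w) = 1}."""
    u = np.sort(v)[::-1]                      # sort in descending order
    css = np.cumsum(u) - 1.0                  # cumulative sums minus the target total
    ks = np.arange(1, len(v) + 1)
    rho = np.nonzero(u - css / ks > 0)[0][-1] # last index that stays positive
    theta = css[rho] / (rho + 1)              # shift that enforces sum(w) = 1
    return np.maximum(v - theta, 0.0)

def fit_simplex_weights(preds, y, n_iter=500):
    """Minimize mean squared error of preds @ w subject to w on the simplex.

    preds: (n_samples, n_learners) array of base learner predictions (assumed given).
    y:     (n_samples,) array of targets.
    """
    n, k = preds.shape
    gram = preds.T @ preds / n
    # Step size 1/L, where L is the Lipschitz constant of the MSE gradient.
    step = 1.0 / (2.0 * np.linalg.eigvalsh(gram)[-1])
    w = np.full(k, 1.0 / k)                   # start from the uniform average
    for _ in range(n_iter):
        grad = 2.0 * preds.T @ (preds @ w - y) / n
        w = project_to_simplex(w - step * grad)
    return w

# Usage on synthetic data: three noisy copies of a sine wave as "base learners".
rng = np.random.default_rng(0)
t = np.linspace(0.0, 20.0, 400)
y = np.sin(t)
preds = np.column_stack([y + rng.normal(0.0, s, t.size) for s in (0.1, 0.3, 1.0)])
w = fit_simplex_weights(preds, y)
```

Starting from the uniform average and using a step size of 1/L means each projected gradient step cannot increase the training loss, so the learned combination is never worse on the training data than simple averaging. Other constraint sets (for example unconstrained or affine weights) would only require swapping the projection step.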

Keywords

Ensemble learning, Prediction / regression, Time series, Stochastic gradient descent (SGD), Online learning, Artificial neural network (ANN), Light gradient boosting machine (light GBM), Seasonal auto-regressive integrated moving average with exogenous factors (SARIMAX), Soft gradient boosting decision tree (soft GBDT)

Degree Discipline

Electrical and Electronic Engineering

Degree Level

Master's

Degree Name

MS (Master of Science)

Permalink

https://hdl.handle.net/11693/115442

Collections

Graduate School of Engineering and Science

Language

English

Type

Thesis
