A hybrid framework for sequential data prediction with end-to-end optimization

buir.contributor.authorKozat, Süleyman S.
buir.contributor.orcidKozat, Süleyman S.|0000-0002-6488-3848
dc.citation.epage12en_US
dc.citation.spage1en_US
dc.citation.volumeNumber129en_US
dc.contributor.authorAydin, M.E.
dc.contributor.authorKozat, Süleyman S.
dc.date.accessioned2023-02-20T07:46:01Z
dc.date.available2023-02-20T07:46:01Z
dc.date.issued2022-08-08
dc.departmentDepartment of Electrical and Electronics Engineeringen_US
dc.description.abstractWe investigate nonlinear prediction in an online setting and introduce a hybrid model that effectively mitigates, via an end-to-end architecture, the need for hand-designed features and manual model selection issues of conventional nonlinear prediction/regression methods. In particular, we use recursive structures to extract features from sequential signals, while preserving the state information, i.e., the history, and boosted decision trees to produce the final output. The connection is in an end-to-end fashion and we jointly optimize the whole architecture using stochastic gradient descent, for which we also provide the backward pass update equations. In particular, we employ a recurrent neural network (LSTM) for adaptive feature extraction from sequential data and a gradient boosting machinery (soft GBDT) for effective supervised regression. Our framework is generic so that one can use other deep learning architectures for feature extraction (such as RNNs and GRUs) and machine learning algorithms for decision making as long as they are differentiable. We demonstrate the learning behavior of our algorithm on synthetic data and the significant performance improvements over the conventional methods over various real life datasets. Furthermore, we openly share the source code of the proposed method to facilitate further research. © 2022 Elsevier Inc.en_US
dc.identifier.doi10.1016/j.dsp.2022.103687en_US
dc.identifier.eissn1095-4333
dc.identifier.issn1051-2004
dc.identifier.urihttp://hdl.handle.net/11693/111542
dc.language.isoEnglishen_US
dc.publisherElsevieren_US
dc.relation.isversionofhttps://dx.doi.org/10.1016/j.dsp.2022.103687en_US
dc.source.titleDigital Signal Processingen_US
dc.subjectEnd-to-end learningen_US
dc.subjectFeature extractionen_US
dc.subjectLong short-term memory (LSTM)en_US
dc.subjectOnline learningen_US
dc.subjectPredictionen_US
dc.subjectSoft gradient boosting decision tree (sGBDT)en_US
dc.titleA hybrid framework for sequential data prediction with end-to-end optimizationen_US
dc.typeArticleen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
A hybrid framework for sequential data prediction with end-to-end optimization.pdf
Size:
1.24 MB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.69 KB
Format:
Item-specific license agreed upon to submission
Description: