Prediction with expert advice: on the role of contexts, bandit feedback and risk-awareness

Ekşioğlu, Kubilay

buir.advisor	Tekin, Cem
dc.contributor.author	Ekşioğlu, Kubilay
dc.date.accessioned	2018-12-25T12:18:45Z
dc.date.available	2018-12-25T12:18:45Z
dc.date.copyright	2018-12
dc.date.issued	2018-12
dc.date.submitted	2018-12-21
dc.description	Cataloged from PDF version of article.	en_US
dc.description	Includes bibliographical references (leaves 54-59).	en_US
dc.description.abstract	Along with the rapid growth in the size of data generated and collected over time, the need for online algorithms that can provide answers without any offline training has increased considerably. In this thesis, we consider the prediction with expert advice problem under the online learning framework. Specifically, we consider problems where experts have asymmetric information about the sample space. First, we propose an algorithm that selects a subset of the experts and makes predictions based on the advice of this subset. Then, we propose another algorithm that clusters samples in an online manner and makes predictions based on the history of observations and decisions within each cluster. Next, we consider the Safe Bandit, a variant of the Risk-Aware Multi-Armed Bandit, where the goal is to minimize the number of rounds in which a risky arm is chosen. Adopting mean-variance as the risk notion, we define an arm as risky if its mean-variance is higher than a given threshold. Using this, we define a new regret measure called Risk Violation Regret (RVR), which depends on the number of times risky arms are selected. Then, we propose a learning algorithm called Exploration and Exploitation with Risk Thresholds (EXERT), and prove that it achieves O(1) RVR with high probability. Afterwards, we use EXERT in an expert selection problem, where each expert corresponds to a neural network with a reject option. For this, we propose a method to train these neural networks and use them to evaluate the performance of EXERT on real-world datasets.	en_US
dc.description.statementofresponsibility	by Kubilay Ekşioğlu.	en_US
dc.embargo.release	2019-06-21
dc.format.extent	xi, 59 leaves : charts (some color) ; 30 cm.	en_US
dc.identifier.itemid	B159204
dc.identifier.uri	http://hdl.handle.net/11693/48211
dc.language.iso	English	en_US
dc.rights	info:eu-repo/semantics/openAccess	en_US
dc.subject	Prediction with Expert Advice	en_US
dc.subject	Multi Armed Bandits	en_US
dc.subject	Online Learning	en_US
dc.subject	Neural Networks	en_US
dc.title	Prediction with expert advice: on the role of contexts, bandit feedback and risk-awareness	en_US
dc.title.alternative	Uzman önerileriyle tahmin: bağlamların, haydut geribildirimin ve risk farkındalığının rolü üzerine	en_US
dc.type	Thesis	en_US
thesis.degree.discipline	Electrical and Electronic Engineering
thesis.degree.grantor	Bilkent University
thesis.degree.level	Master's
thesis.degree.name	MS (Master of Science)

Files

Original bundle

Name: KubilayEksioglu_10225045.pdf
Size: 624.2 KB
Format: Adobe Portable Document Format
Description: Full printable version

License bundle

Name: license.txt
Size: 1.71 KB
Format: Item-specific license agreed upon to submission

Collections

Graduate School of Engineering and Science