Personalizing treatments via contextual multi-armed bandits by identifying relevance
buir.advisor | Tekin, Cem | |
dc.contributor.author | Bulucu, Cem | |
dc.date.accessioned | 2019-08-23T06:24:50Z | |
dc.date.available | 2019-08-23T06:24:50Z | |
dc.date.copyright | 2019-08 | |
dc.date.issued | 2019-08 | |
dc.date.submitted | 2019-08-21 | |
dc.description | Cataloged from PDF version of article. | en_US |
dc.description | Thesis (M.S.): Bilkent University, Department of Electrical and Electronics Engineering, İhsan Doğramacı Bilkent University, 2019. | en_US |
dc.description | Includes bibliographical references (leaves 57-62). | en_US |
dc.description.abstract | Personalized medicine offers specialized treatment options for individuals which is vital as every patient is different. One-size-fits-all approaches are often not effective and most patients require personalized care when dealing with various diseases like cancer, heart diseases or diabetes. As vast amounts of data became available in medicine (and otherfields including web-based recommender systems and intelligent radio networks), online learning approaches are gaining popularity due to their ability to learn fast in uncertain environments. Contextual multi-armed bandit algorithms provide reliable sequential decision-making options in such applications. In medical settings (also in other aforementioned settings), data (contexts) and actions (arms) are often high-dimensional and performances of traditional contextual multi-armed bandit approaches are almost as bad as random selection, due to the curse of dimensionality. Fortunately, in many cases the information relevant to the decision-making task does not depend on all dimensions but rather depends on a small subset of dimensions, called the relevant dimensions. In this thesis, we aim to provide personalized treatments for patients sequentially arriving over time by using contextual multi-armed bandit approaches when the expected rewards related to patient outcomes only vary on a small subset of context and arm dimensions. For this purpose,first we make use of the contextual multi-armed bandit with relevance learning (CMAB-RL) algorithm which learns the relevance by employing a novel partitioning strategy on the context-arm space and forming a set of candidate relevant dimension tuples. In this model, the set of relevant patient traits are allowed to be different for different bolus insulin dosages. Next, we consider an environment where the expected reward function defined over the context-arm space is sampled from a Gaussian process. For this setting, we propose an extension to the contextual Gaussian process upper confidence bound (CGP-UCB) algorithm, called CGP-UCB with relevance learning (CGP-UCB-RL), that learns the relevance by integrating kernels that allow weights to be associated with each dimension and optimizing the negative log marginal likelihood. Then, we investigate the suitability of this approach in the blood glucose regulation problem. Aside from applying both algorithms to the bolus insulin administration problem, we also evaluate their performance in synthetically generated environments as benchmarks. | en_US |
dc.description.provenance | Submitted by Betül Özen (ozen@bilkent.edu.tr) on 2019-08-23T06:24:50Z No. of bitstreams: 1 cem_bulucu_thesis.pdf: 12795475 bytes, checksum: c5dc020f06ff51dc1b1275b2f39c0418 (MD5) | en |
dc.description.provenance | Made available in DSpace on 2019-08-23T06:24:50Z (GMT). No. of bitstreams: 1 cem_bulucu_thesis.pdf: 12795475 bytes, checksum: c5dc020f06ff51dc1b1275b2f39c0418 (MD5) Previous issue date: 2019-08 | en |
dc.description.statementofresponsibility | by Cem Bulucu | en_US |
dc.embargo.release | 2020-02-09 | |
dc.format.extent | xii, 62 leaves : charts (some color) ; 30 cm. | en_US |
dc.identifier.itemid | B105647 | |
dc.identifier.uri | http://hdl.handle.net/11693/52359 | |
dc.language.iso | English | en_US |
dc.rights | info:eu-repo/semantics/openAccess | en_US |
dc.subject | Online Learning | en_US |
dc.subject | Contextual multi-armed bandits | en_US |
dc.subject | Contextual Gaussian process bandits | en_US |
dc.subject | Relevance learning | en_US |
dc.subject | Personalized medicine | en_US |
dc.title | Personalizing treatments via contextual multi-armed bandits by identifying relevance | en_US |
dc.title.alternative | İlgi belirleyerek bağlamsal çok kollu haydutlar ile tedavileri kişiselleştirme | en_US |
dc.type | Thesis | en_US |
thesis.degree.discipline | Electrical and Electronic Engineering | |
thesis.degree.grantor | Bilkent University | |
thesis.degree.level | Master's | |
thesis.degree.name | MS (Master of Science) |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- cem_bulucu_thesis.pdf
- Size:
- 12.2 MB
- Format:
- Adobe Portable Document Format
- Description:
- Full printable version
License bundle
1 - 1 of 1
No Thumbnail Available
- Name:
- license.txt
- Size:
- 1.71 KB
- Format:
- Item-specific license agreed upon to submission
- Description: