Conservative policy construction using variational autoencoders for logged data with missing values

buir.contributor.authorTekin, Cem
buir.contributor.orcidTekin, Cem|0000-0003-4361-4021
dc.citation.epage11en_US
dc.citation.spage1en_US
dc.contributor.authorAbroshan, M.
dc.contributor.authorYip, K. H.
dc.contributor.authorTekin, Cem
dc.contributor.authorVan Der Schaar, M.
dc.date.accessioned2023-02-16T06:14:31Z
dc.date.available2023-02-16T06:14:31Z
dc.date.issued2022-01-10
dc.departmentDepartment of Electrical and Electronics Engineeringen_US
dc.description.abstractIn high-stakes applications of data-driven decision-making such as healthcare, it is of paramount importance to learn a policy that maximizes the reward while avoiding potentially dangerous actions when there is uncertainty. There are two main challenges usually associated with this problem. First, learning through online exploration is not possible due to the critical nature of such applications. Therefore, we need to resort to observational datasets with no counterfactuals. Second, such datasets are usually imperfect, additionally cursed with missing values in the attributes of features. In this article, we consider the problem of constructing personalized policies using logged data when there are missing values in the attributes of features in both training and test data. The goal is to recommend an action (treatment) when ~X, a degraded version of Xwith missing values, is observed. We consider three strategies for dealing with missingness. In particular, we introduce the conservative strategy where the policy is designed to safely handle the uncertainty due to missingness. In order to implement this strategy, we need to estimate posterior distribution p(X|~X) and use a variational autoencoder to achieve this. In particular, our method is based on partial variational autoencoders (PVAEs) that are designed to capture the underlying structure of features with missing values.en_US
dc.description.provenanceSubmitted by Fatma Kaya (fattttoky.55@gmail.com) on 2023-02-16T06:14:31Z No. of bitstreams: 1 Conservative_policy_construction_using_variational_autoencoders_for_logged_data_with_missing_values.pdf: 996953 bytes, checksum: e1e26212117f3d73d55a9cb422c63383 (MD5)en
dc.description.provenanceMade available in DSpace on 2023-02-16T06:14:31Z (GMT). No. of bitstreams: 1 Conservative_policy_construction_using_variational_autoencoders_for_logged_data_with_missing_values.pdf: 996953 bytes, checksum: e1e26212117f3d73d55a9cb422c63383 (MD5) Previous issue date: 2022-01-10en
dc.identifier.doi10.1109/TNNLS.2021.3136385en_US
dc.identifier.eissn2162-2388
dc.identifier.issn2162-237X
dc.identifier.urihttp://hdl.handle.net/11693/111379
dc.language.isoEnglishen_US
dc.publisherInstitute of Electrical and Electronics Engineers Inc.en_US
dc.relation.isversionofhttps://www.doi.org/10.1109/TNNLS.2021.3136385en_US
dc.source.titleIEEE Transactions on Neural Networks and Learning Systemsen_US
dc.subjectMissing valuesen_US
dc.subjectObservational dataen_US
dc.subjectPolicy constructionen_US
dc.subjectVariational autoencoderen_US
dc.titleConservative policy construction using variational autoencoders for logged data with missing valuesen_US
dc.typeArticleen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Conservative_policy_construction_using_variational_autoencoders_for_logged_data_with_missing_values.pdf
Size:
973.59 KB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.69 KB
Format:
Item-specific license agreed upon to submission
Description: