Conservative policy construction using variational autoencoders for logged data with missing values

Abroshan, M.; Yip, K. H.; Tekin, Cem; Van Der Schaar, M.

Conservative policy construction using variational autoencoders for logged data with missing values

buir.contributor.author	Tekin, Cem
buir.contributor.orcid	Tekin, Cem\|0000-0003-4361-4021
dc.citation.epage	11	en_US
dc.citation.spage	1	en_US
dc.contributor.author	Abroshan, M.
dc.contributor.author	Yip, K. H.
dc.contributor.author	Tekin, Cem
dc.contributor.author	Van Der Schaar, M.
dc.date.accessioned	2023-02-16T06:14:31Z
dc.date.available	2023-02-16T06:14:31Z
dc.date.issued	2022-01-10
dc.department	Department of Electrical and Electronics Engineering	en_US
dc.description.abstract	In high-stakes applications of data-driven decision-making such as healthcare, it is of paramount importance to learn a policy that maximizes the reward while avoiding potentially dangerous actions when there is uncertainty. There are two main challenges usually associated with this problem. First, learning through online exploration is not possible due to the critical nature of such applications. Therefore, we need to resort to observational datasets with no counterfactuals. Second, such datasets are usually imperfect, additionally cursed with missing values in the attributes of features. In this article, we consider the problem of constructing personalized policies using logged data when there are missing values in the attributes of features in both training and test data. The goal is to recommend an action (treatment) when ~X, a degraded version of Xwith missing values, is observed. We consider three strategies for dealing with missingness. In particular, we introduce the conservative strategy where the policy is designed to safely handle the uncertainty due to missingness. In order to implement this strategy, we need to estimate posterior distribution p(X\|~X) and use a variational autoencoder to achieve this. In particular, our method is based on partial variational autoencoders (PVAEs) that are designed to capture the underlying structure of features with missing values.	en_US
dc.identifier.doi	10.1109/TNNLS.2021.3136385	en_US
dc.identifier.eissn	2162-2388
dc.identifier.issn	2162-237X
dc.identifier.uri	http://hdl.handle.net/11693/111379
dc.language.iso	English	en_US
dc.publisher	Institute of Electrical and Electronics Engineers Inc.	en_US
dc.relation.isversionof	https://www.doi.org/10.1109/TNNLS.2021.3136385	en_US
dc.source.title	IEEE Transactions on Neural Networks and Learning Systems	en_US
dc.subject	Missing values	en_US
dc.subject	Observational data	en_US
dc.subject	Policy construction	en_US
dc.subject	Variational autoencoder	en_US
dc.title	Conservative policy construction using variational autoencoders for logged data with missing values	en_US
dc.type	Article	en_US

Files

Original bundle

Now showing 1 - 1 of 1

Name:: Conservative_policy_construction_using_variational_autoencoders_for_logged_data_with_missing_values.pdf
Size:: 973.59 KB
Format:: Adobe Portable Document Format
Description:

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 1.69 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Scholarly Publications - Electrical and Electronics Engineering