Deep jump learning for off-policy evaluation in continuous treatment settings

Cai, H., Shi, C., Song, R. & Lu, W. (2021). Deep jump learning for off-policy evaluation in continuous treatment settings. In Proceedings of the 35th Conference on Neural Information Processing Systems.

We consider off-policy evaluation (OPE) in continuous treatment settings, such as personalized dose-finding. In OPE, one aims to estimate the mean outcome under a new treatment decision rule using historical data generated by a different decision rule. Most existing works on OPE focus on discrete treatment settings. To handle continuous treatments, we develop a novel estimation method for OPE using deep jump learning. The key ingredient of our method is an adaptive discretization of the treatment space, which we call deep discretization, built on deep learning and multi-scale change point detection. This allows us to apply existing OPE methods developed for discrete treatments to continuous treatments. Our method is further justified by theoretical results, simulations, and a real application to warfarin dosing.
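The general idea of discretizing a continuous treatment and then applying a discrete-treatment OPE estimator can be illustrated with a minimal Python sketch. It is not the authors' deep jump learning procedure: it assumes a fixed, user-supplied grid of dose intervals rather than intervals chosen adaptively by deep learning and change point detection, and it uses a plain inverse propensity weighting (IPW) estimator with a known behavior policy. The names `discretized_ipw_value`, `target_policy`, `bin_edges`, and `bin_propensity`, along with the simulated data, are hypothetical and chosen only for illustration.

```python
import numpy as np


def discretized_ipw_value(X, A, Y, target_policy, bin_edges, bin_propensity):
    """IPW value estimate after binning a continuous treatment.

    Assumptions (not from the paper): bin_edges is a fixed increasing grid,
    and bin_propensity(X, k) returns P(A falls in bin k | X) under the
    behavior policy that generated the data.
    """
    n_bins = len(bin_edges) - 1
    # Bin index of each observed dose; clip so the right endpoint joins the last bin.
    observed_bin = np.clip(np.digitize(A, bin_edges) - 1, 0, n_bins - 1)
    # Bin index of the dose recommended by the target policy.
    target_bin = np.clip(np.digitize(target_policy(X), bin_edges) - 1, 0, n_bins - 1)
    # Keep only samples whose observed dose lies in the recommended bin,
    # reweighted by the inverse probability of landing in that bin.
    match = observed_bin == target_bin
    weights = match / bin_propensity(X, target_bin)
    return float(np.mean(weights * Y))


# Simulated data: doses are assigned uniformly on [0, 1], and the outcome
# jumps by 1 once the dose reaches 0.5 (piecewise constant in the dose).
rng = np.random.default_rng(0)
n = 20000
X = rng.uniform(0.0, 1.0, size=n)
A = rng.uniform(0.0, 1.0, size=n)
Y = X + (A >= 0.5) + rng.normal(0.0, 0.1, size=n)

bin_edges = np.array([0.0, 0.5, 1.0])
widths = np.diff(bin_edges)


def uniform_bin_propensity(X, k):
    # Under uniform dosing on [0, 1], P(A in bin k | X) equals the bin width.
    return widths[k]


def always_high_dose(X):
    # Hypothetical target policy: recommend dose 0.75 for every covariate value.
    return np.full(len(X), 0.75)


estimate = discretized_ipw_value(
    X, A, Y, always_high_dose, bin_edges, uniform_bin_propensity
)
print(f"Estimated value of the target policy: {estimate:.3f}")
```

In this simulation the true value of the target policy is E[X] + 1 = 1.5, so the estimate should land close to that figure. The sketch works here only because the grid happens to align with the jump in the outcome at dose 0.5; choosing such intervals from data is the problem the adaptive discretization described in the abstract is meant to address.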

Item Type	Chapter
Copyright holders	© 2021 The Authors
Departments	LSE > Academic Departments > Statistics
Date Deposited	13 October 2021
Acceptance Date	28 September 2021
URI	https://researchonline.lse.ac.uk/id/eprint/112419

Explore Further

Shi, Chengchun

https://nips.cc/ (Author)
https://www.lse.ac.uk/Statistics/People/Dr-Chengchun-Shi (Author)
https://proceedings.neurips.cc/paper/2021/hash/816b112c6105b3ebd537828a39af4818-Abstract.html
https://proceedings.neurips.cc/ (Official URL)
