Items where Author is "Uehara, Masatoshi"
Number of items: 3.
Future-dependent value-based off-policy evaluation in POMDPs.
Uehara, Masatoshi; Kiyohara, Haruka; Bennett, Andrew; Chernozhukov, Victor; Jiang, Nan; Kallus, Nathan; Shi, Chengchun; Sun, Wenguang
picture_as_pdf
A minimax learning approach to off-policy evaluation in confounded Partially Observable Markov Decision Processes.
Shi, Chengchun; Uehara, Masatoshi; Uehara, Masatoshi; Huang, Jiawei; Jiang, Nan
picture_as_pdf
A review of off-policy evaluation in reinforcement learning.
Uehara, Masatoshi; Shi, Chengchun; Kallus, Nathan
picture_as_pdf