Items where Author is "Luo, Shikai"
Number of items: 11.
Robust offline reinforcement learning with heavy-tailed rewards. (2024)
Zhu, Jin; Wan, Runzhe; Qi, Zhengling; Luo, Shikai; Shi, Chengchun
picture_as_pdf
A multiagent reinforcement learning framework for off-policy evaluation in two-sided markets. (2023)
Shi, Chengchun; Wan, Runzhe; Song, Ge; Luo, Shikai; Zhu, Hongtu; Song, Rui
picture_as_pdf
Conformal off-policy prediction. (2023)
Zhang, Yingying; Shi, Chengchun; Luo, Shikai
picture_as_pdf
Off-policy confidence interval estimation with confounded Markov decision process. (2022)
Shi, Chengchun; Zhu, Jin; Shen, Ye; Luo, Shikai; Zhu, Hongtu; Song, Rui
picture_as_pdf
Statistically efficient advantage learning for offline reinforcement learning in infinite horizons. (2022)
Shi, Chengchun; Luo, Shikai; Le, Yuan; Zhu, Hongtu; Song, Rui
picture_as_pdf
Dynamic causal effects evaluation in A/B testing with a reinforcement learning framework. (2022)
Shi, Chengchun; Wang, Xiaoyu; Luo, Shikai; Zhu, Hongtu; Ye, Jieping; Song, Rui
picture_as_pdf
DNet:distributional network for distributional individualized treatment effects.
Wu, Guojun; Song, Ge; Lv, Xiaoxiang; Luo, Shikai; Shi, Chengchun; Zhu, Hongtu
picture_as_pdf
Pattern transfer learning for reinforcement learning in order dispatching.
Wan, Runzhe; Zhang, Sheng; Shi, Chengchun; Luo, Shikai; Song, Rui
picture_as_pdf
Policy evaluation for temporal and/or spatial dependent experiments.
Luo, Shikai; Yang, Ying; Shi, Chengchun; Yao, Fang; Ye, Jieping; Zhu, Hongtu
picture_as_pdf
An instrumental variable approach to confounded off-policy evaluation.
Xu, Yang; Zhu, Jin; Shi, Chengchun; Luo, Shikai; Song, Rui
picture_as_pdf
An online sequential test for qualitative treatment effects.
Shi, Chengchun; Luo, Shikai; Zhu, Hongtu; Song, Rui
picture_as_pdf