Number of items: 52.
Article
Off-policy evaluation in doubly inhomogeneous environments. (2025)
Bian, Zeyu; Shi, Chengchun; Qi, Zhengling; Wang, Lan
picture_as_pdf
A multiagent reinforcement learning framework for off-policy evaluation in two-sided markets. (2023)
Shi, Chengchun; Wan, Runzhe; Song, Ge; Luo, Shikai; Zhu, Hongtu; Song, Rui
picture_as_pdf
Conformal off-policy prediction. (2023)
Zhang, Yingying; Shi, Chengchun; Luo, Shikai
picture_as_pdf
Sequential pathway inference for multimodal neuroimaging analysis. (2022)
Li, Lexin; Shi, Chengchun; Guo, Tengfei; Jagust, William J.
picture_as_pdf
Off-policy confidence interval estimation with confounded Markov decision process. (2022)
Shi, Chengchun; Zhu, Jin; Shen, Ye; Luo, Shikai; Zhu, Hongtu; Song, Rui
picture_as_pdf
Testing mediation effects using logic of Boolean matrices. (2022)
Shi, Chengchun; Li, Lexin
picture_as_pdf
Statistically efficient advantage learning for offline reinforcement learning in infinite horizons. (2022)
Shi, Chengchun; Luo, Shikai; Le, Yuan; Zhu, Hongtu; Song, Rui
picture_as_pdf
Statistical inference of the value function for reinforcement learning in infinite-horizon settings. (2022)
Shi, Chengchun; Zhang, Shengxing; Lu, Wenbin; Song, Rui
picture_as_pdf
Dynamic causal effects evaluation in A/B testing with a reinforcement learning framework. (2022)
Shi, Chengchun; Wang, Xiaoyu; Luo, Shikai; Zhu, Hongtu; Ye, Jieping; Song, Rui
picture_as_pdf
Breaking the curse of nonregularity with subagging:inference of the mean outcome under optimal treatment regimes. (2020)
Shi, Chengchun; Lu, Wenbin; Song, Rui
picture_as_pdf
Combining experimental and historical data for policy evaluation.
Li, Ting; Shi, Chengchun; Wen, Qianglin; Sui, Yang; Qin, Yongli; Lai, Chunbo; Zhu, Hongtu
picture_as_pdf
Concordance and value information criteria for optimal treatment decision.
Shi, Chengchun; Song, R; Lu, W
picture_as_pdf
DNet:distributional network for distributional individualized treatment effects.
Wu, Guojun; Song, Ge; Lv, Xiaoxiang; Luo, Shikai; Shi, Chengchun; Zhu, Hongtu
picture_as_pdf
Deep spectral Q-learning with application to mobile health.
Gao, Yuhe; Shi, Chengchun; Song, Rui
picture_as_pdf
Determining the number of latent factors in statistical multi-relational learning.
Shi, Chengchun; Lu, Wenbin; Song, Rui
picture_as_pdf
Double generative adversarial networks for conditional independence testing.
Shi, Chengchun; Xu, Tianlin; Bergsma, Wicher; Li, Lexin
picture_as_pdf
Dynamic noise estimation:a generalized method for modeling noise fluctuations in decision-making.
Li, Jing Jing; Shi, Chengchun; Li, Lexin; Collins, Anne G.E.
picture_as_pdf
Evaluating dynamic conditional quantile treatment effects with applications in ridesharing.
Li, Ting; Shi, Chengchun; Lu, Zhaohua; Li, Yi; Zhu, Hongtu
picture_as_pdf
High-dimensional A-learning for optimal dynamic treatment regimes.
Shi, Chengchun; Fan, Ailin; Song, Rui; Lu, Wenbin
picture_as_pdf
Jump interval-learning for individualized decision making with continuous treatments.
Cai, Hengrui; Shi, Chengchun; Song, Rui; Lu, Wenbin
picture_as_pdf
Linear hypothesis testing for high dimensional generalized linear models.
Shi, Chengchun; Song, Rui; Chen, Zhao; Li, Runze
picture_as_pdf
Maximin projection learning for optimal treatment decision with heterogeneous individualized treatment effects.
Shi, Chengchun; Song, Rui; Lu, Wenbin; Fu, Bo
picture_as_pdf
Multivariate dynamic mediation analysis under a reinforcement learning framework.
Lan Luo, By; Shi, Chengchun; Wang, Jitao; Wu, Zhenke; Li, Lexin
picture_as_pdf
On testing conditional qualitative treatment effects.
Shi, Chengchun; Song, Rui; Lu, Wenbin
picture_as_pdf
Optimizing pessimism in dynamic treatment regimes:a Bayesian learning approach.
Zhou, Yunzhe; Qi, Zhengling; Shi, Chengchun; Li, Lexin
picture_as_pdf
Policy evaluation for temporal and/or spatial dependent experiments.
Luo, Shikai; Yang, Ying; Shi, Chengchun; Yao, Fang; Ye, Jieping; Zhu, Hongtu
picture_as_pdf
Robust learning for optimal treatment decision with NP-dimensionality.
Shi, Chengchun; Song, Rui; Lu, Wenbin
picture_as_pdf
Robust offline reinforcement learning with heavy-tailed rewards.
Zhu, Jin; Wan, Runzhe; Qi, Zhengling; Luo, Shikai; Shi, Chengchun
picture_as_pdf
Statistical inference for high-dimensional models via recursive online-score estimation.
Shi, Chengchun; Song, Rui; Lu, Wenbin; Li, Runzi
picture_as_pdf
Statistics and AI:a rireside conversation.
Lin, Xihong; Cai, Tianxi; Donoho, David; Fu, Haoda; Ke, Tracy; Jin, Jiashun; Meng, Xiao-Li; Qu, Annie; Shi, Chengchun; Song, Peter; Sun, Qiang; Wang, Wenyi; Wu, Hulin; Yu, Bin; Zhang, Heping; Zheng, Tian; Zhou, Harrison; Zhou, Jin; Zhu, Hongtu; Zhu, Ji
picture_as_pdf
Testing directed acyclic graph via structural, supervised and generative adversarial learning.
Shi, Chengchun; Zhou, Yunzhe; Li, Lexin
picture_as_pdf
Testing for the Markov property in time series via deep conditional generative learning.
Zhou, Yunzhe; Shi, Chengchun; Li, Lexin; Yao, Qiwei
picture_as_pdf
Testing stationarity and change point detection in reinforcement learning.
Li, Mengbing; Shi, Chengchun; Wu, Zhenke; Fryzlewicz, Piotr
picture_as_pdf
Value enhancement of reinforcement learning via efficient and robust trust region optimization.
Shi, Chengchun; Qi, Zhengling; Wang, Jianing; Zhou, Fan
picture_as_pdf
An instrumental variable approach to confounded off-policy evaluation.
Xu, Yang; Zhu, Jin; Shi, Chengchun; Luo, Shikai; Song, Rui
picture_as_pdf
A massive data framework for M-estimators with cubic-rate.
Shi, Chengchun; Lu, Wenbin; Song, Rui
picture_as_pdf
A minimax learning approach to off-policy evaluation in confounded Partially Observable Markov Decision Processes.
Shi, Chengchun; Uehara, Masatoshi; Uehara, Masatoshi; Huang, Jiawei; Jiang, Nan
picture_as_pdf
An online sequential test for qualitative treatment effects.
Shi, Chengchun; Luo, Shikai; Zhu, Hongtu; Song, Rui
picture_as_pdf
A reinforcement learning framework for dynamic mediation analysis.
Ge, Lin; Wang, Jitao; Shi, Chengchun; Wu, Zhenke; Song, Rui
picture_as_pdf
A review of off-policy evaluation in reinforcement learning.
Uehara, Masatoshi; Shi, Chengchun; Kallus, Nathan
picture_as_pdf
A robust test for the stationarity assumption in sequential decision making.
Wang, Jitao; Shi, Chengchun; Wu, Zhenke
picture_as_pdf
simplexreg:an R package for regression analysis of proportional data using the simplex distribution.
Zhang, Peng; Qiu, Zhenguo; Shi, Chengchun
picture_as_pdf
A sparse random projection-based test for overall qualitative treatment effects.
Shi, Chengchun; Lu, Wenbin; Song, Rui
picture_as_pdf
Chapter
Deep jump learning for off-policy evaluation in continuous treatment settings.
Cai, Hengrui; Shi, Chengchun; Song, Rui; Lu, Wenbin
picture_as_pdf
Future-dependent value-based off-policy evaluation in POMDPs.
Uehara, Masatoshi; Kiyohara, Haruka; Bennett, Andrew; Chernozhukov, Victor; Jiang, Nan; Kallus, Nathan; Shi, Chengchun; Sun, Wenguang
picture_as_pdf
Optimal treatment allocation for efficient policy evaluation in sequential decision making.
Li, Ting; Shi, Chengchun; Wang, Jianing; Zhou, Fan; Zhu, Hongtu
picture_as_pdf
Conference or Workshop Item
Deeply-debiased off-policy interval estimation.
Shi, Chengchun; Wan, Runzhe; Chernozhukov, Victor; Song, Rui
picture_as_pdf
Does the Markov decision process fit the data:testing for the Markov property in sequential decision making.
Shi, Chengchun; Wan, Runzhe; Song, Rui; Lu, Wenbin; Leng, Ling
picture_as_pdf
Pattern transfer learning for reinforcement learning in order dispatching.
Wan, Runzhe; Zhang, Sheng; Shi, Chengchun; Luo, Shikai; Song, Rui
picture_as_pdf
Two-way deconfounder for off-policy evaluation in causal reinforcement learning.
Yu, Shuguang; Fang, Shuxing; Peng, Ruixin; Qi, Zhengling; Zhou, Fan; Shi, Chengchun
picture_as_pdf
A generalized method for dynamic noise inference in modeling sequential decision-making.
Li, Jing-Jing; Shi, Chengchun; Li, Lexin; Collins, Anne G.E.