Ma, Tao

Number of items: 3.

Public

Ma, Tao, Yang, Xuzhi, Szabo, Zoltan (2024). To switch or not to switch? Balanced policy switching in offline reinforcement learning. arXiv. https://doi.org/10.48550/arXiv.2407.01837

Restricted

Ma, Tao, Zhu, Jin, Cai, Hengrui, Qi, Zhengling, Chen, Yunxiao, Shi, Chengchun, Laber, Eric B. (2026). Sequential knockoffs for variable selection in reinforcement learning. Journal of the American Statistical Association, [In Press]

Ma, Tao (2025). Trustworthy decision making with sustainability [Doctoral thesis]. London School of Economics and Political Science. https://doi.org/10.21953/researchonline.lse.ac.uk.00137111

Up a level

EndNote

BibTeX

Reference Manager (RIS)

Refer

Dublin Core

JSON

Multiline CSV

Atom RSS

Public
Restricted