LSE Research Online

    Ma, Tao

    Number of items: 3.
    Public
  • Ma, Tao, Yang, Xuzhi, Szabo, Zoltan (2024). To switch or not to switch? Balanced policy switching in offline reinforcement learning. arXiv. https://doi.org/10.48550/arXiv.2407.01837

    Restricted
  • Ma, Tao, Zhu, Jin, Cai, Hengrui, Qi, Zhengling, Chen, Yunxiao, Shi, Chengchun, Laber, Eric B. (2026). Sequential knockoffs for variable selection in reinforcement learning. Journal of the American Statistical Association, [In Press]
  • Ma, Tao (2025). Trustworthy decision making with sustainability [Doctoral thesis]. London School of Economics and Political Science. https://doi.org/10.21953/researchonline.lse.ac.uk.00137111