MathSciDoc: An Archive for Mathematician ∫

Near-Optimal Regret Bounds for Multi-batch Reinforcement Learning

Zihan Zhang Department of Automation, Tsinghua University Yuhang Jiang Department of Automation, Tsinghua University Yuan Zhou Yau Mathematical Sciences Center & Department of Mathematical Sciences, Tsinghua University Xiangyang Ji Department of Automation, Tsinghua University

Machine Learning mathscidoc:2302.41002

Conference on Neural Information Processing Systems (NeurIPS), 2022.12

No abstract uploaded!

No keywords uploaded!

[ Download ] [ 2023-02-27 16:35:50 uploaded by zhouyuan ] [ 2905 downloads ] [ 0 comments ]

BibTex
Ref

@inproceedings{zihan2022near-optimal,
  title={Near-Optimal Regret Bounds for Multi-batch Reinforcement Learning},
  author={Zihan Zhang, Yuhang Jiang, Yuan Zhou, and Xiangyang Ji},
  url={http://archive.ymsc.tsinghua.edu.cn/pacm_paperurl/20230227163550929080748},
  booktitle={Conference on Neural Information Processing Systems (NeurIPS)},
  year={2022},
}

Zihan Zhang, Yuhang Jiang, Yuan Zhou, and Xiangyang Ji. Near-Optimal Regret Bounds for Multi-batch Reinforcement Learning. 2022. In Conference on Neural Information Processing Systems (NeurIPS). http://archive.ymsc.tsinghua.edu.cn/pacm_paperurl/20230227163550929080748.

Please log in for comment!

MathSciDoc: An Archive for Mathematician ∫

Log In

Sign Up

Help

Near-Optimal Regret Bounds for Multi-batch Reinforcement Learning

Zihan Zhang Department of Automation, Tsinghua University Yuhang Jiang Department of Automation, Tsinghua University Yuan Zhou Yau Mathematical Sciences Center & Department of Mathematical Sciences, Tsinghua University Xiangyang Ji Department of Automation, Tsinghua University

Machine Learning mathscidoc:2302.41002

Contact us: office-iccm@tsinghua.edu.cn | Copyright Reserved