Near-Optimal Regret Bounds for Multi-batch Reinforcement Learning

Zihan Zhang, Department of Automation, Tsinghua University
Yuhang Jiang, Department of Automation, Tsinghua University
Yuan Zhou, Yau Mathematical Sciences Center & Department of Mathematical Sciences, Tsinghua University
Xiangyang Ji, Department of Automation, Tsinghua University

Machine Learning mathscidoc:2302.41002

Conference on Neural Information Processing Systems (NeurIPS), 2022.12
@inproceedings{zihan2022near-optimal,
  title={Near-Optimal Regret Bounds for Multi-batch Reinforcement Learning},
  author={Zihan Zhang and Yuhang Jiang and Yuan Zhou and Xiangyang Ji},
  url={http://archive.ymsc.tsinghua.edu.cn/pacm_paperurl/20230227163550929080748},
  booktitle={Conference on Neural Information Processing Systems (NeurIPS)},
  year={2022},
}
Zihan Zhang, Yuhang Jiang, Yuan Zhou, and Xiangyang Ji. Near-Optimal Regret Bounds for Multi-batch Reinforcement Learning. 2022. In Conference on Neural Information Processing Systems (NeurIPS). http://archive.ymsc.tsinghua.edu.cn/pacm_paperurl/20230227163550929080748.