Zhaoran Wang
Northwestern University
H-index: 41
Asia-Bangladesh
Top articles of Zhaoran Wang
Title | Journal | Author(s) | Publication Date |
---|---|---|---|
Reason for Future, Act for Now: A Principled Framework for Autonomous LLM Agents with Provable Sample Efficiency | arXiv preprint arXiv:2309.17382 | Zhihan Liu Hao Hu Shenao Zhang Hongyi Guo Shuqi Ke | 2023/9/29 |
Pessimism in the Face of Confounders: Provably Efficient Offline Reinforcement Learning in Partially Observable Markov Decision Processes | | Miao Lu Yifei Min Zhaoran Wang Zhuoran Yang | 2023 |
What and How Does In-Context Learning Learn? Bayesian Model Averaging, Parameterization, and Generalization | arXiv preprint arXiv:2305.19420 | Yufeng Zhang Fengzhuo Zhang Zhuoran Yang Zhaoran Wang | 2023/5/30 |
Enforcing Hard Constraints with Soft Barriers: Safe Reinforcement Learning in Unknown Stochastic Environments | | Yixuan Wang Simon Sinong Zhan Ruochen Jiao Zhilu Wang Wanxin Jin | 2023 |
Maximize to Explore: One Objective Function Fusing Estimation, Planning, and Exploration | | Zhihan Liu Miao Lu Wei Xiong Han Zhong Hao Hu | 2023 |
Embed to Control Partially Observed Systems: Representation Learning with Provable Sample Efficiency | | Lingxiao Wang Qi Cai Zhuoran Yang Zhaoran Wang | 2023 |
Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning | | Shuang Qiu Lingxiao Wang Chenjia Bai Zhuoran Yang Zhaoran Wang | 2022 |
RORL: Robust Offline Reinforcement Learning via Conservative Smoothing | Advances in Neural Information Processing Systems (NeurIPS) 2022 | Rui Yang* Chenjia Bai* Xiaoteng Ma Zhaoran Wang Chongjie Zhang | 2022/6/6 |
Online Bootstrap Inference for Policy Evaluation in Reinforcement Learning | Journal of the American Statistical Association | Pratik Ramprasad Yuantong Li Zhuoran Yang Zhaoran Wang Will Wei Sun | 2022 |
Reinforcement Learning from Partial Observation: Linear Function Approximation with Provable Sample Efficiency | | Qi Cai Zhuoran Yang Zhaoran Wang | 2022 |
Policy Learning "Without" Overlap: Pessimism and Generalized Empirical Bernstein's Inequality | arXiv preprint arXiv:2212.09900 | Ying Jin Zhimei Ren Zhuoran Yang Zhaoran Wang | 2022/12/19 |
FinRL-Meta: Environments and Benchmarks for Financial Reinforcement Learning | Advances in Neural Information Processing Systems | Xiaoyang Liu Ziyi Xia Jingyang Rui Jiechao Gao Hongyang Yang | 2022 |
Nearly Dimension-Independent Sparse Linear Bandit over Small Action Spaces via Best Subset Selection | Journal of the American Statistical Association | Yi Chen Yining Wang Ethan X Fang Zhaoran Wang Runze Li | 2022 |
Sequential Information Design: Markov Persuasion Process and Its Efficient Reinforcement Learning | arXiv preprint arXiv:2202.10678 | Jibang Wu Zixuan Zhang Zhe Feng Zhaoran Wang Zhuoran Yang | 2022/2/22 |
Pessimistic Minimax Value Iteration: Provably Efficient Equilibrium Learning from Offline Datasets | | Han Zhong Wei Xiong Jiyuan Tan Liwei Wang Tong Zhang | 2022/6/28 |
GEC: A Unified Framework for Interactive Decision Making in MDP, POMDP, and Beyond | arXiv preprint arXiv:2211.01962 | Han Zhong Wei Xiong Sirui Zheng Liwei Wang Zhaoran Wang | 2022/11/3 |
Learning Zero-Sum Simultaneous-Move Markov Games Using Function Approximation and Correlated Equilibrium | Mathematics of Operations Research/Annual Conference on Learning Theory | Qiaomin Xie Yudong Chen Zhaoran Wang Zhuoran Yang | 2022 |
Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning | | Chenjia Bai Lingxiao Wang Zhuoran Yang Zhihong Deng Animesh Garg | 2022 |
Human-In-The-Loop: Provably Efficient Preference-Based Reinforcement Learning with General Function Approximation | | Xiaoyu Chen Han Zhong Zhuoran Yang Zhaoran Wang Liwei Wang | 2022/6/28 |
Offline Reinforcement Learning with Instrumental Variables in Confounded Markov Decision Processes | arXiv preprint arXiv:2209.08666 | Zuyue Fu Zhengling Qi Zhaoran Wang Zhuoran Yang Yanxun Xu | 2022/9/18 |