Chenjia Bai
Harbin Institute of Technology
H-index: 10
Asia-China
Top articles of Chenjia Bai
Diverse Randomized Value Functions: A Provably Pessimistic Approach for Offline Reinforcement Learning
arXiv preprint arXiv:2404.06188
2024/4/9
Xudong Yu
H-Index: 9
Chenjia Bai
H-Index: 2
Hongyi Guo
H-Index: 1
Changhong Wang
H-Index: 6
Zhen Wang
H-Index: 42
Regularized Conditional Diffusion Model for Multi-Task Preference Alignment
arXiv preprint arXiv:2404.04920
2024/4/7
Large-Scale Actionless Video Pre-Training via Discrete Diffusion for Efficient Policy Learning
arXiv preprint arXiv:2402.14407
2024/2/22
Skill Matters: Dynamic Skill Learning for Multi-Agent Cooperative Reinforcement Learning
Available at SSRN 4790564
2024
OVD-Explorer: Optimism Should Not Be the Sole Pursuit of Exploration in Noisy Environments
Proceedings of the AAAI Conference on Artificial Intelligence
2024/3/24
Pessimistic Value Iteration for Multi-Task Data Sharing in Offline Reinforcement Learning
Artificial Intelligence
2024/1/1
Chenjia Bai
H-Index: 2
Lingxiao Wang
H-Index: 1
Zhuoran Yang
H-Index: 1
Bin Zhao
H-Index: 18
Zhen Wang
H-Index: 42
Robust Quadrupedal Locomotion via Risk-Averse Policy Learning
arXiv preprint arXiv:2308.09405
2023/8/18
Provably Efficient Information-Directed Sampling Algorithms for Multi-Agent Reinforcement Learning
arXiv preprint arXiv:2404.19292
2024/4/30
False Correlation Reduction for Offline Reinforcement Learning
IEEE Transactions on Pattern Analysis and Machine Intelligence
2023/10/30
Towards Robust Offline-to-Online Reinforcement Learning via Uncertainty and Smoothness
arXiv preprint arXiv:2309.16973
2023/9/29
Self-Supervised Imitation for Offline Reinforcement Learning With Hindsight Relabeling
IEEE Transactions on Systems, Man, and Cybernetics: Systems
2023/8/17
Privileged Knowledge Distillation for Sim-to-Real Policy Generalization
arXiv preprint arXiv:2305.18464
2023/5/29
On the Value of Myopic Behavior in Policy Reuse
arXiv preprint arXiv:2305.17623
2023/5/28
Exploration in Deep Reinforcement Learning: From Single-Agent to Multi-Agent Domain
IEEE Transactions on Neural Networks and Learning Systems
2023/1/19
Diffusion Model is an Effective Planner and Data Synthesizer for Multi-Task Reinforcement Learning
Advances in neural information processing systems
2024/2/13
Cross-Domain Policy Adaptation via Value-Guided Data Filtering
Advances in Neural Information Processing Systems
2024/2/13
Behavior Contrastive Learning for Unsupervised Skill Discovery
International Conference on Machine Learning
2023/5/8
Monotonic Quantile Network for Worst-Case Offline Reinforcement Learning
IEEE Transactions on Neural Networks and Learning Systems
2022/11/4
Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning
2022
RORL: Robust Offline Reinforcement Learning via Conservative Smoothing
Advances in Neural Information Processing Systems (NeurIPS) 2022
2022/6/6