Jiaqi Yang
Tsinghua University
H-Index: 9
Asia-China
Top articles of Jiaqi Yang
Learning Zero-Shot Cooperation with Humans, Assuming Humans Are Biased
arXiv preprint arXiv:2302.01605
2023/2/3
Phasic Self-Imitative Reduction for Sparse-Reward Goal-Conditioned Reinforcement Learning
2022/6/28
Revisiting Some Common Practices in Cooperative Multi-Agent Reinforcement Learning
ICML 2022
2022/6/15
Nearly Minimax Algorithms for Linear Bandits with Shared Representation
arXiv preprint arXiv:2203.15664
2022/3/29
Jiaqi Yang
H-Index: 1
Qi Lei
H-Index: 2
Optimal Gradient-based Algorithms for Non-concave Bandit Optimization
Advances in Neural Information Processing Systems
2021/12/6
Improved Variance-Aware Confidence Sets for Linear Bandits and Linear Mixture MDP
Advances in Neural Information Processing Systems
2021/12/6
Zihan Zhang
H-Index: 4
Jiaqi Yang
H-Index: 1
Going Beyond Linear RL: Sample Efficient Neural Function Approximation
Advances in Neural Information Processing Systems
2021/12/6
Provable Model-Based Nonlinear Bandit and Reinforcement Learning: Shelve Optimism, Embrace Virtual Curvature
Advances in Neural Information Processing Systems
2021/12/6
Linear Bandits with Limited Adaptivity and Learning Distributional Optimal Design
2021/6/15
Jiaqi Yang
H-Index: 1
Yuan Zhou
H-Index: 16
Fully Gap-Dependent Bounds for Multinomial Logit Bandit
2021/3/18
Jiaqi Yang
H-Index: 1
Impact of Representation Learning in Linear Bandits
2020/10/2