Jiaqi Yang

About Jiaqi Yang

Jiaqi Yang, With an exceptional h-index of 9 and a recent h-index of 9 (since 2020), a distinguished researcher at Tsinghua University, specializes in the field of Statistical Machine Learning.

His recent articles reflect a diverse array of research interests and contributions to the field:

Learning Zero-Shot Cooperation with Humans, Assuming Humans Are Biased

Phasic Self-Imitative Reduction for Sparse-Reward Goal-Conditioned Reinforcement Learning

Revisiting some common practices in cooperative multi-agent reinforcement learning

Nearly Minimax Algorithms for Linear Bandits with Shared Representation

Optimal Gradient-based Algorithms for Non-concave Bandit Optimization

Improved Variance-Aware Confidence Sets for Linear Bandits and Linear Mixture MDP

Going Beyond Linear RL: Sample Efficient Neural Function Approximation

Provable Model-Based Nonlinear Bandit and Reinforcement Learning: Shelve Optimism, Embrace Virtual Curvature

Jiaqi Yang Information

University

Position

Institute for Interdisciplinary Information Sciences

Citations(all)

263

Citations(since 2020)

263

Cited By

9

hIndex(all)

9

hIndex(since 2020)

9

i10Index(all)

9

i10Index(since 2020)

9

Email

University Profile Page

Google Scholar

Jiaqi Yang Skills & Research Interests

Statistical Machine Learning

Top articles of Jiaqi Yang

Learning Zero-Shot Cooperation with Humans, Assuming Humans Are Biased

arXiv preprint arXiv:2302.01605

2023/2/3

Phasic Self-Imitative Reduction for Sparse-Reward Goal-Conditioned Reinforcement Learning

2022/6/28

Revisiting some common practices in cooperative multi-agent reinforcement learning

ICML 2022

2022/6/15

Nearly Minimax Algorithms for Linear Bandits with Shared Representation

arXiv preprint arXiv:2203.15664

2022/3/29

Jiaqi Yang
Jiaqi Yang

H-Index: 1

Qi Lei
Qi Lei

H-Index: 2

Optimal Gradient-based Algorithms for Non-concave Bandit Optimization

Advances in Neural Information Processing Systems

2021/12/6

Improved Variance-Aware Confidence Sets for Linear Bandits and Linear Mixture MDP

Advances in Neural Information Processing Systems

2021/12/6

Zihan Zhang
Zihan Zhang

H-Index: 4

Jiaqi Yang
Jiaqi Yang

H-Index: 1

Going Beyond Linear RL: Sample Efficient Neural Function Approximation

Advances in Neural Information Processing Systems

2021/12/6

Provable Model-Based Nonlinear Bandit and Reinforcement Learning: Shelve Optimism, Embrace Virtual Curvature

Advances in neural information processing systems

2021/12/6

Linear Bandits with Limited Adaptivity and Learning Distributional Optimal Design

2021/6/15

Jiaqi Yang
Jiaqi Yang

H-Index: 1

Yuan Zhou
Yuan Zhou

H-Index: 16

Fully Gap-Dependent Bounds for Multinomial Logit Bandit

2021/3/18

Jiaqi Yang
Jiaqi Yang

H-Index: 1

Impact of Representation Learning in Linear Bandits

2020/10/2

See List of Professors in Jiaqi Yang University(Tsinghua University)

Co-Authors

academic-engine