Longbo Huang

Tsinghua University

H-index: 36

Asia-China

About Longbo Huang

Longbo Huang is a researcher at Tsinghua University with an h-index of 36 and a recent h-index of 30 (since 2020). He specializes in Reinforcement Learning (RL), Deep RL, Machine Learning, and Stochastic Networks.

His recent articles include:

Provable Risk-Sensitive Distributional Reinforcement Learning with General Function Approximation

Provably Efficient Partially Observable Risk-Sensitive Reinforcement Learning with Hindsight Observation

Provably safe reinforcement learning with step-wise violation constraints

Multi-User Delay-Constrained Scheduling With Deep Recurrent Reinforcement Learning

When Lyapunov Drift Based Queue Scheduling Meets Adversarial Bandit Learning

RL-CFR: Improving Action Abstraction for Imperfect Information Extensive-Form Games with Reinforcement Learning

Stochastic generative flow networks

A Quadratic Synchronization Rule for Distributed Deep Learning

Longbo Huang Information

University	Tsinghua University
Position	Associate Professor IIIS @ China
Citations(all)	4724
Citations(since 2020)	2877
Cited By	2586
hIndex(all)	36
hIndex(since 2020)	30
i10Index(all)	77
i10Index(since 2020)	64

Longbo Huang Skills & Research Interests

Reinforcement Learning (RL)

Deep RL

Machine Learning

Stochastic Networks

Top articles of Longbo Huang

Title	Journal	Author(s)	Publication Date
Provable Risk-Sensitive Distributional Reinforcement Learning with General Function Approximation	arXiv preprint arXiv:2402.18159	Yu Chen Xiangcheng Zhang Siwei Wang Longbo Huang	2024/2/28
Provably Efficient Partially Observable Risk-Sensitive Reinforcement Learning with Hindsight Observation	arXiv preprint arXiv:2402.18149	Tonghe Zhang Yu Chen Longbo Huang	2024/2/28
Provably safe reinforcement learning with step-wise violation constraints	Advances in Neural Information Processing Systems	Nuoya Xiong Yihan Du Longbo Huang	2024/2/13
Multi-User Delay-Constrained Scheduling With Deep Recurrent Reinforcement Learning	IEEE/ACM Transactions on Networking	Pihe Hu Yu Chen Ling Pan Zhixuan Fang Fu Xiao ...	2024/2/9
When Lyapunov Drift Based Queue Scheduling Meets Adversarial Bandit Learning	IEEE/ACM Transactions on Networking	Jiatai Huang Leana Golubchik Longbo Huang	2024/3/26
RL-CFR: Improving Action Abstraction for Imperfect Information Extensive-Form Games with Reinforcement Learning	arXiv preprint arXiv:2403.04344	Boning Li Zhixuan Fang Longbo Huang	2024/3/7
Stochastic generative flow networks		Ling Pan Dinghuai Zhang Moksh Jain Longbo Huang Yoshua Bengio	2023/7/2
A Quadratic Synchronization Rule for Distributed Deep Learning	arXiv preprint arXiv:2310.14423	Xinran Gu Kaifeng Lyu Sanjeev Arora Jingzhao Zhang Longbo Huang	2023/10/22
Online Min-max Problems with Non-convexity and Non-stationarity	Transactions on Machine Learning Research	Yu Huang Yuan Cheng Yingbin Liang Longbo Huang	2023/3/31
Provably efficient iterated CVaR reinforcement learning with function approximation		Michael Maynord Eadom T Dessalene Cornelia Fermuller Yiannis Aloimonos	2022/9/29
RePreM: representation pre-training with masked model for reinforcement learning	Proceedings of the AAAI Conference on Artificial Intelligence	Yuanying Cai Chuheng Zhang Wei Shen Xuyun Zhang Wenjie Ruan ...	2023/6/26
One is More: Diverse Perspectives within a Single Network for Efficient DRL	arXiv preprint arXiv:2310.14009	Yiqin Tan Ling Pan Longbo Huang	2023/10/21
Queue scheduling with adversarial bandit learning	arXiv preprint arXiv:2303.01745	Jiatai Huang Leana Golubchik Longbo Huang	2023/3/3
Network Optimization Techniques		Longbo Huang	2023/6/20
Beyond conservatism: Diffusion policies in offline multi-agent reinforcement learning	arXiv preprint arXiv:2307.01472	Zhuoran Li Ling Pan Longbo Huang	2023/7/4
Optimal Action Abstraction for Imperfect Information Extensive-Form Games		Boning Li Zhixuan Fang Longbo Huang	2023/10/13
Why (and When) does Local SGD Generalize Better than SGD?	arXiv preprint arXiv:2303.01215	Xinran Gu Kaifeng Lyu Longbo Huang Sanjeev Arora	2023/3/2
The Stochastic Network Model		Longbo Huang	2023/6/20
Multi-task representation learning for pure exploration in linear bandits		Yihan Du Longbo Huang Wen Sun	2023/7/3
MAST: A Sparse Training Framework for Multi-agent Reinforcement Learning		Pihe Hu Shaolong Li Longbo Huang	2023/10/13