Longbo Huang

Tsinghua University

H-index: 36

Asia-China

About Longbo Huang

Longbo Huang is a researcher at Tsinghua University with an h-index of 36 and a recent h-index of 30 (since 2020). He specializes in Reinforcement Learning (RL), Deep RL, Machine Learning, and Stochastic Networks.

His recent articles include:

Provable Risk-Sensitive Distributional Reinforcement Learning with General Function Approximation

Provably Efficient Partially Observable Risk-Sensitive Reinforcement Learning with Hindsight Observation

Provably safe reinforcement learning with step-wise violation constraints

Multi-User Delay-Constrained Scheduling With Deep Recurrent Reinforcement Learning

When Lyapunov Drift Based Queue Scheduling Meets Adversarial Bandit Learning

RL-CFR: Improving Action Abstraction for Imperfect Information Extensive-Form Games with Reinforcement Learning

Stochastic generative flow networks

A Quadratic Synchronization Rule for Distributed Deep Learning

Longbo Huang Information

University: Tsinghua University

Position: Associate Professor IIIS @ China

Citations (all): 4724

Citations (since 2020): 2877

Cited By: 2586

h-index (all): 36

h-index (since 2020): 30

i10-index (all): 77

i10-index (since 2020): 64

Longbo Huang Skills & Research Interests

Reinforcement Learning (RL)

Deep RL

Machine Learning

Stochastic Networks

Top articles of Longbo Huang

Title: Provable Risk-Sensitive Distributional Reinforcement Learning with General Function Approximation
Journal: arXiv preprint arXiv:2402.18159
Author(s): Yu Chen, Xiangcheng Zhang, Siwei Wang, Longbo Huang
Publication Date: 2024/2/28

Title: Provably Efficient Partially Observable Risk-Sensitive Reinforcement Learning with Hindsight Observation
Journal: arXiv preprint arXiv:2402.18149
Author(s): Tonghe Zhang, Yu Chen, Longbo Huang
Publication Date: 2024/2/28

Title: Provably safe reinforcement learning with step-wise violation constraints
Journal: Advances in Neural Information Processing Systems
Author(s): Nuoya Xiong, Yihan Du, Longbo Huang
Publication Date: 2024/2/13

Title: Multi-User Delay-Constrained Scheduling With Deep Recurrent Reinforcement Learning
Journal: IEEE/ACM Transactions on Networking
Author(s): Pihe Hu, Yu Chen, Ling Pan, Zhixuan Fang, Fu Xiao, ...
Publication Date: 2024/2/9

Title: When Lyapunov Drift Based Queue Scheduling Meets Adversarial Bandit Learning
Journal: IEEE/ACM Transactions on Networking
Author(s): Jiatai Huang, Leana Golubchik, Longbo Huang
Publication Date: 2024/3/26

Title: RL-CFR: Improving Action Abstraction for Imperfect Information Extensive-Form Games with Reinforcement Learning
Journal: arXiv preprint arXiv:2403.04344
Author(s): Boning Li, Zhixuan Fang, Longbo Huang
Publication Date: 2024/3/7

Title: Stochastic generative flow networks
Author(s): Ling Pan, Dinghuai Zhang, Moksh Jain, Longbo Huang, Yoshua Bengio
Publication Date: 2023/7/2

Title: A Quadratic Synchronization Rule for Distributed Deep Learning
Journal: arXiv preprint arXiv:2310.14423
Author(s): Xinran Gu, Kaifeng Lyu, Sanjeev Arora, Jingzhao Zhang, Longbo Huang
Publication Date: 2023/10/22

Title: Online Min-max Problems with Non-convexity and Non-stationarity
Journal: Transactions on Machine Learning Research
Author(s): Yu Huang, Yuan Cheng, Yingbin Liang, Longbo Huang
Publication Date: 2023/3/31

Title: Provably efficient iterated CVaR reinforcement learning with function approximation
Author(s): Michael Maynord, Eadom T Dessalene, Cornelia Fermuller, Yiannis Aloimonos
Publication Date: 2022/9/29

Title: RePreM: representation pre-training with masked model for reinforcement learning
Journal: Proceedings of the AAAI Conference on Artificial Intelligence
Author(s): Yuanying Cai, Chuheng Zhang, Wei Shen, Xuyun Zhang, Wenjie Ruan, ...
Publication Date: 2023/6/26

Title: One is More: Diverse Perspectives within a Single Network for Efficient DRL
Journal: arXiv preprint arXiv:2310.14009
Author(s): Yiqin Tan, Ling Pan, Longbo Huang
Publication Date: 2023/10/21

Title: Queue scheduling with adversarial bandit learning
Journal: arXiv preprint arXiv:2303.01745
Author(s): Jiatai Huang, Leana Golubchik, Longbo Huang
Publication Date: 2023/3/3

Title: Network Optimization Techniques
Author(s): Longbo Huang
Publication Date: 2023/6/20

Title: Beyond conservatism: Diffusion policies in offline multi-agent reinforcement learning
Journal: arXiv preprint arXiv:2307.01472
Author(s): Zhuoran Li, Ling Pan, Longbo Huang
Publication Date: 2023/7/4

Title: Optimal Action Abstraction for Imperfect Information Extensive-Form Games
Author(s): Boning Li, Zhixuan Fang, Longbo Huang
Publication Date: 2023/10/13

Title: Why (and When) does Local SGD Generalize Better than SGD?
Journal: arXiv preprint arXiv:2303.01215
Author(s): Xinran Gu, Kaifeng Lyu, Longbo Huang, Sanjeev Arora
Publication Date: 2023/3/2

Title: The Stochastic Network Model
Author(s): Longbo Huang
Publication Date: 2023/6/20

Title: Multi-task representation learning for pure exploration in linear bandits
Author(s): Yihan Du, Longbo Huang, Wen Sun
Publication Date: 2023/7/3

Title: MAST: A Sparse Training Framework for Multi-agent Reinforcement Learning
Author(s): Pihe Hu, Shaolong Li, Longbo Huang
Publication Date: 2023/10/13

Co-Authors

Yoshua Bengio, Université de Montréal (H-index: 227)

Georgios B. Giannakis, University of Minnesota-Twin Cities (H-index: 160)

Kannan Ramchandran, University of California, Berkeley (H-index: 101)

Eytan Modiano, Massachusetts Institute of Technology (H-index: 72)

Adam Wierman, California Institute of Technology (H-index: 57)

Michael J. Neely, University of Southern California (H-index: 54)
