Longbo Huang
Tsinghua University
H-index: 36
Asia-China
Top articles of Longbo Huang
Title | Journal | Author(s) | Publication Date |
---|---|---|---|
Provable Risk-Sensitive Distributional Reinforcement Learning with General Function Approximation | arXiv preprint arXiv:2402.18159 | Yu Chen Xiangcheng Zhang Siwei Wang Longbo Huang | 2024/2/28 |
Provably Efficient Partially Observable Risk-Sensitive Reinforcement Learning with Hindsight Observation | arXiv preprint arXiv:2402.18149 | Tonghe Zhang Yu Chen Longbo Huang | 2024/2/28 |
Provably safe reinforcement learning with step-wise violation constraints | Advances in Neural Information Processing Systems | Nuoya Xiong Yihan Du Longbo Huang | 2024/2/13 |
Multi-User Delay-Constrained Scheduling With Deep Recurrent Reinforcement Learning | IEEE/ACM Transactions on Networking | Pihe Hu Yu Chen Ling Pan Zhixuan Fang Fu Xiao | 2024/2/9 |
When Lyapunov Drift Based Queue Scheduling Meets Adversarial Bandit Learning | IEEE/ACM Transactions on Networking | Jiatai Huang Leana Golubchik Longbo Huang | 2024/3/26 |
RL-CFR: Improving Action Abstraction for Imperfect Information Extensive-Form Games with Reinforcement Learning | arXiv preprint arXiv:2403.04344 | Boning Li Zhixuan Fang Longbo Huang | 2024/3/7 |
Stochastic generative flow networks | | Ling Pan Dinghuai Zhang Moksh Jain Longbo Huang Yoshua Bengio | 2023/7/2 |
A Quadratic Synchronization Rule for Distributed Deep Learning | arXiv preprint arXiv:2310.14423 | Xinran Gu Kaifeng Lyu Sanjeev Arora Jingzhao Zhang Longbo Huang | 2023/10/22 |
Online Min-max Problems with Non-convexity and Non-stationarity | Transactions on Machine Learning Research | Yu Huang Yuan Cheng Yingbin Liang Longbo Huang | 2023/3/31 |
Provably efficient iterated cvar reinforcement learning with function approximation | | Michael Maynord Eadom T Dessalene Cornelia Fermuller Yiannis Aloimonos | 2022/9/29 |
RePreM: representation pre-training with masked model for reinforcement learning | Proceedings of the AAAI Conference on Artificial Intelligence | Yuanying Cai Chuheng Zhang Wei Shen Xuyun Zhang Wenjie Ruan | 2023/6/26 |
One is More: Diverse Perspectives within a Single Network for Efficient DRL | arXiv preprint arXiv:2310.14009 | Yiqin Tan Ling Pan Longbo Huang | 2023/10/21 |
Queue scheduling with adversarial bandit learning | arXiv preprint arXiv:2303.01745 | Jiatai Huang Leana Golubchik Longbo Huang | 2023/3/3 |
Network Optimization Techniques | | Longbo Huang | 2023/6/20 |
Beyond conservatism: Diffusion policies in offline multi-agent reinforcement learning | arXiv preprint arXiv:2307.01472 | Zhuoran Li Ling Pan Longbo Huang | 2023/7/4 |
Optimal Action Abstraction for Imperfect Information Extensive-Form Games | | Boning Li Zhixuan Fang Longbo Huang | 2023/10/13 |
Why (and When) does Local SGD Generalize Better than SGD? | arXiv preprint arXiv:2303.01215 | Xinran Gu Kaifeng Lyu Longbo Huang Sanjeev Arora | 2023/3/2 |
The Stochastic Network Model | | Longbo Huang | 2023/6/20 |
Multi-task representation learning for pure exploration in linear bandits | | Yihan Du Longbo Huang Wen Sun | 2023/7/3 |
MAST: A Sparse Training Framework for Multi-agent Reinforcement Learning | | Pihe Hu Shaolong Li Longbo Huang | 2023/10/13 |