Haipeng Luo
University of Southern California
H-index: 34
North America-United States
Top articles of Haipeng Luo
Title | Journal | Author(s) | Publication Date |
---|---|---|---|
Regret matching+: (in)stability and fast convergence in games | Advances in Neural Information Processing Systems | Gabriele Farina Julien Grand-Clément Christian Kroer Chung-Wei Lee Haipeng Luo | 2024/2/13 |
Online learning in contextual second-price pay-per-click auctions | | Mengxiao Zhang Haipeng Luo | 2024/4/18 |
Improved best-of-both-worlds guarantees for multi-armed bandits: FTRL with general regularizers and multiple optimal arms | Advances in Neural Information Processing Systems | Tiancheng Jin Junyan Liu Haipeng Luo | 2024/2/13 |
Tractable Local Equilibria in Non-Concave Games | arXiv preprint arXiv:2403.08171 | Yang Cai Constantinos Daskalakis Haipeng Luo Chen-Yu Wei Weiqiang Zheng | 2024/3/13 |
Contextual Multinomial Logit Bandits with General Value Functions | arXiv preprint arXiv:2402.08126 | Mengxiao Zhang Haipeng Luo | 2024/2/12 |
Practical contextual bandits with feedback graphs | NeurIPS 2023 | Mengxiao Zhang* Yuheng Zhang* Olga Vrousgou Haipeng Luo Paul Mineiro | 2023/2 |
Efficient Contextual Bandits with Uninformed Feedback Graphs | arXiv preprint arXiv:2402.08127 | Mengxiao Zhang Yuheng Zhang Haipeng Luo Paul Mineiro | 2024/2/12 |
Uncoupled and Convergent Learning in Two-Player Zero-Sum Markov Games with Bandit Feedback | Advances in Neural Information Processing Systems | Yang Cai Haipeng Luo Chen-Yu Wei Weiqiang Zheng | 2023/12 |
Near-Optimal Policy Optimization for Correlated Equilibrium in General-Sum Markov Games | arXiv preprint arXiv:2401.15240 | Yang Cai Haipeng Luo Chen-Yu Wei Weiqiang Zheng | 2024/1/26 |
No-Regret Online Reinforcement Learning with Adversarial Losses and Transitions | Advances in Neural Information Processing Systems | Tiancheng Jin Junyan Liu Chloé Rouyer William Chang Chen-Yu Wei | 2024/2/13 |
No-regret learning in two-echelon supply chain with unknown demand distribution | | Mengxiao Zhang Shi Chen Haipeng Luo Yingfei Wang | 2023/4/11 |
On Local Equilibrium in Non-Concave Games | | Yang Cai Constantinos Costis Daskalakis Haipeng Luo Chen-Yu Wei Weiqiang Zheng | 2023/10/13 |
Improved high-probability regret for adversarial bandits with time-varying feedback graphs | | Haipeng Luo Hanghang Tong Mengxiao Zhang Yuheng Zhang | 2023/2/13 |
2nd Workshop on Multi-Armed Bandits and Reinforcement Learning: Advancing Decision Making in E-Commerce and Beyond | | Chu Wang Yingfei Wang Haipeng Luo Daniel Jiang Jinghai He | 2023/8/6 |
Average-Constrained Policy Optimization | arXiv preprint arXiv:2302.00808 | Akhil Agnihotri Rahul Jain Haipeng Luo | 2023/2/2 |
Refined regret for adversarial MDPs with linear function approximation | | Yan Dai Haipeng Luo Chen-Yu Wei Julian Zimmert | 2023/7/3 |
Coordination under Unknown Demand Distribution: Online Learning for Two-Echelon Supply Chains | | Shi Chen Haipeng Luo Yingfei Wang Mengxiao Zhang | 2023 |
Posterior sampling-based online learning for the stochastic shortest path model | | Mehdi Jafarnia-Jahromi Liyu Chen Rahul Jain Haipeng Luo | 2023/7/2 |
Supply Chain Coordination with Unknown Demand Distribution: No-Regret Learning | Available at SSRN 4456201 | Shi Chen Haipeng Luo Yingfei Wang Mengxiao Zhang | 2023/5/22 |
Last-Iterate Convergence Properties of Regret-Matching Algorithms in Games | arXiv preprint arXiv:2311.00676 | Yang Cai Gabriele Farina Julien Grand-Clément Christian Kroer Chung-Wei Lee | 2023/11/1 |