Pan Xu
University of California, Los Angeles
H-index: 22
North America-United States
Top articles of Pan Xu
Randomized Exploration in Cooperative Multi-Agent Reinforcement Learning
arXiv preprint arXiv:2404.10728
2024/4/16
Finite-time frequentist regret bounds of multi-agent thompson sampling on sparse hypergraphs
Proceedings of the AAAI Conference on Artificial Intelligence
2024/3/24
Minimax Optimal and Computationally Efficient Algorithms for Distributionally Robust Offline Reinforcement Learning
arXiv preprint arXiv:2403.09621
2024/3/14
Distributionally Robust Off-Dynamics Reinforcement Learning: Provable Efficiency with Linear Function Approximation
arXiv preprint arXiv:2402.15399
2024/2/23
Convergence of Sign-based Random Reshuffling Algorithms for Nonconvex Optimization
arXiv preprint arXiv:2310.15976
2023/10/24
Optimal Batched Best Arm Identification
arXiv preprint arXiv:2310.14129
2023/10/21
PhyGCN: Pre-trained Hypergraph Convolutional Neural Networks with Self-supervised Learning
bioRxiv
2023/10/2
Wasserstein Distributionally Robust Policy Evaluation and Learning for Contextual Bandits
arXiv preprint arXiv:2309.08748
2023/9/15
Thompson sampling with less exploration is fast and optimal
2023/7/3
Queer In AI: A Case Study in Community-Led Participatory AI
2023/6/12
Anaelia Ovalle
H-Index: 2
Arjun Subramonian
H-Index: 2
Ashwin Singh
H-Index: 4
Claas Voelcker
H-Index: 1
Eva Breznik
H-Index: 1
Hang Yuan
H-Index: 17
Huan Zhang
H-Index: 7
Maarten Sap
H-Index: 23
Maria Leonor Pacheco
H-Index: 4
Maria Ryskina
H-Index: 1
Martin Mundt
H-Index: 7
Nyx Mclean
H-Index: 4
Pan Xu
H-Index: 21
Sarah Mathew
H-Index: 13
Sarthak Arora
H-Index: 2
William Agnew
H-Index: 1
Yanan Long
H-Index: 2
Avijit Ghosh
H-Index: 0
Nathaniel Dennler
H-Index: 0
Michael Noseworthy
H-Index: 5
Sharvani Jha
H-Index: 1
Aditya Joshi
H-Index: 4
Luke Stark
H-Index: 12
Provable and Practical: Efficient Exploration in Reinforcement Learning via Langevin Monte Carlo
2023/5/29
Multiple models for outbreak decision support in the face of uncertainty
Proceedings of the National Academy of Sciences
2023/5/2
Katriona Shea
H-Index: 29
Cynthia Chen
H-Index: 16
Jinghui Chen
H-Index: 10
Shi Chen
H-Index: 4
Yangquan Chen
H-Index: 68
Richard C Gerkin
H-Index: 16
Quanquan Gu
H-Index: 37
Nathaniel Hupert
H-Index: 19
Daniel Janies
H-Index: 19
Ana Pastore Y Piontti
H-Index: 14
Rajib Paul
H-Index: 17
T Alex Perkins
H-Index: 29
Chrysm Watson Ross
H-Index: 1
Kok Ben Toh
H-Index: 10
Alessandro Vespignani
H-Index: 85
Lingxiao Wang
H-Index: 1
Pan Xu
H-Index: 21
Weitong Zhang
H-Index: 4
Difan Zou
H-Index: 15
Distributionally Robust Policy Gradient for Offline Contextual Bandits
2023/4/11
Global convergence of localized policy iteration in networked multi-agent reinforcement learning
Proceedings of the ACM on Measurement and Analysis of Computing Systems
2023/2/28
Neural Contextual Bandits with Deep Representation and Shallow Exploration
2022
Active Ranking without Strong Stochastic Transitivity
Advances in neural information processing systems
2022/12/6
Finite-time regret of thompson sampling algorithms for exponential family multi-armed bandits
Advances in Neural Information Processing Systems
2022/12/6
The United States COVID-19 Forecast Hub dataset
Scientific data
2022/8/1
Langevin Monte Carlo for Contextual Bandits
2022/6/28
Adaptive Sampling for Heterogeneous Rank Aggregation from Noisy Pairwise Comparisons
2022/5/3