Zhiwei Steven Wu
Carnegie Mellon University
H-index: 43
North America-United States
Top articles of Zhiwei Steven Wu
Title | Journal | Author(s) | Publication Date |
---|---|---|---|
Differentially Private Bayesian Persuasion | arXiv preprint arXiv:2402.15872 | Yuqi Pan Zhiwei Steven Wu Haifeng Xu Shuran Zheng | 2024/2/24 |
On the sublinear regret of GP-UCB | Advances in Neural Information Processing Systems | Justin Whitehouse Aaditya Ramdas Steven Z Wu | 2024/2/13 |
Adaptive privacy composition for accuracy-first mechanisms | Advances in Neural Information Processing Systems | Ryan M Rogers Gennady Samorodnitsk Steven Z Wu Aaditya Ramdas | 2024/2/13 |
Regret Minimization in Stackelberg Games with Side Information | arXiv preprint arXiv:2402.08576 | Keegan Harris Zhiwei Steven Wu Maria-Florina Balcan | 2024/2/13 |
The Virtues of Pessimism in Inverse Reinforcement Learning | arXiv preprint arXiv:2402.02616 | David Wu Gokul Swamy J Andrew Bagnell Zhiwei Steven Wu Sanjiban Choudhury | 2024/2/4 |
Scalable membership inference attacks via quantile regression | Advances in Neural Information Processing Systems | Martin Bertran Shuai Tang Aaron Roth Michael Kearns Jamie H Morgenstern | 2024/2/13 |
Oracle-Efficient Differentially Private Learning with Public Data | arXiv preprint arXiv:2402.09483 | Adam Block Mark Bun Rathin Desai Abhishek Shetty Steven Wu | 2024/2/13 |
Meta-Learning Adversarial Bandit Algorithms | Advances in Neural Information Processing Systems | Misha Khodak Ilya Osadchiy Keegan Harris Maria-Florina F Balcan Kfir Y Levy | 2024/2/13 |
A minimaximalist approach to reinforcement learning from human feedback | arXiv preprint arXiv:2401.04056 | Gokul Swamy Christoph Dann Rahul Kidambi Zhiwei Steven Wu Alekh Agarwal | 2024/1/8 |
Predictive Performance Comparison of Decision Policies Under Confounding | arXiv preprint arXiv:2404.00848 | Luke Guerdan Amanda Coston Kenneth Holstein Zhiwei Steven Wu | 2024/4/1 |
Adaptive Principal Component Regression with Applications to Panel Data | Advances in Neural Information Processing Systems | Anish Agarwal Keegan Harris Justin Whitehouse Steven Z Wu | 2024/2/13 |
Provable Multi-Party Reinforcement Learning with Diverse Human Feedback | arXiv preprint arXiv:2403.05006 | Huiying Zhong Zhun Deng Weijie J Su Zhiwei Steven Wu Linjun Zhang | 2024/3/8 |
Hybrid Inverse Reinforcement Learning | arXiv preprint arXiv:2402.08848 | Juntao Ren Gokul Swamy Zhiwei Steven Wu J Andrew Bagnell Sanjiban Choudhury | 2024/2/13 |
Learning shared safety constraints from multi-task demonstrations | Advances in Neural Information Processing Systems | Konwoo Kim Gokul Swamy Zuxin Liu Ding Zhao Sanjiban Choudhury | 2024/2/13 |
Stackelberg Games with Side Information | Keegan Harris Steven Wu Maria Florina Balcan | 2023/10/31 | |
Strategic apple tasting | Advances in Neural Information Processing Systems | Keegan Harris Chara Podimata Steven Z Wu | 2024/2/13 |
Choosing public datasets for private machine learning via gradient subspace distance | arXiv preprint arXiv:2303.01256 | Xin Gu Gautam Kamath Zhiwei Steven Wu | 2023/3/2 |
Ground (less) truth: A causal framework for proxy labels in human-algorithm decision-making | Luke Guerdan Amanda Coston Zhiwei Steven Wu Kenneth Holstein | 2023/6/12 | |
Generating private synthetic data with genetic algorithms | Terrance Liu Jingwu Tang Giuseppe Vietri Steven Wu | 2023/7/3 | |
Time-uniform self-normalized concentration for vector-valued processes | arXiv preprint arXiv:2310.09100 | Justin Whitehouse Zhiwei Steven Wu Aaditya Ramdas | 2023/10/13 |