Aldo Pacchiano
University of California, Berkeley
H-index: 21
North America-United States
Top articles of Aldo Pacchiano
Title | Journal | Author(s) | Publication Date |
---|---|---|---|
A Unified Model and Dimension for Interactive Estimation | Advances in Neural Information Processing Systems | Nataly Brukhim Miro Dudik Aldo Pacchiano Robert E Schapire | 2024/2/13 |
Multiple-policy Evaluation via Density Estimation | arXiv preprint arXiv:2404.00195 | Yilei Chen Aldo Pacchiano Ioannis Ch Paschalidis | 2024/3/29 |
A Framework for Partially Observed Reward-States in RLHF | arXiv preprint arXiv:2402.03282 | Chinmaya Kausik Mirco Mutti Aldo Pacchiano Ambuj Tewari | 2024/2/5 |
Provably Sample Efficient RLHF via Active Preference Optimization | arXiv preprint arXiv:2402.10500 | Nirjhar Das Souradip Chakraborty Aldo Pacchiano Sayak Ray Chowdhury | 2024/2/16 |
Contextual Bandits with Stage-wise Constraints | arXiv preprint arXiv:2401.08016 | Aldo Pacchiano Mohammad Ghavamzadeh Peter Bartlett | 2024/1/15 |
Experiment planning with function approximation | Advances in Neural Information Processing Systems | Aldo Pacchiano Jonathan Lee Emma Brunskill | 2024/2/13 |
Anytime Model Selection in Linear Bandits | Advances in Neural Information Processing Systems | Parnian Kassraie Nicolas Emmenegger Andreas Krause Aldo Pacchiano | 2024/2/13 |
Data-Driven Online Model Selection With Regret Guarantees | Chris Dann Claudio Gentile Aldo Pacchiano | 2024/4/18 | |
Supervised pretraining can learn in-context reinforcement learning | Advances in Neural Information Processing Systems | Jonathan Lee Annie Xie Aldo Pacchiano Yash Chandak Chelsea Finn | 2024/2/13 |
Undo Maps: A Tool for Adapting Policies to Perceptual Distortions | Abhi Gupta Ted Moskovitz David Alvarez-Melis Aldo Pacchiano | 2023/7/9 | |
Dueling rl: Reinforcement learning with trajectory preferences | Aadirupa Saha Aldo Pacchiano Jonathan Lee | 2023/4/11 | |
In-Context Decision-Making from Supervised Pretraining | Jonathan Lee Annie Xie Aldo Pacchiano Yash Chandak Chelsea Finn | 2023/7/9 | |
Estimating Optimal Policy Value in General Linear Contextual Bandits | arXiv preprint arXiv:2302.09451 | Jonathan N Lee Weihao Kong Aldo Pacchiano Vidya Muthukumar Emma Brunskill | 2023/2/19 |
Leveraging offline data in online reinforcement learning | Andrew Wagenmaker Aldo Pacchiano | 2023/7/3 | |
An instance-dependent analysis for the cooperative multi-player multi-armed bandit | Aldo Pacchiano Peter Bartlett Michael Jordan | 2023/2/13 | |
Estimating Optimal Policy Value in Linear Contextual Bandits Beyond Gaussianity | Transactions on Machine Learning Research | Jonathan Lee Weihao Kong Aldo Pacchiano Vidya Muthukumar Emma Brunskill | 2023/9/18 |
Data-Driven Regret Balancing for Online Model Selection in Bandits | arXiv preprint arXiv:2306.02869 | Aldo Pacchiano Christoph Dann Claudio Gentile | 2023/6/5 |
Unbiased Decisions Reduce Regret: Adversarial Domain Adaptation for the Bank Loan Problem | arXiv preprint arXiv:2308.08051 | Elena Gal Shaun Singh Aldo Pacchiano Ben Walker Terry Lyons | 2023/8/15 |
Improving offline rl by blending heuristics | arXiv preprint arXiv:2306.00321 | Sinong Geng Aldo Pacchiano Andrey Kolobov Ching-An Cheng | 2023/6/1 |
Joint representation training in sequential tasks with shared structure | arXiv preprint arXiv:2206.12441 | Aldo Pacchiano Ofir Nachum Nilseh Tripuraneni Peter Bartlett | 2022/6/24 |