Shimon Whiteson
University of Oxford
H-index: 64
Europe-United Kingdom
Top articles of Shimon Whiteson
Title | Journal | Author(s) | Publication Date |
---|---|---|---|
Distilling Morphology-Conditioned Hypernetworks for Efficient Universal Morphology Control | arXiv preprint arXiv:2402.06570 | Zheng Xiong Risto Vuorio Jacob Beck Matthieu Zimmer Kun Shao | 2024/2/9 |
SplAgger: Split Aggregation for Meta-Reinforcement Learning | arXiv preprint arXiv:2403.03020 | Jacob Beck Matthew Jackson Risto Vuorio Zheng Xiong Shimon Whiteson | 2024/3/5 |
Discovering Temporally-Aware Reinforcement Learning Algorithms | Matthew Thomas Jackson Chris Lu Louis Kirsch Robert Tjarko Lange Shimon Whiteson | 2023/12/7 | |
Discovering general reinforcement learning algorithms with adversarial environment design | Advances in Neural Information Processing Systems | Matthew T Jackson Minqi Jiang Jack Parker-Holder Risto Vuorio Chris Lu | 2024/2/13 |
Recurrent Hypernetworks are Surprisingly Strong in Meta-RL | Advances in Neural Information Processing Systems | Jacob Beck Risto Vuorio Zheng Xiong Shimon Whiteson | 2024/2/13 |
The waymo open sim agents challenge | Neural Information Processing Systems (NeurIPS) 2023, Track on Datasets and Benchmarks | Nico Montali John Lambert Paul Mougin Alex Kuefler Nick Rhinehart | 2023/5/19 |
JaxMARL: Multi-Agent RL Environments and Algorithms in JAX | Alexander Rutherford Benjamin Ellis Matteo Gallici Jonathan Cook Andrei Lupu | 2024/5/6 | |
Smacv2: An improved benchmark for cooperative multi-agent reinforcement learning | Advances in Neural Information Processing Systems | Benjamin Ellis Jonathan Cook Skander Moalla Mikayel Samvelyan Mingfei Sun | 2024/2/13 |
Policy-Guided Diffusion | arXiv preprint arXiv:2404.06356 | Matthew Thomas Jackson Michael Tryfan Matthews Cong Lu Benjamin Ellis Shimon Whiteson | 2024/4/9 |
Particle-Based Score Estimation for State Space Model Learning in Autonomous Driving | Angad Singh Omar Makhlouf Maximilian Igl Joao Messias Arnaud Doucet | 2023/3/6 | |
Why target networks stabilise temporal difference methods | Mattie Fellows Matthew JA Smith Shimon Whiteson | 2023/7/3 | |
Hypernetworks in meta-reinforcement learning | Jacob Beck Matthew Thomas Jackson Risto Vuorio Shimon Whiteson | 2023/3/6 | |
Universal morphology control via contextual modulation | ICML 2023 | Zheng Xiong Jacob Beck Shimon Whiteson | 2023/2/22 |
Jaxmarl: Multi-agent rl environments in jax | AAMAS 2024 | Alexander Rutherford Benjamin Ellis Matteo Gallici Jonathan Cook Andrei Lupu | 2023/11/16 |
Trust-region-free policy optimization for stochastic policies | arXiv preprint arXiv:2302.07985 | Mingfei Sun Benjamin Ellis Anuj Mahajan Sam Devlin Katja Hofmann | 2023/2/15 |
Cheap talk discovery and utilization in multi-agent reinforcement learning | ICLR 2023 | Yat Long Lo Christian Schroeder de Witt Samuel Sokota Jakob Nicolaus Foerster Shimon Whiteson | 2023/3/19 |
Hierarchical Imitation Learning for Stochastic Environments | Maximilian Igl Punit Shah Paul Mougin Sirish Srinivasan Tarun Gupta | 2023/10/1 | |
A survey of meta-reinforcement learning | arXiv preprint arXiv:2301.08028 | Jacob Beck Risto Vuorio Evan Zheran Liu Zheng Xiong Luisa Zintgraf | 2023/1/19 |
Generating simulated agent trajectories using parallel beam search | 2023/3/16 | ||
Imitation is not enough: Robustifying imitation with reinforcement learning for challenging driving scenarios | Yiren Lu Justin Fu George Tucker Xinlei Pan Eli Bronstein | 2023/10/1 |