Yee Whye Teh
University of Oxford
H-index: 81
Europe-United Kingdom
Top articles of Yee Whye Teh
Title | Journal | Author(s) | Publication Date |
---|---|---|---|
Deep Stochastic Processes via Functional Markov Transition Operators | Advances in Neural Information Processing Systems | Jin Xu Emilien Dupont Kaspar Märtens Thomas Rainforth Yee Whye Teh | 2024/2/13 |
Revisiting Dynamic Evaluation: Online Adaptation for Large Language Models | arXiv preprint arXiv:2403.01518 | Amal Rannen-Triki Jorg Bornschein Razvan Pascanu Marcus Hutter Andras György | 2024/3/3 |
Position Paper: Bayesian Deep Learning in the Age of Large-Scale AI | Theodore Papamarkou Maria Skoularidou Konstantina Palla Laurence Aitchison Julyan Arbel | 2024/2 | |
Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models | arXiv preprint arXiv:2402.19427 | Soham De Samuel L Smith Anushan Fernando Aleksandar Botev George Cristian-Muraru | 2024/2/29 |
The Edge-of-Reach Problem in Offline Model-Based Reinforcement Learning | Second Agent Learning in Open-Endedness Workshop | Anya Sims Cong Lu Yee Whye Teh | 2024/2/19 |
RecurrentGemma: Moving Past Transformers for Efficient Open Language Models | arXiv preprint arXiv:2404.07839 | Aleksandar Botev Soham De Samuel L Smith Anushan Fernando George-Cristian Muraru | 2024/4/11 |
Synthetic experience replay | NeurIPS 2023 | Cong Lu Philip J Ball Yee Whye Teh Jack Parker-Holder | 2023/3/12 |
Unleashing the Power of Meta-tuning for Few-shot Generalization Through Sparse Interpolated Experts | arXiv preprint arXiv:2403.08477 | Shengzhuang Chen Jihoon Tack Yunqiao Yang Yee Whye Teh Jonathan Richard Schwarz | 2024/3/13 |
Geometric neural diffusion processes | Advances in Neural Information Processing Systems | Emile Mathieu Vincent Dutordoir Michael Hutchinson Valentin De Bortoli Yee Whye Teh | 2024/2/13 |
Online Adaptation of Language Models with a Memory of Amortized Contexts | arXiv preprint arXiv:2403.04317 | Jihoon Tack Jaehyung Kim Eric Mitchell Jinwoo Shin Yee Whye Teh | 2024/3/7 |
Incorporating unlabelled data into Bayesian neural networks | arXiv preprint arXiv:2304.01762 | Mrinank Sharma Tom Rainforth Yee Whye Teh Vincent Fortuin | 2023/4/4 |
EvIL: Evolution Strategies for Generalisable Imitation Learning | Silvia Sapora Chris Lu Gokul Swamy Yee Whye Teh Jakob Nicolaus Foerster | 2023/10/13 | |
When Does Re-initialization Work? | Sheheryar Zaidi Tudor Berariu Hyunjik Kim Jorg Bornschein Claudia Clopath | 2023/2/28 | |
Daisee: Adaptive importance sampling by balancing exploration and exploitation | Scandinavian Journal of Statistics | Xiaoyu Lu Tom Rainforth Yee Whye Teh | 2023 |
Stochastic linear dynamics in parameters to deal with Neural Networks plasticity loss | Alexandre Galashov Michalis Titsias Razvan Pascanu Yee Whye Teh Maneesh Sahani | 2023/12/7 | |
Deep transformers without shortcuts: Modifying self-attention for faithful signal propagation | arXiv preprint arXiv:2302.10322 | Bobby He James Martens Guodong Zhang Aleksandar Botev Andrew Brock | 2023/2/20 |
Selfcheck: Using llms to zero-shot check their own step-by-step reasoning | International Conference on Learning Representations | Ning Miao Yee Whye Teh Tom Rainforth | 2024 |
Continually learning representations at scale | Alexandre Galashov Jovana Mitrovic Dhruva Tirumala Yee Whye Teh Timothy Nguyen | 2023/11/20 | |
Modality-agnostic variational compression of implicit neural representations | arXiv preprint arXiv:2301.09479 | Jonathan Richard Schwarz Jihoon Tack Yee Whye Teh Jaeho Lee Jinwoo Shin | 2023/1/23 |
Drug discovery under covariate shift with domain-informed prior distributions over functions | Leo Klarner Tim GJ Rudner Michael Reutlinger Torsten Schindler Garrett M Morris | 2023/7 |