Allan Dafoe
University of Oxford
H-index: 34
Europe-United Kingdom
Top articles of Allan Dafoe
Title | Journal | Author(s) | Publication Date |
---|---|---|---|
The Ethics of Advanced AI Assistants | arXiv preprint arXiv:2404.16244 | Iason Gabriel Arianna Manzini Geoff Keeling Lisa Anne Hendricks Verena Rieser | 2024/4/24 |
Holistic Safety and Responsibility Evaluations of Advanced AI Models | arXiv preprint arXiv:2404.14068 | Laura Weidinger Joslyn Barnhart Jenny Brennan Christina Butterfield Susie Young | 2024/4/22 |
Evaluating Frontier Models for Dangerous Capabilities | arXiv preprint arXiv:2403.13793 | Mary Phuong Matthew Aitchison Elliot Catt Sarah Cogan Alexandre Kaskasoli | 2024/3/20 |
AI Governance | The Oxford Handbook of AI Governance | Allan Dafoe | 2024/2/26 |
Democratising AI: Multiple meanings, goals, and methods | Elizabeth Seger Aviv Ovadya Divya Siddarth Ben Garfinkel Allan Dafoe | 2023/8/8 | |
Gemini: a family of highly capable multimodal models | arXiv preprint arXiv:2312.11805 | Gemini Team Rohan Anil Sebastian Borgeaud Yonghui Wu Jean-Baptiste Alayrac | 2023/12/19 |
Engines of power: Electricity, AI, and general-purpose, military transformations | European Journal of International Security | Jeffrey Ding Allan Dafoe | 2023/8 |
Leader age and international conflict: A regression discontinuity analysis | Journal of Peace Research | Andrew Bertoli Allan Dafoe Robert Trager | 2023/12/7 |
International institutions for advanced AI | arXiv preprint arXiv:2307.04699 | Lewis Ho Joslyn Barnhart Robert Trager Yoshua Bengio Miles Brundage | 2023/7/10 |
Levels of AGI: Operationalizing Progress on the Path to AGI | arXiv preprint arXiv:2311.02462 | Meredith Ringel Morris Jascha Sohl-dickstein Noah Fiedel Tris Warkentin Allan Dafoe | 2023/11/4 |
Model evaluation for extreme risks | arXiv preprint arXiv:2305.15324 | Toby Shevlane Sebastian Farquhar Ben Garfinkel Mary Phuong Jess Whittlestone | 2023/5/24 |
Randomisation inference beyond the sharp null: bounded null hypotheses and quantiles of individual treatment effects | Journal of the Royal Statistical Society Series B: Statistical Methodology | Devin Caughey Allan Dafoe Xinran Li Luke Miratrix | 2023/11 |
Placebo tests for causal inference | American Journal of Political Science | Andrew C Eggers Guadalupe Tuñón Allan Dafoe | 2023/8/17 |
EJIS | Edward Newman Jason Ralph Jacqui True Karin Aggestam Navnita Chadha Behera | 2021/2 | |
Ethics and governance of artificial intelligence: a survey of machine learning researchers | Baobao Zhang Markus Anderljung Lauren Kahn Noemi Dreksler Michael C Horowitz | 2022 | |
Safety Not Guaranteed: International Races for Risky Technologies | Eoghan Stafford Robert F Trager Allan Dafoe | 2022/11 | |
Differential technology development: A responsible innovation principle for navigating technology risks | Available at SSRN 4213670 | Jonas Sandbrink Hamish Hobbs Jacob Swett Allan Dafoe Anders Sandberg | 2022/9/8 |
Forecasting AI progress: Evidence from a survey of machine learning researchers | arXiv preprint arXiv:2206.04132 | Baobao Zhang Noemi Dreksler Markus Anderljung Lauren Kahn Charlie Giattino | 2022/6/8 |
Provocation, public opinion, and international disputes: Evidence from China | International Studies Quarterly | Allan Dafoe Samuel Liu Brian O'Keefe Jessica Chen Weiss | 2022/6 |
AI governance: overview and theoretical lenses | Allan Dafoe | 2022/2/14 |