Peter Henderson
Stanford University
H-index: 22
North America-United States
Top articles of Peter Henderson
Title | Journal | Author(s) | Publication Date |
---|---|---|---|
FLawN-T5: An Empirical Examination of Effective Instruction-Tuning Data Mixtures for Legal Reasoning | arXiv preprint arXiv:2404.02127 | Joel Niklaus Lucia Zheng Arya D McCarthy Christopher Hahn Brian M Rosen | 2024/4/2 |
On the Societal Impact of Open Foundation Models | Sayash Kapoor Rishi Bommasani Kevin Klyman Shayne Longpre Ashwin Ramaswami | 2024/2/27 | |
What's in Your "Safe" Data?: Identifying Benign Data that Breaks Safety | arXiv preprint arXiv:2404.01099 | Luxi He Mengzhou Xia Peter Henderson | 2024/4/1 |
Cheaply estimating inference efficiency metrics for autoregressive transformer models | Advances in Neural Information Processing Systems | Deepak Narayanan Keshav Santhanam Peter Henderson Rishi Bommasani Tony Lee | 2023/11/2 |
Legalbench: A collaboratively built benchmark for measuring legal reasoning in large language models | Advances in Neural Information Processing Systems | Neel Guha Julian Nyarko Daniel Ho Christopher Ré Adam Chilton | 2024/2/13 |
Visual adversarial examples jailbreak aligned large language models | AAAI Conference on Artificial Intelligence, 2024 (Oral) | Xiangyu Qi Kaixuan Huang Ashwinee Panda Peter Henderson Mengdi Wang | 2023/6/22 |
Assessing the Brittleness of Safety Alignment via Pruning and Low-Rank Modifications | arXiv preprint arXiv:2402.05162 | Boyi Wei* Kaixuan Huang* Yangsibo Huang* Tinghao Xie Xiangyu Qi | 2024/2/7 |
Rethinking Machine Learning Benchmarks in the Context of Professional Codes of Conduct | Peter Henderson Jieru Hu Mona Diab Joelle Pineau | 2024/3/12 | |
Promises and pitfalls of artificial intelligence for legal applications | arXiv preprint arXiv:2402.01656 | Sayash Kapoor Peter Henderson Arvind Narayanan | 2024/1/10 |
Corpus Enigmas and Contradictory Linguistics: Tensions between Empirical Semantic Meaning and Judicial Interpretation | Minnesota Journal of Law, Science & Technology, Forthcoming | Peter Henderson Daniel E Ho Andrea Vallebueno Cassandra Handan-Nader | 2024/4/9 |
A Safe Harbor for AI Evaluation and Red Teaming | arXiv preprint arXiv:2403.04893 | Shayne Longpre Sayash Kapoor Kevin Klyman Ashwin Ramaswami Rishi Bommasani | 2024/3/7 |
Algorithmic Rulemaking vs. Algorithmic Guidance | Harvard Journal of Law & Technology | Peter Henderson Mark Krass | 2023 |
Fine-tuning Aligned Language Models Compromises Safety, Even When Users Do Not Intend To! | arXiv preprint arXiv:2310.03693 | Xiangyu Qi Yi Zeng Tinghao Xie Pin-Yu Chen Ruoxi Jia | 2023/10/5 |
Self-Destructing Models: Increasing the Costs of Harmful Dual Uses of Foundation Models | Peter Henderson Eric Mitchell Christopher Manning Dan Jurafsky Chelsea Finn | 2023/8/8 | |
Aligning Law, Policy, and Machine Learning for Responsible Real-World Deployments | Peter Henderson | 2023 | |
Entropy Regularization for Population Estimation | Proceedings of the AAAI Conference on Artificial Intelligence | Ben Chugg Peter Henderson Jacob Goldin Daniel E Ho | 2023/6/26 |
Freedom of Speech and AI Output | J. Free Speech L. | Eugene Volokh Mark A Lemley Peter Henderson | 2023 |
Where's the Liability in harmful AI Speech? | J. Free Speech L. | Peter Henderson Tatsunori Hashimoto Mark Lemley | 2023 |
Integrating reward maximization and population estimation: Sequential decision-making for Internal Revenue Service audit selection | Proceedings of the AAAI Conference on Artificial Intelligence | Peter Henderson Ben Chugg Brandon Anderson Kristen Altenburger Alex Turk | 2023/6/26 |
Foundation Models and Fair Use | arXiv preprint arXiv:2303.15715 | Peter Henderson Xuechen Li Dan Jurafsky Tatsunori Hashimoto Mark A Lemley | 2023/3/28 |