Maarten Sap
University of Washington
H-index: 38
North America-United States
Top articles of Maarten Sap
Title | Journal | Author(s) | Publication Date |
---|---|---|---|
NORMAD: A Benchmark for Measuring the Cultural Adaptability of Large Language Models | arXiv preprint arXiv:2404.12464 | Abhinav Rao Akhila Yerukola Vishwa Shah Katharina Reinecke Maarten Sap | 2024/4/18 |
Particip-AI: A Democratic Surveying Framework for Anticipating Future AI Use Cases, Harms and Benefits | arXiv preprint arXiv:2403.14791 | Jimin Mun Liwei Jiang Jenny Liang Inyoung Cheong Nicole DeCario | 2024/3/21 |
SOTOPIA-: Interactive Learning of Socially Intelligent Language Agents | arXiv preprint arXiv:2403.08715 | Ruiyi Wang Haofei Yu Wenxin Zhang Zhengyang Qi Maarten Sap | 2024/3/13 |
Is this the real life? is this just fantasy? the misleading success of simulating social interactions with llms | arXiv preprint arXiv:2403.05020 | Xuhui Zhou Zhe Su Tiwalayo Eisape Hyunwoo Kim Maarten Sap | 2024/3/8 |
Counterspeakers' Perspectives: Unveiling Barriers and AI Needs in the Fight against Online Hate | arXiv preprint arXiv:2403.00179 | Jimin Mun Cathy Buerger Jenny T Liang Joshua Garland Maarten Sap | 2024/2/29 |
Relying on the Unreliable: The Impact of Language Models' Reluctance to Express Uncertainty | arXiv preprint arXiv:2401.06730 | Kaitlyn Zhou Jena D Hwang Xiang Ren Maarten Sap | 2024/1/12 |
Can LLMs keep a secret? Testing privacy implications of language models via contextual integrity theory | Niloofar Mireshghallah Hyunwoo Kim Xuhui Zhou Yulia Tsvetkov Maarten Sap | 2024 | |
Sotopia: Interactive evaluation for social intelligence in language agents | arXiv preprint arXiv:2310.11667 | Xuhui Zhou Hao Zhu Leena Mathur Ruohong Zhang Haofei Yu | 2023/10/18 |
Value kaleidoscope: Engaging ai with pluralistic human values, rights, and duties | AAAI | Taylor Sorensen Liwei Jiang Jena Hwang Sydney Levine Valentina Pyatkin | 2023/9/2 |
Clever hans or neural theory of mind? stress testing social reasoning in large language models | arXiv preprint arXiv:2305.14763 | Natalie Shapira Mosh Levy Seyed Hossein Alavi Xuhui Zhou Yejin Choi | 2023/5/24 |
Beyond Denouncing Hate: Strategies for Countering Implied Biases and Stereotypes in Language | Jimin Mun Emily Allaway Akhila Yerukola Laura Vianna Sarah-Jane Leslie | 2023/12 | |
Where Do People Tell Stories Online? Story Detection Across Online Communities | arXiv preprint arXiv:2311.09675 | Maria Antoniak Joel Mire Maarten Sap Elliott Ash Andrew Piper | 2023/11/16 |
FANToM: A benchmark for stress-testing machine theory of mind in interactions | arXiv preprint arXiv:2310.15421 | Hyunwoo Kim Melanie Sclar Xuhui Zhou Ronan Le Bras Gunhee Kim | 2023/10/24 |
Queer in AI: a case study in community-led participatory AI | Organizers Of Queerinai Anaelia Ovalle Arjun Subramonian Ashwin Singh Claas Voelcker | 2023/6/12 | |
Cobra frames: Contextual reasoning about effects and harms of offensive statements | Xuhui Zhou Hao Zhu Akhila Yerukola Thomas Davidson Jena D Hwang | 2023/6/3 | |
NLPositionality: Characterizing design biases of datasets and models | arXiv preprint arXiv:2306.01943 | Sebastin Santy Jenny T Liang Ronan Le Bras Katharina Reinecke Maarten Sap | 2023/6/2 |
From dogwhistles to bullhorns: Unveiling coded rhetoric with language models | arXiv preprint arXiv:2305.17174 | Julia Mendelsohn Ronan Le Bras Yejin Choi Maarten Sap | 2023/5/26 |
Improving language models with advantage-based offline policy gradients | arXiv preprint arXiv:2305.14718 | Ashutosh Baheti Ximing Lu Faeze Brahman Ronan Le Bras Maarten Sap | 2023/5/24 |
Don't Take This Out of Context! On the Need for Contextual Models and Evaluations for Stylistic Rewriting | arXiv preprint arXiv:2305.14755 | Akhila Yerukola Xuhui Zhou Maarten Sap | 2023/5/24 |
BiasX:" Thinking Slow" in Toxic Content Moderation with Explanations of Implied Social Biases | arXiv preprint arXiv:2305.13589 | Yiming Zhang Sravani Nanduri Liwei Jiang Tongshuang Wu Maarten Sap | 2023/5/23 |