Noah D. Goodman
Stanford University
H-index: 76
North America-United States
Top articles of Noah D. Goodman
Title | Journal | Author(s) | Publication Date |
---|---|---|---|
Emotional Theory of Mind in Humans and Machines (Experiment 2) | Kanishk Gandhi Jan-Philipp Fränken Desmond C Ong Noah D Goodman | 2024/2/13 | |
pyvene: A library for understanding and improving PyTorch models via interventions | arXiv preprint arXiv:2403.07809 | Zhengxuan Wu Atticus Geiger Aryaman Arora Jing Huang Zheng Wang | 2024/3/12 |
Feature dropout: Revisiting the role of augmentations in contrastive learning | Advances in Neural Information Processing Systems | Alex Tamkin Margalit Glasgow Xiluo He Noah Goodman | 2024/2/13 |
Bayesian reinforcement learning with limited cognitive load | Open Mind | Dilip Arumugam Mark K Ho Noah D Goodman Benjamin Van Roy | 2024/4/16 |
Interaction structure constrains the emergence of conventions in group communication | PsyArXiv | Veronica Boyce Robert Hawkins Noah D Goodman Michael C Frank | 2024 |
Bayesian preference elicitation with language models | arXiv preprint arXiv:2403.05534 | Kunal Handa Yarin Gal Ellie Pavlick Noah Goodman Jacob Andreas | 2024/3/8 |
Stream of Search (SoS): Learning to Search in Language | arXiv preprint arXiv:2404.03683 | Kanishk Gandhi Denise Lee Gabriel Grand Muxin Liu Winson Cheng | 2024/4/1 |
Understanding social reasoning in language models with language models | Kanishk Gandhi Jan-Philipp Fränken Tobias Gerstenberg Noah D Goodman | 2023/6/21 | |
A Reply to Makelov et al.(2023)'s" Interpretability Illusion" Arguments | arXiv preprint arXiv:2401.12631 | Zhengxuan Wu Atticus Geiger Jing Huang Aryaman Arora Thomas Icard | 2024/1/23 |
Backtracing: Retrieving the Cause of the Query | arXiv preprint arXiv:2403.03956 | Rose E Wang Pawan Wirawarn Omar Khattab Noah Goodman Dorottya Demszky | 2024/3/6 |
Interpretability at scale: Identifying causal mechanisms in alpaca | Advances in Neural Information Processing Systems | Zhengxuan Wu Atticus Geiger Thomas Icard Christopher Potts Noah Goodman | 2024/2/13 |
Star-gate: Teaching language models to ask clarifying questions | arXiv preprint arXiv:2403.19154 | Chinmaya Andukuri Jan-Philipp Fränken Tobias Gerstenberg Noah D Goodman | 2024/3/28 |
Evaluating and Optimizing Educational Content with Large Language Model Judgments | arXiv preprint arXiv:2403.02795 | Joy He-Yueya Noah D Goodman Emma Brunskill | 2024/3/5 |
Finding alignments between interpretable causal variables and distributed neural representations | Atticus Geiger Zhengxuan Wu Christopher Potts Thomas Icard Noah Goodman | 2024/3/15 | |
Learning to compress prompts with gist tokens | Advances in Neural Information Processing Systems | Jesse Mu Xiang Li Noah Goodman | 2023/12/10 |
Self-Supervised Alignment with Mutual Information: Learning to Follow Principles without Preference Labels | arXiv preprint arXiv:2404.14313 | Jan-Philipp Fränken Eric Zelikman Rafael Rafailov Kanishk Gandhi Tobias Gerstenberg | 2024/4/22 |
Automated Statistical Model Discovery with Language Models | arXiv preprint arXiv:2402.17879 | Michael Y Li Emily B Fox Noah D Goodman | 2024/2/27 |
Why think step by step? Reasoning emerges from the locality of experience | arXiv preprint arXiv:2304.03843 | Ben Prystawski Michael Y Li Noah D Goodman | 2023/4/7 |
Quiet-star: Language models can teach themselves to think before speaking | arXiv preprint arXiv:2403.09629 | Eric Zelikman Georges Harik Yijia Shao Varuna Jayasiri Nick Haber | 2024/3/14 |
Procedural Dilemma Generation for Evaluating Moral Reasoning in Humans and Language Models | arXiv preprint arXiv:2404.10975 | Jan-Philipp Fränken Kanishk Gandhi Tori Qiu Ayesha Khawaja Noah D Goodman | 2024/4/17 |