He He
New York University
H-index: 30
North America-United States
Top articles of He He
Title | Journal | Author(s) | Publication Date |
---|---|---|---|
Testing the general deductive reasoning capacity of large language models using ood examples | Advances in Neural Information Processing Systems | Abulhair Saparov Richard Yuanzhe Pang Vishakh Padmakumar Nitish Joshi Mehran Kazemi | 2024/2/13 |
Iterative Reasoning Preference Optimization | arXiv preprint arXiv:2404.19733 | Richard Yuanzhe Pang Weizhe Yuan Kyunghyun Cho He He Sainbayar Sukhbaatar | 2024/4/30 |
Towards Consistent Natural-Language Explanations via Explanation-Consistency Finetuning | arXiv preprint arXiv:2401.13986 | Yanda Chen Chandan Singh Xiaodong Liu Simiao Zuo Bin Yu | 2024/1/25 |
The PRISM Alignment Project: What Participatory, Representative and Individualised Human Feedback Reveals About the Subjective and Multicultural Alignment of Large Language Models | arXiv preprint arXiv:2404.16019 | Hannah Rose Kirk Alexander Whitefield Paul Röttger Andrew Bean Katerina Margatina | 2024/4/24 |
Solving olympiad geometry without human demonstrations | Nature | Trieu H Trinh Yuhuai Wu Quoc V Le He He Thang Luong | 2024/1 |
Foundational challenges in assuring alignment and safety of large language models | arXiv preprint arXiv:2404.09932 | Usman Anwar Abulhair Saparov Javier Rando Daniel Paleka Miles Turpin | 2024/4/15 |
Your Co-Workers Matter: Evaluating Collaborative Capabilities of Language Models in Blocks World | arXiv preprint arXiv:2404.00246 | Guande Wu Chen Zhao Claudio Silva He He | 2024/3/30 |
Parallel Structures in Pre-training Data Yield In-Context Learning | arXiv preprint arXiv:2402.12530 | Yanda Chen Chen Zhao Zhou Yu Kathleen McKeown He He | 2024/2/19 |
Show Your Work with Confidence: Confidence Bands for Tuning Curves | arXiv preprint arXiv:2311.09480 | Nicholas Lourie Kyunghyun Cho He He | 2023/11/16 |
Leveraging Implicit Feedback from Deployment Data in Dialogue | arXiv preprint arXiv:2307.14117 | Richard Yuanzhe Pang Stephen Roller Kyunghyun Cho He He Jason Weston | 2023/7/26 |
Personas as a way to model truthfulness in language models | arXiv preprint arXiv:2310.18168 | Nitish Joshi Javier Rando Abulhair Saparov Najoung Kim He He | 2023/10/27 |
Efficient shapley values estimation by amortization for text classification | ACL 2023 Long Paper | Chenghao Yang Fan Yin He He Kai-Wei Chang Xiaofei Ma | 2023 |
Does Writing with Language Models Reduce Content Diversity? | arXiv preprint arXiv:2309.05196 | Vishakh Padmakumar He He | 2023/9/11 |
Measuring inductive biases of in-context learning with underspecified demonstrations | arXiv preprint arXiv:2305.13299 | Chenglei Si Dan Friedman Nitish Joshi Shi Feng Danqi Chen | 2023/5/22 |
Improving Multi-Hop Reasoning in LLMs by Learning from Rich Human Feedback | Nitish Joshi Koushik Kalyanaraman Zhiting Hu Kumar Chellapilla He He | 2023/12/11 | |
Do models explain themselves? counterfactual simulatability of natural language explanations | arXiv preprint arXiv:2307.08678 | Yanda Chen Ruiqi Zhong Narutatsu Ri Chen Zhao He He | 2023/7/17 |
How do decoding algorithms distribute information in dialogue responses? | arXiv preprint arXiv:2303.17006 | Saranya Venkatraman He He David Reitter | 2023/5 |
Pragmatic Radiology Report Generation | Dang Nguyen Chacha Chen He He Chenhao Tan | 2023/12/4 | |
Extrapolative controlled sequence generation via iterative refinement | Vishakh Padmakumar Richard Yuanzhe Pang He He Ankur P Parikh | 2023/7/3 | |
Creative Natural Language Generation | Tuhin Chakrabarty Vishakh Padmakumar He He Nanyun Peng | 2023/12 |