Jason Phang
New York University
H-index: 23
North America-United States
Top articles of Jason Phang
Title | Journal | Author(s) | Publication Date |
---|---|---|---|
Investigating the Effectiveness of HyperTuning via Gisting | arXiv preprint arXiv:2402.16817 | Jason Phang | 2024/2/26 |
Tool learning with foundation models | arXiv preprint arXiv:2304.08354 | Yujia Qin Shengding Hu Yankai Lin Weize Chen Ning Ding | 2023/4/17 |
Hypertuning: Toward adapting large language models without back-propagation | Jason Phang Yi Mao Pengcheng He Weizhu Chen | 2023/7/3 | |
Bloom: A 176b-parameter open-access multilingual language model | Teven Le Scao Angela Fan Christopher Akiki Ellie Pavlick Suzana Ilić | 2023/11/20 | |
Two failures of self-consistency in the multi-step reasoning of llms | arXiv preprint arXiv:2305.14279 | Angelica Chen Jason Phang Alicia Parrish Vishakh Padmakumar Chen Zhao | 2023/5/23 |
Pretraining language models with human preferences | Tomasz Korbak Kejian Shi Angelica Chen Rasika Vinayak Bhalerao Christopher Buckley | 2023/7/3 | |
Single-turn debate does not help humans answer hard reading-comprehension questions | arXiv preprint arXiv:2204.05212 | Alicia Parrish Harsh Trivedi Ethan Perez Angelica Chen Nikita Nangia | 2022/4/11 |
Squality: Building a long-document summarization dataset the hard way | arXiv preprint arXiv:2205.11465 | Alex Wang Richard Yuanzhe Pang Angelica Chen Jason Phang Samuel R Bowman | 2022/5/23 |
Investigating efficiently extending transformers for long input summarization | arXiv preprint arXiv:2208.04347 | Jason Phang Yao Zhao Peter J Liu | 2022/8/8 |
QuALITY: Question Answering with Long Input Texts, Yes! | NAACL 2022 | Samuel R Bowman Angelica Chen He He Nitish Joshi Johnny Ma | 2022/5 |
What language model to train if you have one million gpu hours? | Teven Le Scao Thomas Wang Daniel Hesslow Lucile Saulnier Stas Bekman | 2022/3/9 | |
EleutherAI: Going Beyond" Open Science" to" Science in the Open" | arXiv preprint arXiv:2210.06413 | Jason Phang Herbie Bradley Leo Gao Louis Castricato Stella Biderman | 2022/10/12 |
Beyond the imitation game: Quantifying and extrapolating the capabilities of language models | arXiv preprint arXiv:2206.04615 | Aarohi Srivastava Abhinav Rastogi Abhishek Rao Abu Awal Md Shoeb Abubakar Abid | 2022/6/9 |
Gpt-neox-20b: An open-source autoregressive language model | arXiv preprint arXiv:2204.06745 | Sid Black Stella Biderman Eric Hallahan Quentin Anthony Leo Gao | 2022/4/14 |
Two-Turn Debate Doesn't Help Humans Answer Hard Reading Comprehension Questions | arXiv preprint arXiv:2210.10860 | Alicia Parrish Harsh Trivedi Nikita Nangia Vishakh Padmakumar Jason Phang | 2022/10/19 |
What do nlp researchers believe? results of the nlp community metasurvey | arXiv preprint arXiv:2208.12852 | Julian Michael Ari Holtzman Alicia Parrish Aaron Mueller Alex Wang | 2022/8/26 |
Comparing test sets with item response theory | arXiv preprint arXiv:2106.00840 | Clara Vania Phu Mon Htut William Huang Dhara Mungra Richard Yuanzhe Pang | 2021/6/1 |
BBQ: A hand-built bias benchmark for question answering | arXiv preprint arXiv:2110.08193 | Alicia Parrish Angelica Chen Nikita Nangia Vishakh Padmakumar Jason Phang | 2021/10/15 |
Reducing false-positive biopsies using deep neural networks that utilize both local and global image context of screening mammograms | Journal of Digital Imaging | Nan Wu Zhe Huang Yiqiu Shen Jungkyu Park Jason Phang | 2021/12 |
An interpretable classifier for high-resolution breast cancer screening images utilizing weakly supervised localization | Medical image analysis | Yiqiu Shen Nan Wu Jason Phang Jungkyu Park Kangning Liu | 2021/2/1 |