Haokun Liu
New York University
H-index: 11
United States
Top articles of Haokun Liu
Learning to Route Among Specialized Experts for Zero-Shot Generalization
arXiv preprint arXiv:2402.05859
2024/2/8
Haokun Liu, Colin Raffel
Git-Theta: A Git extension for collaborative development of machine learning models
2023/7/3
Haokun Liu, Colin Raffel
Soft merging of experts with adaptive routing
arXiv preprint arXiv:2306.03745
2023/6/6
Haokun Liu, Colin Raffel
Models with conditional computation learn suboptimal solutions
2022/12/6
Haokun Liu, Colin Raffel
Few-shot parameter-efficient fine-tuning is better and cheaper than in-context learning
Advances in Neural Information Processing Systems
2022/12/6
Fine-tuned transformers show clusters of similar representations across layers
arXiv preprint arXiv:2109.08406
2021/9/17
Jason Phang, Haokun Liu
Comparing test sets with item response theory
arXiv preprint arXiv:2106.00840
2021/6/1
Learning which features matter: RoBERTa acquires a preference for linguistic generalizations (eventually)
arXiv preprint arXiv:2010.05358
2020/10/11
Counterfactually-augmented SNLI training data does not yield better generalization than unaugmented data
arXiv preprint arXiv:2010.04762
2020/10/9
William Huang, Haokun Liu
Precise task formalization matters in Winograd schema evaluations
arXiv preprint arXiv:2010.04043
2020/10/8
Haokun Liu, William Huang
BLiMP: The benchmark of linguistic minimal pairs for English
Transactions of the Association for Computational Linguistics
2020/7/1
English intermediate-task training improves zero-shot cross-lingual transfer too
arXiv preprint arXiv:2005.13013
2020/5/26
Intermediate-task transfer learning with pretrained models for natural language understanding: When and why does it work?
arXiv preprint arXiv:2005.00628
2020/5/1
jiant: A software toolkit for research on general-purpose text understanding models
arXiv preprint arXiv:2003.02249
2020/3/4