Liyuan Liu
University of Illinois at Urbana-Champaign
H-index: 21
North America-United States
Top articles of Liyuan Liu
Title | Journal | Author(s) | Publication Date |
---|---|---|---|
Learning a Decision Tree Algorithm with Transformers | arXiv preprint | Yufan Zhuang Liyuan Liu Chandan Singh Jingbo Shang Jianfeng Gao | 2024/2/6 |
Bridging discrete and backpropagation: Straight-through and beyond | Advances in Neural Information Processing Systems | Liyuan Liu Chengyu Dong Xiaodong Liu Bin Yu Jianfeng Gao | 2024/2/13 |
Understand and modularize generator optimization in ELECTRA-style pretraining | | Chengyu Dong Liyuan Liu Hao Cheng Jingbo Shang Jianfeng Gao | 2023/7/3 |
Tell your model where to attend: Post-hoc attention steering for LLMs | ICLR | Qingru Zhang Chandan Singh Liyuan Liu Xiaodong Liu Bin Yu | 2024/1/1 |
Toward Student-oriented Teacher Network Training for Knowledge Distillation | | Chengyu Dong Liyuan Liu Jingbo Shang | 2023/10/13 |
Fast-ELECTRA for Efficient Pre-training | arXiv preprint arXiv:2310.07347 | Chengyu Dong Liyuan Liu Hao Cheng Jingbo Shang Jianfeng Gao | 2023/10/11 |
Model tells you what to discard: Adaptive KV cache compression for LLMs | ICLR 2024 Oral | Suyu Ge Yunan Zhang Liyuan Liu Minjia Zhang Jiawei Han | 2023/10/3 |
Sparse backpropagation for MoE training | arXiv preprint arXiv:2310.00811 | Liyuan Liu Jianfeng Gao Weizhu Chen | 2023/10/1 |
Label noise in adversarial training: A novel perspective to study robust overfitting | Advances in Neural Information Processing Systems | Chengyu Dong Liyuan Liu Jingbo Shang | 2022/12/6 |
SoTeacher: Toward Student-oriented Teacher Network Training for Knowledge Distillation | | Chengyu Dong Liyuan Liu Jingbo Shang | 2022/9/29 |
SoTeacher: A Student-oriented Teacher Network Training Framework for Knowledge Distillation | arXiv preprint arXiv:2206.06661 | Chengyu Dong Liyuan Liu Jingbo Shang | 2022/6/14 |
P4E: Few-Shot Event Detection as Prompt-Guided Identification and Localization | arXiv preprint arXiv:2202.07615 | Sha Li Liyuan Liu Yiqing Xie Heng Ji Jiawei Han | 2022/2/15 |
UCPhrase: Unsupervised context-aware quality phrase tagging | | Xiaotao Gu Zihan Wang Zhenyu Bi Yu Meng Liyuan Liu | 2021/8/14 |
Multi-head or single-head? An empirical comparison for transformer training | arXiv preprint arXiv:2106.09650 | Liyuan Liu Jialu Liu Jiawei Han | 2021/6/17 |
Empower distantly supervised relation extraction with collaborative adversarial training | Proceedings of the AAAI Conference on Artificial Intelligence | Tao Chen Haochen Shi Liyuan Liu Siliang Tang Jian Shao | 2021/5/18 |
Data quality matters for adversarial training: An empirical study | arXiv preprint arXiv:2102.07437 | Chengyu Dong Liyuan Liu Jingbo Shang | 2021/2/15 |
Partially-typed NER datasets integration: Connecting practice to theory | arXiv preprint arXiv:2005.00502 | Shi Zhi Liyuan Liu Yu Zhang Shiyin Wang Qi Li | 2020/5/1 |
Towards adaptive residual network training: A neural-ODE perspective | | Chengyu Dong Liyuan Liu Zichao Li Jingbo Shang | 2020/11/21 |
NetTaxo: Automated topic taxonomy construction from text-rich network | | Jingbo Shang Xinyang Zhang Liyuan Liu Sha Li Jiawei Han | 2020/4/20 |
On the transformer growth for progressive BERT training | arXiv preprint arXiv:2010.12562 | Xiaotao Gu Liyuan Liu Hongkun Yu Jing Li Chen Chen | 2020/10/23 |