Canwen Xu
University of California, San Diego
H-index: 16
North America-United States
Top articles of Canwen Xu
Title | Journal | Author(s) | Publication Date |
---|---|---|---|
StarCoder 2 and The Stack v2: The Next Generation | arXiv preprint arXiv:2402.19173 | Anton Lozhkov Raymond Li Loubna Ben Allal Federico Cassano Joel Lamy-Poirier | 2024/2/29 |
Contrastive post-training large language models on data curriculum | arXiv preprint arXiv:2310.02263 | Canwen Xu Corby Rosset Luciano Del Corro Shweti Mahajan Julian McAuley | 2023/10/3 |
A survey on dynamic neural networks for natural language processing | Canwen Xu Julian McAuley | 2023/2/15 | |
LongCoder: A Long-Range Pre-trained Language Model for Code Completion | Daya Guo Canwen Xu Nan Duan Jian Yin Julian McAuley | 2023/7/3 | |
A survey on model compression and acceleration for pretrained language models | Canwen Xu Julian McAuley | 2023/2/15 | |
Repobench: Benchmarking repository-level code auto-completion systems | arXiv preprint arXiv:2306.03091 | Tianyang Liu Canwen Xu Julian McAuley | 2023/6/5 |
Small models are valuable plug-ins for large language models | arXiv preprint arXiv:2305.08848 | Canwen Xu Yichong Xu Shuohang Wang Yang Liu Chenguang Zhu | 2023/5/15 |
Spoiler Detection as Semantic Text Matching | Ryan Tran Canwen Xu Julian McAuley | 2023/12/1 | |
Mirror: A Natural Language Interface for Data Querying, Summarization, and Visualization | Canwen Xu Julian McAuley Penghan Wang | 2023/4/30 | |
Bloom: A 176b-parameter open-access multilingual language model | Teven Le Scao Angela Fan Christopher Akiki Ellie Pavlick Suzana Ilić | 2023/11/20 | |
Baize: An open-source chat model with parameter-efficient tuning on self-chat data | arXiv preprint arXiv:2304.01196 | Canwen Xu Daya Guo Nan Duan Julian McAuley | 2023/4/3 |
Automatic Multi-Label Prompting: Simple and Interpretable Few-Shot Classification | NAACL 2022 | Han Wang Canwen Xu Julian McAuley | 2022/4/13 |
LaPraDoR: Unsupervised Pretrained Dense Retriever for Zero-Shot Text Retrieval | Canwen Xu Daya Guo Nan Duan Julian McAuley | 2022/3/11 | |
Leashing the Inner Demons: Self-Detoxification for Language Models | Canwen Xu Zexue He Zhankui He Julian McAuley | 2022/3/6 | |
Efficiently Tuned Parameters are Task Embeddings | Wangchunshu Zhou Canwen Xu Julian McAuley | 2022/10/21 | |
PromptSource: An Integrated Development Environment and Repository for Natural Language Prompts | arXiv preprint arXiv:2202.01279 | Stephen H Bach Victor Sanh Zheng-Xin Yong Albert Webson Colin Raffel | 2022/2/2 |
Informask: Unsupervised informative masking for language model pretraining | Nafis Sadeq Canwen Xu Julian McAuley | 2022/10/21 | |
BERT learns to teach: Knowledge distillation with meta learning | Wangchunshu Zhou Canwen Xu Julian McAuley | 2022/5 | |
???? Datasets: A Community Library for Natural Language Processing | EMNLP 2021 (Demo) | Quentin Lhoest Albert Villanova del Moral Yacine Jernite Abhishek Thakur Patrick von Platen | 2021/9/7 |
Beyond Preserved Accuracy: Evaluating Loyalty and Robustness of BERT Compression | EMNLP 2021 | Canwen Xu Wangchunshu Zhou Tao Ge Ke Xu Julian McAuley | 2021/9/7 |