Canwen Xu

University of California, San Diego

H-index: 16

North America-United States

About Canwen Xu

Canwen Xu is a researcher at the University of California, San Diego, specializing in natural language processing and machine learning. He has an h-index of 16, both overall and since 2020.

His recent articles include:

StarCoder 2 and The Stack v2: The Next Generation

Contrastive post-training large language models on data curriculum

A survey on dynamic neural networks for natural language processing

LongCoder: A Long-Range Pre-trained Language Model for Code Completion

A survey on model compression and acceleration for pretrained language models

Repobench: Benchmarking repository-level code auto-completion systems

Small models are valuable plug-ins for large language models

Spoiler Detection as Semantic Text Matching

Canwen Xu Information

University

Position

___

Citations (all): 15089

Citations (since 2020): 15077

Cited By: 1946

h-index (all): 16

h-index (since 2020): 16

i10-index (all): 21

i10-index (since 2020): 21

Email

University Profile Page

University of California, San Diego

Canwen Xu Skills & Research Interests

natural language processing

machine learning

Top articles of Canwen Xu

Title: StarCoder 2 and The Stack v2: The Next Generation
Journal: arXiv preprint arXiv:2402.19173
Author(s): Anton Lozhkov, Raymond Li, Loubna Ben Allal, Federico Cassano, Joel Lamy-Poirier, ...
Publication Date: 2024/2/29

Title: Contrastive post-training large language models on data curriculum
Journal: arXiv preprint arXiv:2310.02263
Author(s): Canwen Xu, Corby Rosset, Luciano Del Corro, Shweti Mahajan, Julian McAuley, ...
Publication Date: 2023/10/3

Title: A survey on dynamic neural networks for natural language processing
Author(s): Canwen Xu, Julian McAuley
Publication Date: 2023/2/15

Title: LongCoder: A Long-Range Pre-trained Language Model for Code Completion
Author(s): Daya Guo, Canwen Xu, Nan Duan, Jian Yin, Julian McAuley
Publication Date: 2023/7/3

Title: A survey on model compression and acceleration for pretrained language models
Author(s): Canwen Xu, Julian McAuley
Publication Date: 2023/2/15

Title: Repobench: Benchmarking repository-level code auto-completion systems
Journal: arXiv preprint arXiv:2306.03091
Author(s): Tianyang Liu, Canwen Xu, Julian McAuley
Publication Date: 2023/6/5

Title: Small models are valuable plug-ins for large language models
Journal: arXiv preprint arXiv:2305.08848
Author(s): Canwen Xu, Yichong Xu, Shuohang Wang, Yang Liu, Chenguang Zhu, ...
Publication Date: 2023/5/15

Title: Spoiler Detection as Semantic Text Matching
Author(s): Ryan Tran, Canwen Xu, Julian McAuley
Publication Date: 2023/12/1

Title: Mirror: A Natural Language Interface for Data Querying, Summarization, and Visualization
Author(s): Canwen Xu, Julian McAuley, Penghan Wang
Publication Date: 2023/4/30

Title: Bloom: A 176b-parameter open-access multilingual language model
Author(s): Teven Le Scao, Angela Fan, Christopher Akiki, Ellie Pavlick, Suzana Ilić, ...
Publication Date: 2023/11/20

Title: Baize: An open-source chat model with parameter-efficient tuning on self-chat data
Journal: arXiv preprint arXiv:2304.01196
Author(s): Canwen Xu, Daya Guo, Nan Duan, Julian McAuley
Publication Date: 2023/4/3

Title: Automatic Multi-Label Prompting: Simple and Interpretable Few-Shot Classification
Journal: NAACL 2022
Author(s): Han Wang, Canwen Xu, Julian McAuley
Publication Date: 2022/4/13

Title: LaPraDoR: Unsupervised Pretrained Dense Retriever for Zero-Shot Text Retrieval
Author(s): Canwen Xu, Daya Guo, Nan Duan, Julian McAuley
Publication Date: 2022/3/11

Title: Leashing the Inner Demons: Self-Detoxification for Language Models
Author(s): Canwen Xu, Zexue He, Zhankui He, Julian McAuley
Publication Date: 2022/3/6

Title: Efficiently Tuned Parameters are Task Embeddings
Author(s): Wangchunshu Zhou, Canwen Xu, Julian McAuley
Publication Date: 2022/10/21

Title: PromptSource: An Integrated Development Environment and Repository for Natural Language Prompts
Journal: arXiv preprint arXiv:2202.01279
Author(s): Stephen H Bach, Victor Sanh, Zheng-Xin Yong, Albert Webson, Colin Raffel, ...
Publication Date: 2022/2/2

Title: Informask: Unsupervised informative masking for language model pretraining
Author(s): Nafis Sadeq, Canwen Xu, Julian McAuley
Publication Date: 2022/10/21

Title: BERT learns to teach: Knowledge distillation with meta learning
Author(s): Wangchunshu Zhou, Canwen Xu, Julian McAuley
Publication Date: 2022/5

Title: Datasets: A Community Library for Natural Language Processing
Journal: EMNLP 2021 (Demo)
Author(s): Quentin Lhoest, Albert Villanova del Moral, Yacine Jernite, Abhishek Thakur, Patrick von Platen, ...
Publication Date: 2021/9/7

Title: Beyond Preserved Accuracy: Evaluating Loyalty and Robustness of BERT Compression
Journal: EMNLP 2021
Author(s): Canwen Xu, Wangchunshu Zhou, Tao Ge, Ke Xu, Julian McAuley, ...
Publication Date: 2021/9/7
