Yikang Shen at Université de Montréal

University	Université de Montréal
Position	Mila
Citations(all)	1849
Citations(since 2020)	1763
Cited By	558
hIndex(all)	16
hIndex(since 2020)	16
i10Index(all)	19
i10Index(since 2020)	19
Email	Access Email
University Profile Page	Université de Montréal
Google Scholar	View Google Scholar Profile

JetMoE: Reaching Llama2 Performance with 0.1 M Dollars

arXiv preprint arXiv:2404.07413

2024/4/11

Yikang Shen

H-Index: 8

Zhen Guo

H-Index: 2

Tianle Cai

H-Index: 4

Zengyi Qin

H-Index: 4

Dense Training, Sparse Inference: Rethinking Training of Mixture-of-Experts Language Models

arXiv preprint arXiv:2404.05567

2024/4/8

Bowen Pan

H-Index: 3

Yikang Shen

H-Index: 8

Haokun Liu

H-Index: 6

Mayank Mishra

H-Index: 8

Aude Oliva

H-Index: 60

Colin Raffel

H-Index: 29

Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision

arXiv preprint arXiv:2403.09472

2024/3/14

Zhiqing Sun

H-Index: 8

Yikang Shen

H-Index: 8

Weiyang Liu

H-Index: 16

Yiming Yang

H-Index: 4

Chuang Gan

H-Index: 37

Scattered Mixture-of-Experts Implementation

arXiv preprint arXiv:2403.08245

2024/3/13

Shawn Tan

H-Index: 5

Yikang Shen

H-Index: 8

Aaron Courville

H-Index: 69

Visual Chain-of-Thought Prompting for Knowledge-based Visual Reasoning

2024/2/18

Yikang Shen

H-Index: 8

Zhiqing Sun

H-Index: 8

Chuang Gan

H-Index: 37

API Pack: A Massive Multilingual Dataset for API Call Generation

arXiv preprint arXiv:2402.09615

2024/2/14

Zhen Guo

H-Index: 2

Wei Sun

H-Index: 7

Yikang Shen

H-Index: 8

Adaptive Online Replanning with Diffusion Models

NeurIPS

2023/10/14

Siyuan Zhou

H-Index: 6

Yilun Du

H-Index: 7

Shun Zhang

H-Index: 13

Mengdi Xu

H-Index: 3

Yikang Shen

H-Index: 8

Wei Xiao

H-Index: 23

Chuang Gan

H-Index: 37

Principle-driven self-alignment of language models from scratch with minimal human supervision

Advances in Neural Information Processing Systems

2024/2/13

Zhiqing Sun

H-Index: 8

Yikang Shen

H-Index: 8

Hongxin Zhang

H-Index: 11

Yiming Yang

H-Index: 4

Chuang Gan

H-Index: 37

Diversity Measurement and Subset Selection for Instruction Tuning Datasets

arXiv preprint arXiv:2402.02318

2024/2/4

Peiqi Wang

H-Index: 5

Yikang Shen

H-Index: 8

Zhen Guo

H-Index: 2

Yoon Kim

H-Index: 11

Polina Golland

H-Index: 32

Improving Reinforcement Learning from Human Feedback with Efficient Reward Model Ensemble

arXiv preprint arXiv:2401.16635

2024/1/30

Shun Zhang

H-Index: 13

Yikang Shen

H-Index: 8

Zhiqing Sun

H-Index: 8

Chuang Gan

H-Index: 37

Structured Code Representations Enable Data-Efficient Adaptation of Code Language Models

arXiv preprint arXiv:2401.10716

2024/1/19

Mayank Agarwal

H-Index: 2

Yikang Shen

H-Index: 8

Yoon Kim

H-Index: 11

Jie Chen

H-Index: 32

Gated linear attention transformers with hardware-efficient training

arXiv preprint arXiv:2312.06635

2023/12/11

Yikang Shen

H-Index: 8

Yoon Kim

H-Index: 11

CoVLM: Composing Visual Entities and Relationships in Large Language Models Via Communicative Decoding

arXiv preprint arXiv:2311.03354

2023/11/6

Junyan Li

H-Index: 8

Peihao Chen

H-Index: 7

Yikang Shen

H-Index: 8

Chuang Gan

H-Index: 37

Autonomous Tree-search Ability of Large Language Models

arXiv preprint arXiv:2310.10686

2023/10/14

Yikang Shen

H-Index: 8

Chuang Gan

H-Index: 37

Compositional VLM: Composing Visual Entities and Relationships in Large Language Models Via Communicative Decoding

2023/10/13

Junyan Li

H-Index: 8

Peihao Chen

H-Index: 7

Yikang Shen

H-Index: 8

Chuang Gan

H-Index: 37

The consensus game: Language model generation via equilibrium search

ICLR

2024

Athul Paul Jacob

H-Index: 4

Yikang Shen

H-Index: 8

Gabriele Farina

H-Index: 10

Jacob Andreas

H-Index: 24

Sparse universal transformer

arXiv preprint arXiv:2310.07096

2023/10/11

Shawn Tan

H-Index: 5

Yikang Shen

H-Index: 8

Aaron Courville

H-Index: 69

Chuang Gan

H-Index: 37

Salmon: Self-alignment with principle-following reward models

International Conference on Learning Representations (ICLR)

2024/1/16

Zhiqing Sun

H-Index: 8

Yikang Shen

H-Index: 8

Hongxin Zhang

H-Index: 11

Yiming Yang

H-Index: 4

Chuang Gan

H-Index: 37

Graphtext: Graph reasoning in text space

arXiv preprint arXiv:2310.01089

2023/10/2

Jianan Zhao

H-Index: 1

Yikang Shen

H-Index: 8

Meng Qu

H-Index: 4

Kai Liu

H-Index: 4

Michael Bronstein

H-Index: 7

Jian Tang

H-Index: 2

Aligning large multimodal models with factually augmented rlhf

arXiv preprint arXiv:2309.14525

2023/9/25

Haotian Liu

H-Index: 2

Yikang Shen

H-Index: 8

Chuang Gan

H-Index: 37

Yiming Yang

H-Index: 4

Kurt Keutzer

H-Index: 41

Trevor Darrell

H-Index: 101

Yikang Shen

Université de Montréal

About Yikang Shen

Yikang Shen Information

Yikang Shen Skills & Research Interests

Top articles of Yikang Shen

JetMoE: Reaching Llama2 Performance with 0.1 M Dollars

Yikang Shen

Zhen Guo

Tianle Cai

Zengyi Qin

Dense Training, Sparse Inference: Rethinking Training of Mixture-of-Experts Language Models

Bowen Pan

Yikang Shen

Haokun Liu

Mayank Mishra

Aude Oliva

Colin Raffel

Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision

Zhiqing Sun

Yikang Shen

Weiyang Liu

Yiming Yang

Chuang Gan

Scattered Mixture-of-Experts Implementation

Shawn Tan

Yikang Shen

Aaron Courville

Visual Chain-of-Thought Prompting for Knowledge-based Visual Reasoning

Yikang Shen

Zhiqing Sun

Chuang Gan

API Pack: A Massive Multilingual Dataset for API Call Generation

Zhen Guo

Wei Sun

Yikang Shen

Adaptive Online Replanning with Diffusion Models

Siyuan Zhou

Yilun Du

Shun Zhang

Mengdi Xu

Yikang Shen

Wei Xiao

Chuang Gan

Principle-driven self-alignment of language models from scratch with minimal human supervision

Zhiqing Sun

Yikang Shen

Hongxin Zhang

Yiming Yang

Chuang Gan

Diversity Measurement and Subset Selection for Instruction Tuning Datasets

Peiqi Wang

Yikang Shen

Zhen Guo

Yoon Kim

Polina Golland

Improving Reinforcement Learning from Human Feedback with Efficient Reward Model Ensemble

Shun Zhang

Yikang Shen

Zhiqing Sun

Chuang Gan

Structured Code Representations Enable Data-Efficient Adaptation of Code Language Models

Mayank Agarwal

Yikang Shen

Yoon Kim

Jie Chen

Gated linear attention transformers with hardware-efficient training

Yikang Shen

Yoon Kim

CoVLM: Composing Visual Entities and Relationships in Large Language Models Via Communicative Decoding

Junyan Li

Peihao Chen

Yikang Shen

Chuang Gan

Autonomous Tree-search Ability of Large Language Models

Yikang Shen

Chuang Gan

Compositional VLM: Composing Visual Entities and Relationships in Large Language Models Via Communicative Decoding

Junyan Li

Peihao Chen