Yikang Shen

Yikang Shen

Université de Montréal

H-index: 16

North America-Canada

About Yikang Shen

Yikang Shen, With an exceptional h-index of 16 and a recent h-index of 16 (since 2020), a distinguished researcher at Université de Montréal, specializes in the field of Deep Learning, Natural Language Processing.

His recent articles reflect a diverse array of research interests and contributions to the field:

JetMoE: Reaching Llama2 Performance with 0.1 M Dollars

Dense Training, Sparse Inference: Rethinking Training of Mixture-of-Experts Language Models

Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision

Scattered Mixture-of-Experts Implementation

Visual Chain-of-Thought Prompting for Knowledge-based Visual Reasoning

API Pack: A Massive Multilingual Dataset for API Call Generation

Adaptive Online Replanning with Diffusion Models

Principle-driven self-alignment of language models from scratch with minimal human supervision

Yikang Shen Information

University

Position

Mila

Citations(all)

1849

Citations(since 2020)

1763

Cited By

558

hIndex(all)

16

hIndex(since 2020)

16

i10Index(all)

19

i10Index(since 2020)

19

Email

University Profile Page

Google Scholar

Yikang Shen Skills & Research Interests

Deep Learning

Natural Language Processing

Top articles of Yikang Shen

JetMoE: Reaching Llama2 Performance with 0.1 M Dollars

arXiv preprint arXiv:2404.07413

2024/4/11

Dense Training, Sparse Inference: Rethinking Training of Mixture-of-Experts Language Models

arXiv preprint arXiv:2404.05567

2024/4/8

Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision

arXiv preprint arXiv:2403.09472

2024/3/14

Scattered Mixture-of-Experts Implementation

arXiv preprint arXiv:2403.08245

2024/3/13

Visual Chain-of-Thought Prompting for Knowledge-based Visual Reasoning

2024/2/18

API Pack: A Massive Multilingual Dataset for API Call Generation

arXiv preprint arXiv:2402.09615

2024/2/14

Adaptive Online Replanning with Diffusion Models

NeurIPS

2023/10/14

Principle-driven self-alignment of language models from scratch with minimal human supervision

Advances in Neural Information Processing Systems

2024/2/13

Diversity Measurement and Subset Selection for Instruction Tuning Datasets

arXiv preprint arXiv:2402.02318

2024/2/4

Improving Reinforcement Learning from Human Feedback with Efficient Reward Model Ensemble

arXiv preprint arXiv:2401.16635

2024/1/30

Structured Code Representations Enable Data-Efficient Adaptation of Code Language Models

arXiv preprint arXiv:2401.10716

2024/1/19

Gated linear attention transformers with hardware-efficient training

arXiv preprint arXiv:2312.06635

2023/12/11

Yikang Shen
Yikang Shen

H-Index: 8

Yoon Kim
Yoon Kim

H-Index: 11

CoVLM: Composing Visual Entities and Relationships in Large Language Models Via Communicative Decoding

arXiv preprint arXiv:2311.03354

2023/11/6

Autonomous Tree-search Ability of Large Language Models

arXiv preprint arXiv:2310.10686

2023/10/14

Yikang Shen
Yikang Shen

H-Index: 8

Chuang Gan
Chuang Gan

H-Index: 37

Compositional VLM: Composing Visual Entities and Relationships in Large Language Models Via Communicative Decoding

2023/10/13

The consensus game: Language model generation via equilibrium search

ICLR

2024

Sparse universal transformer

arXiv preprint arXiv:2310.07096

2023/10/11

Salmon: Self-alignment with principle-following reward models

International Conference on Learning Representations (ICLR)

2024/1/16

Graphtext: Graph reasoning in text space

arXiv preprint arXiv:2310.01089

2023/10/2

Aligning large multimodal models with factually augmented rlhf

arXiv preprint arXiv:2309.14525

2023/9/25

See List of Professors in Yikang Shen University(Université de Montréal)

Co-Authors

academic-engine