Yikang Shen
Université de Montréal
H-index: 16
North America-Canada
Top articles of Yikang Shen
JetMoE: Reaching Llama2 Performance with 0.1 M Dollars
arXiv preprint arXiv:2404.07413
2024/4/11
Dense Training, Sparse Inference: Rethinking Training of Mixture-of-Experts Language Models
arXiv preprint arXiv:2404.05567
2024/4/8
Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision
arXiv preprint arXiv:2403.09472
2024/3/14
Scattered Mixture-of-Experts Implementation
arXiv preprint arXiv:2403.08245
2024/3/13
Visual Chain-of-Thought Prompting for Knowledge-based Visual Reasoning
2024/2/18
API Pack: A Massive Multilingual Dataset for API Call Generation
arXiv preprint arXiv:2402.09615
2024/2/14
Adaptive Online Replanning with Diffusion Models
NeurIPS
2023/10/14
Principle-driven self-alignment of language models from scratch with minimal human supervision
Advances in Neural Information Processing Systems
2024/2/13
Diversity Measurement and Subset Selection for Instruction Tuning Datasets
arXiv preprint arXiv:2402.02318
2024/2/4
Peiqi Wang
H-Index: 5
Yikang Shen
H-Index: 8
Zhen Guo
H-Index: 2
Yoon Kim
H-Index: 11
Polina Golland
H-Index: 32
Improving Reinforcement Learning from Human Feedback with Efficient Reward Model Ensemble
arXiv preprint arXiv:2401.16635
2024/1/30
Structured Code Representations Enable Data-Efficient Adaptation of Code Language Models
arXiv preprint arXiv:2401.10716
2024/1/19
Gated linear attention transformers with hardware-efficient training
arXiv preprint arXiv:2312.06635
2023/12/11
Yikang Shen
H-Index: 8
Yoon Kim
H-Index: 11
CoVLM: Composing Visual Entities and Relationships in Large Language Models Via Communicative Decoding
arXiv preprint arXiv:2311.03354
2023/11/6
Autonomous Tree-search Ability of Large Language Models
arXiv preprint arXiv:2310.10686
2023/10/14
Yikang Shen
H-Index: 8
Chuang Gan
H-Index: 37
Compositional VLM: Composing Visual Entities and Relationships in Large Language Models Via Communicative Decoding
2023/10/13
The consensus game: Language model generation via equilibrium search
ICLR
2024
Sparse universal transformer
arXiv preprint arXiv:2310.07096
2023/10/11
Salmon: Self-alignment with principle-following reward models
International Conference on Learning Representations (ICLR)
2024/1/16
Graphtext: Graph reasoning in text space
arXiv preprint arXiv:2310.01089
2023/10/2
Aligning large multimodal models with factually augmented rlhf
arXiv preprint arXiv:2309.14525
2023/9/25