
Yang Yu

Nanjing University

H-index: 39

Asia-China

About Yang Yu

Yang Yu is a researcher at Nanjing University with an h-index of 39 overall and 33 since 2020. He specializes in Artificial Intelligence, Reinforcement Learning, and Evolutionary Algorithms.

His recent articles include:

Communication-robust multi-agent learning by adaptable auxiliary multi-agent adversary generation

Natural Language Instruction-following with Task-related Language Development and Translation

Linda: Multi-agent local information decomposition for awareness of teammates

Remax: A simple, effective, and efficient method for aligning large language models

Policy Optimization in RLHF: The Impact of Out-of-preference Data

Self-Motivated Multi-Agent Exploration

Generalizable Task Representation Learning for Offline Meta-Reinforcement Learning with Data Limitations

Robust Multi-agent Communication via Multi-view Message Certification

Yang Yu Information

University	Nanjing University
Position	Professor
Citations(all)	5418
Citations(since 2020)	4322
Cited By	2256
hIndex(all)	39
hIndex(since 2020)	33
i10Index(all)	92
i10Index(since 2020)	83

Yang Yu Skills & Research Interests

Artificial Intelligence

Reinforcement Learning

Evolutionary Algorithms

Top articles of Yang Yu

Title	Venue	Author(s)	Publication Date
Communication-robust multi-agent learning by adaptable auxiliary multi-agent adversary generation	Frontiers of Computer Science	Lei Yuan Feng Chen Zongzhang Zhang Yang Yu	2024/12
Natural Language Instruction-following with Task-related Language Development and Translation	Advances in Neural Information Processing Systems	Jing-Cheng Pang Xin-Yu Yang Si-Hang Yang Xiong-Hui Chen Yang Yu	2024/2/13
Linda: Multi-agent local information decomposition for awareness of teammates	Science China Information Sciences	Jiahan Cao Lei Yuan Jianhao Wang Shaowei Zhang Chongjie Zhang ...	2023/8
Remax: A simple, effective, and efficient method for aligning large language models	arXiv preprint arXiv:2310.10505	Ziniu Li Tian Xu Yushun Zhang Yang Yu Ruoyu Sun ...	2023/10/16
Policy Optimization in RLHF: The Impact of Out-of-preference Data	arXiv preprint arXiv:2312.10584	Ziniu Li Tian Xu Yang Yu	2023/12/17
Self-Motivated Multi-Agent Exploration	arXiv preprint arXiv:2301.02083	Shaowei Zhang Jiahan Cao Lei Yuan Yang Yu De-Chuan Zhan	2023/1/5
Generalizable Task Representation Learning for Offline Meta-Reinforcement Learning with Data Limitations	arXiv preprint arXiv:2312.15909	Renzhe Zhou Chen-Xiao Gao Zongzhang Zhang Yang Yu	2023/12/26
Robust Multi-agent Communication via Multi-view Message Certification	arXiv preprint arXiv:2305.13936	Lei Yuan Tao Jiang Lihe Li Feng Chen Zongzhang Zhang ...	2023/5/7
Provably Efficient Adversarial Imitation Learning with Unknown Transitions	arXiv preprint arXiv:2306.06563	Tian Xu Ziniu Li Yang Yu Zhi-Quan Luo	2023/6/11
Learning Physically Realizable Skills for Online Packing of General 3D Shapes	ACM Transactions on Graphics	Hang Zhao Zherong Pan Yang Yu Kai Xu	2023/7/28
Imitator Learning: Achieve Out-of-the-Box Imitation Ability in Variable Environments	arXiv preprint arXiv:2310.05712	Xiong-Hui Chen Junyin Ye Hang Zhao Yi-Chen Li Haoran Shi ...	2023/10/9
UDCA may promote COVID-19 recovery: a cohort study with AI-aided analysis	MedRxiv	Yang Yu Guo Yu Lu-Yao Han Jian Li Zhi-Long Zhang ...	2023
Sim2Rec: A Simulator-based Decision-making Approach to Optimize Real-World Long-term User Engagement in Sequential Recommender Systems	arXiv preprint arXiv:2305.04832	Xiong-Hui Chen Bowei He Yang Yu Qingyang Li Zhiwei Qin ...	2023/5/3
Learning World Models with Identifiable Factorization	arXiv preprint arXiv:2306.06561	Yu-Ren Liu Biwei Huang Zhengmao Zhu Honglong Tian Mingming Gong ...	2023/6/11
Mixlight: Mixed-agent cooperative reinforcement learning for traffic light control	IEEE Transactions on Industrial Informatics	Ming Yang Yiming Wang Yang Yu Mingliang Zhou	2023/7/27
Learning to Coordinate with Anyone	arXiv preprint arXiv:2309.12633	Lei Yuan Lihe Li Ziqian Zhang Feng Chen Tianyi Zhang ...	2023/9/22
A Survey of Progress on Cooperative Multi-agent Reinforcement Learning in Open Environment		Lei Yuan Ziqian Zhang Lihe Li Cong Guan Yang Yu	2023/12/2
Multi-Task Multi-Agent Shared Layers are Universal Cognition of Multi-Agent Coordination	arXiv preprint arXiv:2312.15674	Jiawei Wang Jian Zhao Zhengtao Cao Ruili Feng Rongjun Qin ...	2023/12/25
How To Guide Your Learner: Imitation Learning with Active Adaptive Expert Involvement	arXiv preprint arXiv:2303.02073	Xu-Hui Liu Feng Xu Xinyu Zhang Tianyuan Liu Shengyi Jiang ...	2023/3/3
Language Model Self-improvement by Reinforcement Learning Contemplation	arXiv preprint arXiv:2305.14483	Jing-Cheng Pang Pengyuan Wang Kaiyuan Li Xiong-Hui Chen Jiacheng Xu ...	2023/5/23