Yang Yu

Nanjing University

H-index: 39

Asia-China

About Yang Yu

Yang Yu is a professor at Nanjing University with an h-index of 39 overall and 33 since 2020. He specializes in artificial intelligence, reinforcement learning, and evolutionary algorithms.

His recent articles include:

Communication-robust multi-agent learning by adaptable auxiliary multi-agent adversary generation

Natural Language Instruction-following with Task-related Language Development and Translation

Linda: Multi-agent local information decomposition for awareness of teammates

Remax: A simple, effective, and efficient method for aligning large language models

Policy Optimization in RLHF: The Impact of Out-of-preference Data

Self-Motivated Multi-Agent Exploration

Generalizable Task Representation Learning for Offline Meta-Reinforcement Learning with Data Limitations

Robust Multi-agent Communication via Multi-view Message Certification

Yang Yu Information

University: Nanjing University
Position: Professor
Citations (all): 5418
Citations (since 2020): 4322
Cited by: 2256
h-index (all): 39
h-index (since 2020): 33
i10-index (all): 92
i10-index (since 2020): 83

Yang Yu Skills & Research Interests

Artificial Intelligence

Reinforcement Learning

Evolutionary Algorithms

Top articles of Yang Yu

Title: Communication-robust multi-agent learning by adaptable auxiliary multi-agent adversary generation
Journal: Frontiers of Computer Science
Author(s): Lei Yuan, Feng Chen, Zongzhang Zhang, Yang Yu
Publication Date: 2024/12

Title: Natural Language Instruction-following with Task-related Language Development and Translation
Journal: Advances in Neural Information Processing Systems
Author(s): Jing-Cheng Pang, Xin-Yu Yang, Si-Hang Yang, Xiong-Hui Chen, Yang Yu
Publication Date: 2024/2/13

Title: Linda: Multi-agent local information decomposition for awareness of teammates
Journal: Science China Information Sciences
Author(s): Jiahan Cao, Lei Yuan, Jianhao Wang, Shaowei Zhang, Chongjie Zhang, ...
Publication Date: 2023/8

Title: Remax: A simple, effective, and efficient method for aligning large language models
Journal: arXiv preprint arXiv:2310.10505
Author(s): Ziniu Li, Tian Xu, Yushun Zhang, Yang Yu, Ruoyu Sun, ...
Publication Date: 2023/10/16

Title: Policy Optimization in RLHF: The Impact of Out-of-preference Data
Journal: arXiv preprint arXiv:2312.10584
Author(s): Ziniu Li, Tian Xu, Yang Yu
Publication Date: 2023/12/17

Title: Self-Motivated Multi-Agent Exploration
Journal: arXiv preprint arXiv:2301.02083
Author(s): Shaowei Zhang, Jiahan Cao, Lei Yuan, Yang Yu, De-Chuan Zhan
Publication Date: 2023/1/5

Title: Generalizable Task Representation Learning for Offline Meta-Reinforcement Learning with Data Limitations
Journal: arXiv preprint arXiv:2312.15909
Author(s): Renzhe Zhou, Chen-Xiao Gao, Zongzhang Zhang, Yang Yu
Publication Date: 2023/12/26

Title: Robust Multi-agent Communication via Multi-view Message Certification
Journal: arXiv preprint arXiv:2305.13936
Author(s): Lei Yuan, Tao Jiang, Lihe Li, Feng Chen, Zongzhang Zhang, ...
Publication Date: 2023/5/7

Title: Provably Efficient Adversarial Imitation Learning with Unknown Transitions
Journal: arXiv preprint arXiv:2306.06563
Author(s): Tian Xu, Ziniu Li, Yang Yu, Zhi-Quan Luo
Publication Date: 2023/6/11

Title: Learning Physically Realizable Skills for Online Packing of General 3D Shapes
Journal: ACM Transactions on Graphics
Author(s): Hang Zhao, Zherong Pan, Yang Yu, Kai Xu
Publication Date: 2023/7/28

Title: Imitator Learning: Achieve Out-of-the-Box Imitation Ability in Variable Environments
Journal: arXiv preprint arXiv:2310.05712
Author(s): Xiong-Hui Chen, Junyin Ye, Hang Zhao, Yi-Chen Li, Haoran Shi, ...
Publication Date: 2023/10/9

Title: UDCA may promote COVID-19 recovery: a cohort study with AI-aided analysis
Journal: MedRxiv
Author(s): Yang Yu, Guo Yu, Lu-Yao Han, Jian Li, Zhi-Long Zhang, ...
Publication Date: 2023

Title: Sim2Rec: A Simulator-based Decision-making Approach to Optimize Real-World Long-term User Engagement in Sequential Recommender Systems
Journal: arXiv preprint arXiv:2305.04832
Author(s): Xiong-Hui Chen, Bowei He, Yang Yu, Qingyang Li, Zhiwei Qin, ...
Publication Date: 2023/5/3

Title: Learning World Models with Identifiable Factorization
Journal: arXiv preprint arXiv:2306.06561
Author(s): Yu-Ren Liu, Biwei Huang, Zhengmao Zhu, Honglong Tian, Mingming Gong, ...
Publication Date: 2023/6/11

Title: Mixlight: Mixed-agent cooperative reinforcement learning for traffic light control
Journal: IEEE Transactions on Industrial Informatics
Author(s): Ming Yang, Yiming Wang, Yang Yu, Mingliang Zhou
Publication Date: 2023/7/27

Title: Learning to Coordinate with Anyone
Journal: arXiv preprint arXiv:2309.12633
Author(s): Lei Yuan, Lihe Li, Ziqian Zhang, Feng Chen, Tianyi Zhang, ...
Publication Date: 2023/9/22

Title: A Survey of Progress on Cooperative Multi-agent Reinforcement Learning in Open Environment
Author(s): Lei Yuan, Ziqian Zhang, Lihe Li, Cong Guan, Yang Yu
Publication Date: 2023/12/2

Title: Multi-Task Multi-Agent Shared Layers are Universal Cognition of Multi-Agent Coordination
Journal: arXiv preprint arXiv:2312.15674
Author(s): Jiawei Wang, Jian Zhao, Zhengtao Cao, Ruili Feng, Rongjun Qin, ...
Publication Date: 2023/12/25

Title: How To Guide Your Learner: Imitation Learning with Active Adaptive Expert Involvement
Journal: arXiv preprint arXiv:2303.02073
Author(s): Xu-Hui Liu, Feng Xu, Xinyu Zhang, Tianyuan Liu, Shengyi Jiang, ...
Publication Date: 2023/3/3

Title: Language Model Self-improvement by Reinforcement Learning Contemplation
Journal: arXiv preprint arXiv:2305.14483
Author(s): Jing-Cheng Pang, Pengyuan Wang, Kaiyuan Li, Xiong-Hui Chen, Jiacheng Xu, ...
Publication Date: 2023/5/23

Co-Authors

Zhi-Hua Zhou
Nanjing University
H-index: 125

Xin Yao
University of Birmingham
H-index: 122

Weinan Zhang
Shanghai Jiao Tong University
H-index: 64

Yu-Feng Li
Nanjing University
H-index: 34

Chongjie Zhang
Tsinghua University
H-index: 31

De-Chuan Zhan
Nanjing University
H-index: 30
