Yang Yu
Nanjing University
H-index: 39
Asia-China
Top articles of Yang Yu
Title | Journal | Author(s) | Publication Date |
---|---|---|---|
Communication-robust multi-agent learning by adaptable auxiliary multi-agent adversary generation | Frontiers of Computer Science | Lei Yuan, Feng Chen, Zongzhang Zhang, Yang Yu | 2024/12 |
Natural Language Instruction-following with Task-related Language Development and Translation | Advances in Neural Information Processing Systems | Jing-Cheng Pang, Xin-Yu Yang, Si-Hang Yang, Xiong-Hui Chen, Yang Yu | 2024/2/13 |
Linda: Multi-agent local information decomposition for awareness of teammates | Science China Information Sciences | Jiahan Cao, Lei Yuan, Jianhao Wang, Shaowei Zhang, Chongjie Zhang | 2023/8 |
Remax: A simple, effective, and efficient method for aligning large language models | arXiv preprint arXiv:2310.10505 | Ziniu Li, Tian Xu, Yushun Zhang, Yang Yu, Ruoyu Sun | 2023/10/16 |
Policy Optimization in RLHF: The Impact of Out-of-preference Data | arXiv preprint arXiv:2312.10584 | Ziniu Li, Tian Xu, Yang Yu | 2023/12/17 |
Self-Motivated Multi-Agent Exploration | arXiv preprint arXiv:2301.02083 | Shaowei Zhang, Jiahan Cao, Lei Yuan, Yang Yu, De-Chuan Zhan | 2023/1/5 |
Generalizable Task Representation Learning for Offline Meta-Reinforcement Learning with Data Limitations | arXiv preprint arXiv:2312.15909 | Renzhe Zhou, Chen-Xiao Gao, Zongzhang Zhang, Yang Yu | 2023/12/26 |
Robust Multi-agent Communication via Multi-view Message Certification | arXiv preprint arXiv:2305.13936 | Lei Yuan, Tao Jiang, Lihe Li, Feng Chen, Zongzhang Zhang | 2023/5/7 |
Provably Efficient Adversarial Imitation Learning with Unknown Transitions | arXiv preprint arXiv:2306.06563 | Tian Xu, Ziniu Li, Yang Yu, Zhi-Quan Luo | 2023/6/11 |
Learning Physically Realizable Skills for Online Packing of General 3D Shapes | ACM Transactions on Graphics | Hang Zhao, Zherong Pan, Yang Yu, Kai Xu | 2023/7/28 |
Imitator Learning: Achieve Out-of-the-Box Imitation Ability in Variable Environments | arXiv preprint arXiv:2310.05712 | Xiong-Hui Chen, Junyin Ye, Hang Zhao, Yi-Chen Li, Haoran Shi | 2023/10/9 |
UDCA may promote COVID-19 recovery: a cohort study with AI-aided analysis | medRxiv | Yang Yu, Guo Yu, Lu-Yao Han, Jian Li, Zhi-Long Zhang | 2023 |
Sim2Rec: A Simulator-based Decision-making Approach to Optimize Real-World Long-term User Engagement in Sequential Recommender Systems | arXiv preprint arXiv:2305.04832 | Xiong-Hui Chen, Bowei He, Yang Yu, Qingyang Li, Zhiwei Qin | 2023/5/3 |
Learning World Models with Identifiable Factorization | arXiv preprint arXiv:2306.06561 | Yu-Ren Liu, Biwei Huang, Zhengmao Zhu, Honglong Tian, Mingming Gong | 2023/6/11 |
Mixlight: Mixed-agent cooperative reinforcement learning for traffic light control | IEEE Transactions on Industrial Informatics | Ming Yang, Yiming Wang, Yang Yu, Mingliang Zhou | 2023/7/27 |
Learning to Coordinate with Anyone | arXiv preprint arXiv:2309.12633 | Lei Yuan, Lihe Li, Ziqian Zhang, Feng Chen, Tianyi Zhang | 2023/9/22 |
A Survey of Progress on Cooperative Multi-agent Reinforcement Learning in Open Environment | | Lei Yuan, Ziqian Zhang, Lihe Li, Cong Guan, Yang Yu | 2023/12/2 |
Multi-Task Multi-Agent Shared Layers are Universal Cognition of Multi-Agent Coordination | arXiv preprint arXiv:2312.15674 | Jiawei Wang, Jian Zhao, Zhengtao Cao, Ruili Feng, Rongjun Qin | 2023/12/25 |
How To Guide Your Learner: Imitation Learning with Active Adaptive Expert Involvement | arXiv preprint arXiv:2303.02073 | Xu-Hui Liu, Feng Xu, Xinyu Zhang, Tianyuan Liu, Shengyi Jiang | 2023/3/3 |
Language Model Self-improvement by Reinforcement Learning Contemplation | arXiv preprint arXiv:2305.14483 | Jing-Cheng Pang, Pengyuan Wang, Kaiyuan Li, Xiong-Hui Chen, Jiacheng Xu | 2023/5/23 |