Yi Wu
Tsinghua University
H-index: 24
Asia-China
Top articles of Yi Wu
Title | Journal | Author(s) | Publication Date |
---|---|---|---|
OmniDrones: An Efficient and Flexible Platform for Reinforcement Learning in Drone Control | IEEE Robotics and Automation Letters | Botian Xu Feng Gao Chao Yu Ruize Zhang Yi Wu | 2024/1/19 |
Accelerate Multi-Agent Reinforcement Learning in Zero-Sum Games with Subgame Curriculum Learning | Proceedings of the AAAI Conference on Artificial Intelligence | Jiayu Chen Zelai Xu Yunfei Li Chao Yu Jiaming Song | 2024/3/24 |
Iteratively learn diverse strategies with state distance information | Advances in Neural Information Processing Systems | Wei Fu Weihua Du Jingwei Li Sunli Chen Jingzhao Zhang | 2024/2/13 |
Learning zero-shot cooperation with humans, assuming humans are biased | arXiv preprint arXiv:2302.01605 | Chao Yu Jiaxuan Gao Weilin Liu Botian Xu Hao Tang | 2023/2/3 |
Automatic truss design with reinforcement learning | arXiv preprint arXiv:2306.15182 | Weihua Du Jinglun Zhao Chao Yu Xingcheng Yao Zimeng Song | 2023/6/27 |
Bitnet: Scaling 1-bit transformers for large language models | arXiv preprint arXiv:2310.11453 | Hongyu Wang Shuming Ma Li Dong Shaohan Huang Huaijie Wang | 2023/10/17 |
Asynchronous multi-agent reinforcement learning for efficient real-time multi-robot cooperative exploration | arXiv preprint arXiv:2301.03398 | Chao Yu Xinyi Yang Jiaxuan Gao Jiayu Chen Yunfei Li | 2023/1/9 |
Maximum entropy population-based training for zero-shot human-ai coordination | Proceedings of the AAAI Conference on Artificial Intelligence | Rui Zhao Jinming Song Yufeng Yuan Haifeng Hu Yang Gao | 2023/6/26 |
Fictitious cross-play: Learning global nash equilibrium in mixed cooperative-competitive games | Zelai Xu Yancheng Liang Chao Yu Yu Wang Yi Wu | 2023/5/30 | |
Llm-powered hierarchical language agent for real-time human-ai coordination | AAMAS 2024 | Jijia Liu* Chao Yu* Jiaxuan Gao* Yuqing Xie Qingmin Liao | 2023/12/23 |
Efficient bimanual handover and rearrangement via symmetry-aware actor-critic learning | Yunfei Li Chaoyi Pan Huazhe Xu Xiaolong Wang Yi Wu | 2023/5/29 | |
AlphaSnake: policy iteration on a nondeterministic NP-hard Markov decision process (student abstract) | Proceedings of the AAAI Conference on Artificial Intelligence | Kevin Du Ian Gemp Yi Wu Yingying Wu | 2023/9/6 |
Learning Agile Bipedal Motions on a Quadrupedal Robot | arXiv preprint arXiv:2311.05818 | Yunfei Li Jinhan Li Wei Fu Yi Wu | 2023/11/10 |
Grounding object relations in language-conditioned robotic manipulation with semantic-spatial reasoning | arXiv preprint arXiv:2303.17919 | Qian Luo Yunfei Li Yi Wu | 2023/3/31 |
Quarl: A learning-based quantum circuit optimizer | arXiv preprint arXiv:2307.10120 | Zikun Li Jinjun Peng Yixuan Mei Sina Lin Yi Wu | 2023/7/17 |
Language agents with reinforcement learning for strategic play in the werewolf game | arXiv preprint arXiv:2310.18940 | Zelai Xu Chao Yu Fei Fang Yu Wang Yi Wu | 2023/10/29 |
Differentiable Arbitrating in Zero-sum Markov Games | Jing Wang Meichen Song Feng Gao Boyi Liu Zhaoran Wang | 2023/2/20 | |
SRL: Scaling Distributed Reinforcement Learning to Over Ten Thousand Cores | arXiv preprint arXiv:2306.16688 | Zhiyu Mei Wei Fu Guangju Wang Huanchen Zhang Yi Wu | 2023/6/29 |
LAGOON: Language-Guided Motion Control | Shusheng Xu Huaijie Wang Yutao Ouyang Jiaxuan Gao Zhiyu Mei | 2023/10/21 | |
PhyloTransformer: A Self-supervised Discriminative Model for SARS-CoV-2 Viral Mutation Prediction Based on a Multi-head Self-attention Mechanism | Yingying Wu Shusheng Xu Shing-Tung Yau Yi Wu | 2022 |