Zongqing Lu
Peking University
H-index: 28
Asia-China
Top articles of Zongqing Lu
Title | Journal | Author(s) | Publication Date |
---|---|---|---|
Understanding What Affects Generalization Gap in Visual Reinforcement Learning: Theory and Empirical Evidence | arXiv preprint arXiv:2402.02701 | Jiafei Lyu Le Wan Xiu Li Zongqing Lu | 2024/2/5 |
Towards general computer control: A multimodal agent for red dead redemption ii as a case study | arXiv preprint arXiv:2403.03186 | Weihao Tan Ziluo Ding Wentao Zhang Boyu Li Bohan Zhou | 2024/3/5 |
Settling Decentralized Multi-Agent Coordinated Exploration by Novelty Sharing | arXiv preprint arXiv:2402.02097 | Haobin Jiang Ziluo Ding Zongqing Lu | 2024/2/3 |
LLaMA-Rider: Spurring Large Language Models to Explore the Open World | arXiv preprint arXiv:2310.08922 | Yicheng Feng Yuxuan Wang Jiazheng Liu Sipeng Zheng Zongqing Lu | 2023/10/13 |
Towards Understanding How to Reduce Generalization Gap in Visual Reinforcement Learning | Jiafei Lyu Le Wan Xiu Li Zongqing Lu | 2024/5/6 | |
Learning Multi-Object Positional Relationships via Emergent Communication | Yicheng Feng Boshi An Zongqing Lu | 2024/2 | |
RL-GPT: Integrating Reinforcement Learning and Code-as-policy | arXiv preprint arXiv:2402.19299 | Shaoteng Liu Haoqi Yuan Minda Hu Yanwei Li Yukang Chen | 2024/2/29 |
Off-policy RL algorithms can be sample-efficient for continuous control via sample multiple reuse | Information Sciences | Jiafei Lyu Le Wan Xiu Li Zongqing Lu | 2024/5/1 |
Fully Decentralized Cooperative Multi-Agent Reinforcement Learning: A Survey | arXiv preprint arXiv:2401.04934 | Jiechuan Jiang Kefan Su Zongqing Lu | 2024/1/10 |
SEABO: A Simple Search-Based Method for Offline Imitation Learning | Jiafei Lyu Xiaoteng Ma Le Wan Runze Liu Xiu Li | 2024/2/6 | |
MTLight: Efficient multi-task reinforcement learning for traffic signal control | arXiv preprint arXiv:2404.00886 | Liwen Zhu Peixi Peng Zongqing Lu Yonghong Tian | 2024/4/1 |
Multi-Agent Alternate Q-Learning | Kefan Su Siyuan Zhou Jiechuan Jiang Chuang Gan Xiangjun Wang | 2024/5/6 | |
UniCode: Learning a Unified Codebook for Multimodal Large Language Models | arXiv preprint arXiv:2403.09072 | Sipeng Zheng Bohan Zhou Yicheng Feng Ye Wang Zongqing Lu | 2024/3/14 |
Steve-Eye: Equipping LLM-based Embodied Agents with Visual Perception in Open Worlds | Sipeng Zheng Yicheng Feng Zongqing Lu | 2023/10/13 | |
Adaptive Learning Rates for Multi-Agent Reinforcement Learning | Jiechuan Jiang Zongqing Lu | 2020/10/2 | |
State Advantage Weighting for Offline RL | arXiv preprint arXiv:2210.04251 | Jiafei Lyu Aicheng Gong Le Wan Zongqing Lu Xiu Li | 2022/10/9 |
Entity Divider with Language Grounding in Multi-Agent Reinforcement Learning | Ziluo Ding Wanpeng Zhang Junpeng Yue Xiangjun Wang Tiejun Huang | 2023/7/3 | |
Tackling Non-Stationarity in Reinforcement Learning via Causal-Origin Representation | arXiv preprint arXiv:2306.02747 | Wanpeng Zhang Yilin Li Boyu Yang Zongqing Lu | 2023/6/5 |
Bi-DexHands: Towards Human-Level Bimanual Dexterous Manipulation | IEEE Transactions on Pattern Analysis and Machine Intelligence | Yuanpei Chen Yiran Geng Fangwei Zhong Jiaming Ji Jiechuang Jiang | 2023/12/5 |
A survey on transformers in reinforcement learning | arXiv preprint arXiv:2301.03044 | Wenzhe Li Hao Luo Zichuan Lin Chongjie Zhang Zongqing Lu | 2023/1/8 |