Yi Ma (马毅)
University of California, Berkeley
H-index: 82
North America-United States
Top articles of Yi Ma (马毅)
Title | Journal | Author(s) | Publication Date |
---|---|---|---|
Eyes wide shut? exploring the visual shortcomings of multimodal llms | arXiv preprint arXiv:2401.06209 | Shengbang Tong Zhuang Liu Yuexiang Zhai Yi Ma Yann LeCun | 2024/1/11 |
A Trajectory Perspective on the Role of Data Sampling Techniques in Offline Reinforcement Learning | Jinyi Liu Yi Ma Jianye Hao Yujing Hu Yan Zheng | 2024/5/6 | |
Investigating the Catastrophic Forgetting in Multimodal Large Language Model Fine-Tuning | Yuexiang Zhai Shengbang Tong Xiao Li Mu Cai Qing Qu | 2024/1/8 | |
Reining Generalization in Offline Reinforcement Learning via Representation Distinction | Advances in Neural Information Processing Systems | Yi Ma Hongyao Tang Dong Li Zhaopeng Meng | 2024/2/13 |
System and method for extracting planar surface from depth image | 2024/1/2 | ||
White-box transformers via sparse rate reduction | Advances in Neural Information Processing Systems | Yaodong Yu Sam Buchanan Druv Pai Tianzhe Chu Ziyang Wu | 2024/2/13 |
Cal-ql: Calibrated offline rl pre-training for efficient online fine-tuning | Neural Information Processing Systems (NeurIPS) | Mitsuhiko Nakamoto Yuexiang Zhai Anikait Singh Max Sobol Mark Yi Ma | 2023/3/9 |
Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback | arXiv preprint arXiv:2402.02423 | Yifu Yuan Jianye Hao Yi Ma Zibin Dong Hebin Liang | 2024/2/4 |
ENOTO: Improving Offline-to-Online Reinforcement Learning with Q-Ensembles | Kai Zhao Jianye Hao Yi Ma Jinyi Liu Yan Zheng | 2024/5/6 | |
Emp-ssl: Towards self-supervised learning in one training epoch | arXiv preprint arXiv:2304.03977 | Shengbang Tong Yubei Chen Yi Ma Yann Lecun | 2023/4/8 |
Improving Offline-to-Online Reinforcement Learning with Q-Ensembles | Kai Zhao Yi Ma Jinyi Liu HAO Jianye Yan Zheng | 2023/7/9 | |
Ensemble-based offline-to-online reinforcement learning: From pessimistic learning to optimistic exploration | arXiv preprint arXiv:2306.06871 | Kai Zhao Yi Ma Jinyi Liu Yan Zheng Zhaopeng Meng | 2023/6/12 |
Iteratively Refined Behavior Regularization for Offline Reinforcement Learning | Xiaohan Hu Yi Ma Chenjun Xiao Yan Zheng HAO Jianye | 2023/10/13 | |
General in-hand object rotation with vision and touch | Conference on Robot Learning (CoRL) | Haozhi Qi Brent Yi Sudharshan Suresh Mike Lambeta Yi Ma | 2023/9/18 |
In-hand object rotation via rapid motor adaptation | Haozhi Qi Ashish Kumar Roberto Calandra Yi Ma Jitendra Malik | 2022/12 | |
A Policy-Decoupled Method for High-Quality Data Augmentation in Offline Reinforcement Learning | Shixi Lian Yi Ma Jinyi Liu HAO Jianye Yan Zheng | 2023/7/9 | |
HIPODE: Enhancing Offline Reinforcement Learning with High-Quality Synthetic Data from a Policy-Decoupled Approach | arXiv preprint arXiv:2306.06329 | Shixi Lian Yi Ma Jinyi Liu Yan Zheng Zhaopeng Meng | 2023/6/10 |
Rethinking Decision Transformer via Hierarchical Reinforcement Learning | Yi Ma Chenjun Xiao Hebin Liang HAO Jianye | 2023/10/13 | |
Masked Completion via Structured Diffusion with White-Box Transformers | Druv Pai Ziyang Wu Sam Buchanan Tianzhe Chu Yaodong Yu | 2023/12/1 | |
Closed-loop transcription via convolutional sparse coding | arXiv preprint arXiv:2302.09347 | Xili Dai Ke Chen Shengbang Tong Jingyuan Zhang Xingjian Gao | 2023/2/18 |