Jianfei Chen
Tsinghua University
H-index: 22
Asia-China
Top articles of Jianfei Chen
Title | Journal | Author(s) | Publication Date |
---|---|---|---|
Memory efficient optimizers with 4-bit states | Advances in Neural Information Processing Systems | Bingrui Li Jianfei Chen Jun Zhu | 2024/2/13 |
Accelerating Transformer Pre-Training with 2: 4 Sparsity | arXiv preprint arXiv:2404.01847 | Yuezhou Hu Kang Zhao Weiyu Huang Jianfei Chen Jun Zhu | 2024/4/2 |
Jetfire: Efficient and Accurate Transformer Pretraining with INT8 Data Flow and Per-Block Quantization | arXiv preprint arXiv:2403.12422 | Haocheng Xi Yuxiang Chen Kang Zhao Kaijun Zheng Jianfei Chen | 2024/3/19 |
Efficient Backpropagation with Variance-Controlled Adaptive Sampling | arXiv preprint arXiv:2402.17227 | Ziteng Wang Jianfei Chen Jun Zhu | 2024/2/27 |
C-GAIL: Stabilizing Generative Adversarial Imitation Learning with Control Theory | arXiv preprint arXiv:2402.16349 | Tianjiao Luo Tim Pearce Huayu Chen Jianfei Chen Jun Zhu | 2024/2/26 |
Dpm-solver-v3: Improved diffusion ode solver with empirical model statistics | Advances in Neural Information Processing Systems | Kaiwen Zheng Cheng Lu Jianfei Chen Jun Zhu | 2024/2/13 |
Contrastive energy prediction for exact energy-guided diffusion sampling in offline reinforcement learning | arXiv preprint arXiv:2304.12824 | Cheng Lu Huayu Chen Jianfei Chen Hang Su Chongxuan Li | 2023/4/25 |
Preserving pre-trained features helps calibrate fine-tuned language models | arXiv preprint arXiv:2305.19249 | Guande He Jianfei Chen Jun Zhu | 2023/5/30 |
Training transformers with 4-bit integers | Advances in Neural Information Processing Systems | Haocheng Xi Changhao Li Jianfei Chen Jun Zhu | 2023/12/15 |
Parameter-efficient fine-tuning of large-scale pre-trained language models | Nature Machine Intelligence | Ning Ding Yujia Qin Guang Yang Fuchao Wei Zonghan Yang | 2023/3 |
Investigating uncertainty calibration of aligned language models under the multiple-choice setting | arXiv preprint arXiv:2310.11732 | Guande He Peng Cui Jianfei Chen Wenbo Hu Jun Zhu | 2023/10/18 |
Improved techniques for maximum likelihood estimation for diffusion odes | Kaiwen Zheng Cheng Lu Jianfei Chen Jun Zhu | 2023/7/3 | |
Stabilizing gans’ training with brownian motion controller | Tianjiao Luo Ziyu Zhu Jianfei Chen Jun Zhu | 2023/7/3 | |
Fast lossless neural compression with integer-only discrete flows | Siyu Wang Jianfei Chen Chongxuan Li Jun Zhu Bo Zhang | 2022/6/28 | |
Maximum likelihood training for score-based diffusion odes by high order denoising score matching | Cheng Lu Kaiwen Zheng Fan Bao Jianfei Chen Chongxuan Li | 2022/6/28 | |
Gact: Activation compressed training for generic network architectures | Xiaoxuan Liu Lianmin Zheng Dequan Wang Yukuo Cen Weize Chen | 2022/6/28 | |
Deep ensemble as a Gaussian process approximate posterior | arXiv preprint arXiv:2205.00163 | Zhijie Deng Feng Zhou Jianfei Chen Guoqiang Wu Jun Zhu | 2022/4/30 |
Dpm-solver: A fast ode solver for diffusion probabilistic model sampling in around 10 steps | Advances in Neural Information Processing Systems | Cheng Lu Yuhao Zhou Fan Bao Jianfei Chen Chongxuan Li | 2022/12/6 |
Delta tuning: A comprehensive study of parameter efficient methods for pre-trained language models | arXiv preprint arXiv:2203.06904 | Ning Ding Yujia Qin Guang Yang Fuchao Wei Zonghan Yang | 2022/3/14 |
Dpm-solver++: Fast solver for guided sampling of diffusion probabilistic models | arXiv preprint arXiv:2211.01095 | Cheng Lu Yuhao Zhou Fan Bao Jianfei Chen Chongxuan Li | 2022/11/2 |