Jingzhao Zhang
Massachusetts Institute of Technology
H-index: 15
North America-United States
Top articles of Jingzhao Zhang
Title | Journal | Author(s) | Publication Date |
---|---|---|---|
On the overlooked pitfalls of weight decay and how to mitigate them: A gradient-norm perspective | Advances in Neural Information Processing Systems | Zeke Xie Zhiqiang Xu Jingzhao Zhang Issei Sato Masashi Sugiyama | 2024/2/13 |
Benign Overfitting in Classification: Provably Counter Label Noise with Larger Models | Kaiyue Wen Jiaye Teng Jingzhao Zhang | 2023 | |
Realistic fault detection of li-ion battery via dynamical deep learning | Nature Communications | Jingzhao Zhang Yanan Wang Benben Jiang Haowei He Shaobo Huang | 2023/9/23 |
Two Phases of Scaling Laws for Nearest Neighbor Classifiers | arXiv preprint arXiv:2308.08247 | Pengkun Yang Jingzhao Zhang | 2023/8/16 |
Near-optimal fully first-order algorithms for finding stationary points in bilevel optimization | arXiv preprint arXiv:2306.14853 | Lesi Chen Yaohua Ma Jingzhao Zhang | 2023/6/26 |
Sion’s minimax theorem in geodesic metric spaces and a Riemannian extragradient algorithm | SIAM Journal on Optimization | Peiyuan Zhang Jingzhao Zhang Suvrit Sra | 2023/12/31 |
Fast Conditional Mixing of MCMC Algorithms for Non-log-concave Distributions | Advances in Neural Information Processing Systems | Xiang Cheng Bohan Wang Jingzhao Zhang Yusong Zhu | 2024/2/13 |
Iteratively Learn Diverse Strategies with State Distance Information | Advances in Neural Information Processing Systems | Wei Fu Weihua Du Jingwei Li Sunli Chen Jingzhao Zhang | 2024/2/13 |
Lower Generalization Bounds for GD and SGD in Smooth Stochastic Convex Optimization | arXiv preprint arXiv:2303.10758 | Peiyuan Zhang Jiaye Teng Jingzhao Zhang | 2023/3/19 |
A Quadratic Synchronization Rule for Distributed Deep Learning | arXiv preprint arXiv:2310.14423 | Xinran Gu Kaifeng Lyu Sanjeev Arora Jingzhao Zhang Longbo Huang | 2023/10/22 |
Efficient sampling on Riemannian manifolds via Langevin MCMC | Advances in Neural Information Processing Systems | Xiang Cheng Jingzhao Zhang Suvrit Sra | 2022/12/6 |
Online policy optimization for robust MDP | arXiv preprint arXiv:2209.13841 | Jing Dong Jingwei Li Baoxiang Wang Jingzhao Zhang | 2022/9/28 |
Understanding the unstable convergence of gradient descent | ICML 2022 (arXiv:2204.01050) | Kwangjun Ahn Jingzhao Zhang Suvrit Sra | 2022/4/3 |
Optimization Theory and Machine Learning Practice: Mind the Gap | Jingzhao Zhang | 2022 | |
Complexity lower bounds for nonconvex-strongly-concave min-max optimization | Advances in Neural Information Processing Systems | Haochuan Li Yi Tian Jingzhao Zhang Ali Jadbabaie | 2021/12/6 |
Neural Network Weights Do Not Converge to Stationary Points: An Invariant Measure Perspective | Jingzhao Zhang Haochuan Li Suvrit Sra Ali Jadbabaie | 2022/6/28 | |
Provably efficient algorithms for multi-objective competitive rl | Tiancheng Yu Yi Tian Jingzhao Zhang Suvrit Sra | 2021/7/1 | |
Fast federated learning in the presence of arbitrary device unavailability | Advances in Neural Information Processing Systems | Xinran Gu Kaixuan Huang Jingzhao Zhang Longbo Huang | 2021/12/6 |
Beyond Worst-Case Analysis in Stochastic Approximation: Moment Estimation Improves Instance Complexity | Jingzhao Zhang Hongzhou Lin Subhro Das Suvrit Sra Ali Jadbabaie | 2022/6/28 | |
Why are adaptive methods good for attention models? | NeurIPS 2020 - Conference on Neural Information Processing Systems | Jingzhao Zhang Sai Praneeth Karimireddy Andreas Veit Seungyeon Kim Sashank J Reddi | 2019/12/6 |