Michael Mahoney
University of California, Berkeley
H-index: 75
North America-United States
Top articles by Michael Mahoney
Title | Journal | Author(s) | Publication Date |
---|---|---|---|
KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization | arXiv preprint arXiv:2401.18079 | Coleman Hooper Sehoon Kim Hiva Mohammadzadeh Michael W Mahoney Yakun Sophia Shao | 2024/1/31 |
AI and memory wall | IEEE Micro | Amir Gholami Zhewei Yao Sehoon Kim Coleman Hooper Michael W Mahoney | 2024/3/25 |
Temperature Balancing, Layer-wise Weight Analysis, and Neural Network Training | Advances in Neural Information Processing Systems | Yefan Zhou Tianyu Pang Keqin Liu Michael W Mahoney Yaoqing Yang | 2024/2/13 |
Comparing and contrasting deep learning weather prediction backbones on Navier-Stokes dynamics | | Matthias Karlbauer Danielle Maddix Robinson Abdul Fatir Ansari Boran Han Gaurav Gupta | 2024 |
LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement | arXiv preprint arXiv:2403.15042 | Nicholas Lee Thanakul Wattanawong Sehoon Kim Karttikeya Mangalam Sheng Shen | 2024/3/22 |
Speculative decoding with big little decoder | Advances in Neural Information Processing Systems | Sehoon Kim Karttikeya Mangalam Suhong Moon Jitendra Malik Michael W Mahoney | 2024/2/13 |
A Heavy-Tailed Algebra for Probabilistic Programming | Advances in Neural Information Processing Systems | Feynman T Liang Liam Hodgkinson Michael W Mahoney | 2024/2/13 |
Using Uncertainty Quantification to Characterize and Improve Out-of-Domain Learning for PDEs | arXiv preprint arXiv:2403.10642 | S Chandra Mouli Danielle C Maddix Shima Alizadeh Gaurav Gupta Andrew Stuart | 2024/3/15 |
Towards foundation models for scientific machine learning: Characterizing scaling and transfer behavior | Advances in Neural Information Processing Systems | Shashank Subramanian Peter Harrington Kurt Keutzer Wahid Bhimji Dmitriy Morozov | 2024/2/13 |
Chronos: Learning the language of time series | arXiv preprint arXiv:2403.07815 | Abdul Fatir Ansari Lorenzo Stella Caner Turkmen Xiyuan Zhang Pedro Mercado | 2024/3/12 |
Equation Discovery with Bayesian Spike-and-Slab Priors and Efficient Kernels | | Da Long Wei Xing Aditi Krishnapriyan Robert Kirby Shandian Zhe | 2024/4/18 |
When are ensembles really effective? | Advances in Neural Information Processing Systems | Ryan Theisen Hyunsuk Kim Yaoqing Yang Liam Hodgkinson Michael W Mahoney | 2024/2/13 |
Data-Efficient Operator Learning via Unsupervised Pretraining and In-Context Learning | arXiv preprint arXiv:2402.15734 | Wuyang Chen Jialin Song Pu Ren Shashank Subramanian Dmitriy Morozov | 2024/2/24 |
NoisyMix: Boosting model robustness to common corruptions | Proc. of the 27th International Conference on AISTATS | N Benjamin Erichson Soon Hoe Lim Winnie Xu Francisco Utrera Ziang Cao | 2024/2/2 |
Full stack optimization of transformer inference: a survey | arXiv preprint arXiv:2302.14017 | Sehoon Kim Coleman Hooper Thanakul Wattanawong Minwoo Kang Ruohan Yan | 2023/2/27 |
SqueezeLLM: Dense-and-sparse quantization | arXiv preprint arXiv:2306.07629 | Sehoon Kim* Coleman Hooper* Amir Gholami* Zhen Dong Xiuyu Li | 2023/6/13 |
Surrogate-based Autotuning for Randomized Sketching Algorithms in Regression Problems | arXiv preprint arXiv:2308.15720 | Younghyun Cho James W Demmel Michał Dereziński Haoyun Li Hengrui Luo | 2023/8/30 |
Learning continuous models for continuous physics | Communications Physics | Aditi S Krishnapriyan Alejandro F Queiruga N Benjamin Erichson Michael W Mahoney | 2023/11/3 |
Constrained optimization via exact augmented Lagrangian and randomized iterative sketching | | Ilgee Hong Sen Na Michael W Mahoney Mladen Kolar | 2023/7/3 |
DMLR: Data-centric Machine Learning Research--Past, Present and Future | arXiv preprint arXiv:2311.13028 | Luis Oala Manil Maskey Lilith Bat-Leah Alicia Parrish Nezihe Merve Gürel | 2023/11/21 |