ProfessorsProfessors of University of California, BerkeleyMichael Mahoney

Michael Mahoney

University of California, Berkeley

H-index: 75

North America-United States

About Michael Mahoney

Michael Mahoney, With an exceptional h-index of 75 and a recent h-index of 66 (since 2020), a distinguished researcher at University of California, Berkeley, specializes in the field of Algorithms, Statistics, Linear Algebra, Data Analysis, Machine Learning.

His recent articles reflect a diverse array of research interests and contributions to the field:

KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization

Ai and memory wall

Temperature Balancing, Layer-wise Weight Analysis, and Neural Network Training

Comparing and contrasting deep learning weather prediction backbones on navier-stokes dynamics

LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement

Speculative decoding with big little decoder

A Heavy-Tailed Algebra for Probabilistic Programming

Using Uncertainty Quantification to Characterize and Improve Out-of-Domain Learning for PDEs

Michael Mahoney Information

University	University of California, Berkeley
Position	Professor of Statistics
Citations(all)	31012
Citations(since 2020)	17274
Cited By	19940
hIndex(all)	75
hIndex(since 2020)	66
i10Index(all)	221
i10Index(since 2020)	191
Email	Access Email
University Profile Page	University of California, Berkeley
Google Scholar	View Google Scholar Profile

Michael Mahoney Skills & Research Interests

Algorithms

Statistics

Linear Algebra

Data Analysis

Machine Learning

Top articles of Michael Mahoney

Title	Journal	Author(s)	Publication Date
KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization	arXiv preprint arXiv:2401.18079	Coleman Hooper Sehoon Kim Hiva Mohammadzadeh Michael W Mahoney Yakun Sophia Shao ...	2024/1/31
Ai and memory wall	IEEE Micro	Amir Gholami Zhewei Yao Sehoon Kim Coleman Hooper Michael W Mahoney ...	2024/3/25
Temperature Balancing, Layer-wise Weight Analysis, and Neural Network Training	Advances in Neural Information Processing Systems	Yefan Zhou Tianyu Pang Keqin Liu Michael W Mahoney Yaoqing Yang	2024/2/13
Comparing and contrasting deep learning weather prediction backbones on navier-stokes dynamics		Matthias Karlbauer Danielle Maddix Robinson Abdul Fatir Ansari Boran Han Gaurav Gupta ...	2024
LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement	arXiv preprint arXiv:2403.15042	Nicholas Lee Thanakul Wattanawong Sehoon Kim Karttikeya Mangalam Sheng Shen ...	2024/3/22
Speculative decoding with big little decoder	Advances in Neural Information Processing Systems	Sehoon Kim Karttikeya Mangalam Suhong Moon Jitendra Malik Michael W Mahoney ...	2024/2/13
A Heavy-Tailed Algebra for Probabilistic Programming	Advances in Neural Information Processing Systems	Feynman T Liang Liam Hodgkinson Michael W Mahoney	2024/2/13
Using Uncertainty Quantification to Characterize and Improve Out-of-Domain Learning for PDEs	arXiv preprint arXiv:2403.10642	S Chandra Mouli Danielle C Maddix Shima Alizadeh Gaurav Gupta Andrew Stuart ...	2024/3/15
Towards foundation models for scientific machine learning: Characterizing scaling and transfer behavior	Advances in Neural Information Processing Systems	Shashank Subramanian Peter Harrington Kurt Keutzer Wahid Bhimji Dmitriy Morozov ...	2024/2/13
Chronos: Learning the language of time series	arXiv preprint arXiv:2403.07815	Abdul Fatir Ansari Lorenzo Stella Caner Turkmen Xiyuan Zhang Pedro Mercado ...	2024/3/12
Equation Discovery with Bayesian Spike-and-Slab Priors and Efficient Kernels		Da Long Wei Xing Aditi Krishnapriyan Robert Kirby Shandian Zhe ...	2024/4/18
When are ensembles really effective?	Advances in Neural Information Processing Systems	Ryan Theisen Hyunsuk Kim Yaoqing Yang Liam Hodgkinson Michael W Mahoney	2024/2/13
Data-Efficient Operator Learning via Unsupervised Pretraining and In-Context Learning	arXiv preprint arXiv:2402.15734	Wuyang Chen Jialin Song Pu Ren Shashank Subramanian Dmitriy Morozov ...	2024/2/24
NoisyMix: Boosting model robustness to common corruptions	Proc. of the 27th International Conference on AISTATS	N Benjamin Erichson Soon Hoe Lim Winnie Xu Francisco Utrera Ziang Cao ...	2024/2/2
Full stack optimization of transformer inference: a survey	arXiv preprint arXiv:2302.14017	Sehoon Kim Coleman Hooper Thanakul Wattanawong Minwoo Kang Ruohan Yan ...	2023/2/27
Squeezellm: Dense-and-sparse quantization	arXiv preprint arXiv:2306.07629	Sehoon Kim* Coleman Hooper* Amir Gholami* Zhen Dong Xiuyu Li ...	2023/6/13
Surrogate-based Autotuning for Randomized Sketching Algorithms in Regression Problems	arXiv preprint arXiv:2308.15720	Younghyun Cho James W Demmel Michał Dereziński Haoyun Li Hengrui Luo ...	2023/8/30
Learning continuous models for continuous physics	Communications Physics	Aditi S Krishnapriyan Alejandro F Queiruga N Benjamin Erichson Michael W Mahoney	2023/11/3
Constrained optimization via exact augmented lagrangian and randomized iterative sketching		Ilgee Hong Sen Na Michael W Mahoney Mladen Kolar	2023/7/3
DMLR: Data-centric Machine Learning Research--Past, Present and Future	arXiv preprint arXiv:2311.13028	Luis Oala Manil Maskey Lilith Bat-Leah Alicia Parrish Nezihe Merve Gürel ...	2023/11/21