Shivaram Venkataraman
University of Wisconsin-Madison
H-index: 39
North America-United States
Top articles of Shivaram Venkataraman
Mitigating communication bottlenecks during parameter exchange in data-parallel DNN training
2024/1/9
Blox: A Modular Toolkit for Deep Learning Schedulers
2024/4/22
Saurabh Agarwal
H-Index: 15
Shivaram Venkataraman
H-Index: 25
CHAI: Clustered Head Attention for Efficient LLM Inference
arXiv preprint arXiv:2403.08058
2024/3/12
Decoding Speculative Decoding
arXiv preprint arXiv:2402.01528
2024/2/2
Saurabh Agarwal
H-Index: 15
Shivaram Venkataraman
H-Index: 25
Mirage: Towards Low-interruption Services on Batch GPU Clusters with Reinforcement Learning
2023/11/12
Shivaram Venkataraman
H-Index: 25
Zhao Zhang
H-Index: 15
PolyThrottle: Energy-efficient Neural Network Inference on Edge Devices
arXiv preprint arXiv:2310.19991
2023/10/30
Hongyi Wang
H-Index: 8
Shivaram Venkataraman
H-Index: 25
Bagpipe: Accelerating deep recommendation model training
2023/10/23
Mariusgnn: Resource-efficient out-of-core training of graph neural networks
2023/5/8
F2: Designing a Key-Value Store for Large Skewed Workloads
arXiv preprint arXiv:2305.01516
2023/5/2
Konstantinos Kanellis
H-Index: 1
Shivaram Venkataraman
H-Index: 25
Estimating Battery State-of-Charge using Machine Learning and Physics-Based Models
2023/4/11
Sahana Upadhya
H-Index: 1
Michael Wagner
H-Index: 18
Shivaram Venkataraman
H-Index: 25
Sage Kokjohn
H-Index: 28
Does compressing activations help model parallel training?
arXiv preprint arXiv:2301.02654
2023/1/6
Hongyi Wang
H-Index: 8
Shivaram Venkataraman
H-Index: 25
Shockwave: Fair and efficient cluster scheduling for dynamic adaptation in machine learning
2023
Not all gpus are created equal: characterizing variability in large-scale, accelerator-rich systems
2022/11/13
On the utility of gradient compression in distributed training systems
Proceedings of Machine Learning and Systems
2022/4/22
The Roaming Edge and its Applications
GetMobile: Mobile Computing and Communications
2022/3/30
Suman Banerjee
H-Index: 15
Remzi Arpaci-Dusseau
H-Index: 38
Kassem Fawaz
H-Index: 16
Mohit Gupta
H-Index: 4
Kangwook Lee
H-Index: 14
Shivaram Venkataraman
H-Index: 25
LlamaTune: Sample-efficient DBMS configuration tuning
arXiv preprint arXiv:2203.05128
2022/3/10
Konstantinos Kanellis
H-Index: 1
Cong Ding
H-Index: 1
Brian Kroth
H-Index: 3
Andreas Müller
H-Index: 4
Shivaram Venkataraman
H-Index: 25
Marius++: Large-scale training of graph neural networks on a single machine
arXiv preprint arXiv:2202.02365
2022/2/4
Doing more by doing less: how structured partial backpropagation improves deep learning clusters
2021/12/7
Kaisa: an adaptive second-order optimizer framework for deep neural networks
2021/11/14
Qi Huang
H-Index: 5
Lei Huang
H-Index: 4
Shivaram Venkataraman
H-Index: 25
Kyle Chard
H-Index: 25
Zhao Zhang
H-Index: 15
Atoll: A scalable low-latency serverless platform
2021/11/1
Arjun Singhvi
H-Index: 6
Mohammed Danish Shaikh
H-Index: 1
Shivaram Venkataraman
H-Index: 25
Aditya Akella
H-Index: 43