Shivaram Venkataraman
University of Wisconsin-Madison
H-index: 39
North America-United States
Top articles of Shivaram Venkataraman
Title | Journal | Author(s) | Publication Date |
---|---|---|---|
Mitigating communication bottlenecks during parameter exchange in data-parallel DNN training | 2024/1/9 | ||
Blox: A Modular Toolkit for Deep Learning Schedulers | Saurabh Agarwal Amar Phanishayee Shivaram Venkataraman | 2024/4/22 | |
CHAI: Clustered Head Attention for Efficient LLM Inference | arXiv preprint arXiv:2403.08058 | Saurabh Agarwal Bilge Acun Basil Homer Mostafa Elhoushi Yejin Lee | 2024/3/12 |
Decoding Speculative Decoding | arXiv preprint arXiv:2402.01528 | Minghao Yan Saurabh Agarwal Shivaram Venkataraman | 2024/2/2 |
Estimating Battery State-of-Charge using Machine Learning and Physics-Based Models | Harsh Darshan Sapra Olesia Elfimova Sahana Upadhya Lukas Desorcy Michael Wagner | 2023/4/11 | |
Mirage: Towards Low-interruption Services on Batch GPU Clusters with Reinforcement Learning | Qiyang Ding Pengfei Zheng Shreyas Kudari Shivaram Venkataraman Zhao Zhang | 2023/11/12 | |
Does compressing activations help model parallel training? | arXiv preprint arXiv:2301.02654 | Song Bian Dacheng Li Hongyi Wang Eric P Xing Shivaram Venkataraman | 2023/1/6 |
PolyThrottle: Energy-efficient Neural Network Inference on Edge Devices | arXiv preprint arXiv:2310.19991 | Minghao Yan Hongyi Wang Shivaram Venkataraman | 2023/10/30 |
Shockwave: Fair and efficient cluster scheduling for dynamic adaptation in machine learning | Pengfei Zheng Rui Pan Tarannum Khan Shivaram Venkataraman Aditya Akella | 2023 | |
Bagpipe: Accelerating deep recommendation model training | Saurabh Agarwal Chengpo Yan Ziyi Zhang Shivaram Venkataraman | 2023/10/23 | |
Mariusgnn: Resource-efficient out-of-core training of graph neural networks | Roger Waleffe Jason Mohoney Theodoros Rekatsinas Shivaram Venkataraman | 2023/5/8 | |
F2: Designing a Key-Value Store for Large Skewed Workloads | arXiv preprint arXiv:2305.01516 | Konstantinos Kanellis Badrish Chandramouli Shivaram Venkataraman | 2023/5/2 |
The Roaming Edge and its Applications | GetMobile: Mobile Computing and Communications | Suman Banerjee Remzi Arpaci-Dusseau Shenghong Dai Kassem Fawaz Mohit Gupta | 2022/3/30 |
LlamaTune: Sample-efficient DBMS configuration tuning | arXiv preprint arXiv:2203.05128 | Konstantinos Kanellis Cong Ding Brian Kroth Andreas Müller Carlo Curino | 2022/3/10 |
Marius++: Large-scale training of graph neural networks on a single machine | arXiv preprint arXiv:2202.02365 | Roger Waleffe Jason Mohoney Theodoros Rekatsinas Shivaram Venkataraman | 2022/2/4 |
Not all gpus are created equal: characterizing variability in large-scale, accelerator-rich systems | Prasoon Sinha Akhil Guliani Rutwik Jain Brandon Tran Matthew D Sinclair | 2022/11/13 | |
On the utility of gradient compression in distributed training systems | Proceedings of Machine Learning and Systems | Saurabh Agarwal Hongyi Wang Shivaram Venkataraman Dimitris Papailiopoulos | 2022/4/22 |
P3: Distributed deep graph learning at scale | Swapnil Gandhi Anand Padmanabha Iyer | 2021 | |
Atoll: A scalable low-latency serverless platform | Arjun Singhvi Arjun Balasubramanian Kevin Houck Mohammed Danish Shaikh Shivaram Venkataraman | 2021/11/1 | |
Apache spark | Alexandre da Silva Veith Marcos Dias de Assuncao | 2019 |