Shivaram Venkataraman

Shivaram Venkataraman

University of Wisconsin-Madison

H-index: 39

North America-United States

About Shivaram Venkataraman

Shivaram Venkataraman, With an exceptional h-index of 39 and a recent h-index of 34 (since 2020), a distinguished researcher at University of Wisconsin-Madison, specializes in the field of Systems, Databases.

His recent articles reflect a diverse array of research interests and contributions to the field:

Mitigating communication bottlenecks during parameter exchange in data-parallel DNN training

Blox: A Modular Toolkit for Deep Learning Schedulers

CHAI: Clustered Head Attention for Efficient LLM Inference

Decoding Speculative Decoding

Estimating Battery State-of-Charge using Machine Learning and Physics-Based Models

Mirage: Towards Low-interruption Services on Batch GPU Clusters with Reinforcement Learning

Does compressing activations help model parallel training?

PolyThrottle: Energy-efficient Neural Network Inference on Edge Devices

Shivaram Venkataraman Information

University

Position

___

Citations(all)

11983

Citations(since 2020)

8484

Cited By

6781

hIndex(all)

39

hIndex(since 2020)

34

i10Index(all)

68

i10Index(since 2020)

56

Email

University Profile Page

University of Wisconsin-Madison

Google Scholar

View Google Scholar Profile

Shivaram Venkataraman Skills & Research Interests

Systems

Databases

Top articles of Shivaram Venkataraman

Title

Journal

Author(s)

Publication Date

Mitigating communication bottlenecks during parameter exchange in data-parallel DNN training

2024/1/9

Blox: A Modular Toolkit for Deep Learning Schedulers

Saurabh Agarwal

Amar Phanishayee

Shivaram Venkataraman

2024/4/22

CHAI: Clustered Head Attention for Efficient LLM Inference

arXiv preprint arXiv:2403.08058

Saurabh Agarwal

Bilge Acun

Basil Homer

Mostafa Elhoushi

Yejin Lee

...

2024/3/12

Decoding Speculative Decoding

arXiv preprint arXiv:2402.01528

Minghao Yan

Saurabh Agarwal

Shivaram Venkataraman

2024/2/2

Estimating Battery State-of-Charge using Machine Learning and Physics-Based Models

Harsh Darshan Sapra

Olesia Elfimova

Sahana Upadhya

Lukas Desorcy

Michael Wagner

...

2023/4/11

Mirage: Towards Low-interruption Services on Batch GPU Clusters with Reinforcement Learning

Qiyang Ding

Pengfei Zheng

Shreyas Kudari

Shivaram Venkataraman

Zhao Zhang

2023/11/12

Does compressing activations help model parallel training?

arXiv preprint arXiv:2301.02654

Song Bian

Dacheng Li

Hongyi Wang

Eric P Xing

Shivaram Venkataraman

2023/1/6

PolyThrottle: Energy-efficient Neural Network Inference on Edge Devices

arXiv preprint arXiv:2310.19991

Minghao Yan

Hongyi Wang

Shivaram Venkataraman

2023/10/30

Shockwave: Fair and efficient cluster scheduling for dynamic adaptation in machine learning

Pengfei Zheng

Rui Pan

Tarannum Khan

Shivaram Venkataraman

Aditya Akella

2023

Bagpipe: Accelerating deep recommendation model training

Saurabh Agarwal

Chengpo Yan

Ziyi Zhang

Shivaram Venkataraman

2023/10/23

Mariusgnn: Resource-efficient out-of-core training of graph neural networks

Roger Waleffe

Jason Mohoney

Theodoros Rekatsinas

Shivaram Venkataraman

2023/5/8

F2: Designing a Key-Value Store for Large Skewed Workloads

arXiv preprint arXiv:2305.01516

Konstantinos Kanellis

Badrish Chandramouli

Shivaram Venkataraman

2023/5/2

The Roaming Edge and its Applications

GetMobile: Mobile Computing and Communications

Suman Banerjee

Remzi Arpaci-Dusseau

Shenghong Dai

Kassem Fawaz

Mohit Gupta

...

2022/3/30

LlamaTune: Sample-efficient DBMS configuration tuning

arXiv preprint arXiv:2203.05128

Konstantinos Kanellis

Cong Ding

Brian Kroth

Andreas Müller

Carlo Curino

...

2022/3/10

Marius++: Large-scale training of graph neural networks on a single machine

arXiv preprint arXiv:2202.02365

Roger Waleffe

Jason Mohoney

Theodoros Rekatsinas

Shivaram Venkataraman

2022/2/4

Not all gpus are created equal: characterizing variability in large-scale, accelerator-rich systems

Prasoon Sinha

Akhil Guliani

Rutwik Jain

Brandon Tran

Matthew D Sinclair

...

2022/11/13

On the utility of gradient compression in distributed training systems

Proceedings of Machine Learning and Systems

Saurabh Agarwal

Hongyi Wang

Shivaram Venkataraman

Dimitris Papailiopoulos

2022/4/22

P3: Distributed deep graph learning at scale

Swapnil Gandhi

Anand Padmanabha Iyer

2021

Atoll: A scalable low-latency serverless platform

Arjun Singhvi

Arjun Balasubramanian

Kevin Houck

Mohammed Danish Shaikh

Shivaram Venkataraman

...

2021/11/1

Apache spark

Alexandre da Silva Veith

Marcos Dias de Assuncao

2019

See List of Professors in Shivaram Venkataraman University(University of Wisconsin-Madison)