Shivaram Venkataraman

About Shivaram Venkataraman

Shivaram Venkataraman, With an exceptional h-index of 39 and a recent h-index of 34 (since 2020), a distinguished researcher at University of Wisconsin-Madison, specializes in the field of Systems, Databases.

His recent articles reflect a diverse array of research interests and contributions to the field:

Mitigating communication bottlenecks during parameter exchange in data-parallel DNN training

Blox: A Modular Toolkit for Deep Learning Schedulers

CHAI: Clustered Head Attention for Efficient LLM Inference

Decoding Speculative Decoding

Mirage: Towards Low-interruption Services on Batch GPU Clusters with Reinforcement Learning

PolyThrottle: Energy-efficient Neural Network Inference on Edge Devices

Bagpipe: Accelerating deep recommendation model training

Mariusgnn: Resource-efficient out-of-core training of graph neural networks

Shivaram Venkataraman Information

University

Position

___

Citations(all)

11983

Citations(since 2020)

8484

Cited By

6781

hIndex(all)

39

hIndex(since 2020)

34

i10Index(all)

68

i10Index(since 2020)

56

Email

University Profile Page

Google Scholar

Shivaram Venkataraman Skills & Research Interests

Systems

Databases

Top articles of Shivaram Venkataraman

Mitigating communication bottlenecks during parameter exchange in data-parallel DNN training

2024/1/9

Blox: A Modular Toolkit for Deep Learning Schedulers

2024/4/22

Saurabh Agarwal
Saurabh Agarwal

H-Index: 15

Shivaram Venkataraman
Shivaram Venkataraman

H-Index: 25

CHAI: Clustered Head Attention for Efficient LLM Inference

arXiv preprint arXiv:2403.08058

2024/3/12

Decoding Speculative Decoding

arXiv preprint arXiv:2402.01528

2024/2/2

Saurabh Agarwal
Saurabh Agarwal

H-Index: 15

Shivaram Venkataraman
Shivaram Venkataraman

H-Index: 25

Mirage: Towards Low-interruption Services on Batch GPU Clusters with Reinforcement Learning

2023/11/12

Shivaram Venkataraman
Shivaram Venkataraman

H-Index: 25

Zhao Zhang
Zhao Zhang

H-Index: 15

PolyThrottle: Energy-efficient Neural Network Inference on Edge Devices

arXiv preprint arXiv:2310.19991

2023/10/30

Hongyi Wang
Hongyi Wang

H-Index: 8

Shivaram Venkataraman
Shivaram Venkataraman

H-Index: 25

Bagpipe: Accelerating deep recommendation model training

2023/10/23

Saurabh Agarwal
Saurabh Agarwal

H-Index: 15

Ziyi Zhang
Ziyi Zhang

H-Index: 0

Shivaram Venkataraman
Shivaram Venkataraman

H-Index: 25

Mariusgnn: Resource-efficient out-of-core training of graph neural networks

2023/5/8

F2: Designing a Key-Value Store for Large Skewed Workloads

arXiv preprint arXiv:2305.01516

2023/5/2

Konstantinos Kanellis
Konstantinos Kanellis

H-Index: 1

Shivaram Venkataraman
Shivaram Venkataraman

H-Index: 25

Estimating Battery State-of-Charge using Machine Learning and Physics-Based Models

2023/4/11

Does compressing activations help model parallel training?

arXiv preprint arXiv:2301.02654

2023/1/6

Hongyi Wang
Hongyi Wang

H-Index: 8

Shivaram Venkataraman
Shivaram Venkataraman

H-Index: 25

Shockwave: Fair and efficient cluster scheduling for dynamic adaptation in machine learning

2023

Rui Pan
Rui Pan

H-Index: 4

Shivaram Venkataraman
Shivaram Venkataraman

H-Index: 25

Aditya Akella
Aditya Akella

H-Index: 43

Not all gpus are created equal: characterizing variability in large-scale, accelerator-rich systems

2022/11/13

Akhil Guliani
Akhil Guliani

H-Index: 3

Brandon Tran
Brandon Tran

H-Index: 11

Shivaram Venkataraman
Shivaram Venkataraman

H-Index: 25

On the utility of gradient compression in distributed training systems

Proceedings of Machine Learning and Systems

2022/4/22

Saurabh Agarwal
Saurabh Agarwal

H-Index: 15

Hongyi Wang
Hongyi Wang

H-Index: 8

Shivaram Venkataraman
Shivaram Venkataraman

H-Index: 25

The Roaming Edge and its Applications

GetMobile: Mobile Computing and Communications

2022/3/30

LlamaTune: Sample-efficient DBMS configuration tuning

arXiv preprint arXiv:2203.05128

2022/3/10

Marius++: Large-scale training of graph neural networks on a single machine

arXiv preprint arXiv:2202.02365

2022/2/4

Doing more by doing less: how structured partial backpropagation improves deep learning clusters

2021/12/7

Adarsh Kumar
Adarsh Kumar

H-Index: 2

Shivaram Venkataraman
Shivaram Venkataraman

H-Index: 25

Aditya Akella
Aditya Akella

H-Index: 43

Kaisa: an adaptive second-order optimizer framework for deep neural networks

2021/11/14

Atoll: A scalable low-latency serverless platform

2021/11/1

See List of Professors in Shivaram Venkataraman University(University of Wisconsin-Madison)