Gennady Pekhimenko

University of Toronto

H-index: 35

North America-Canada

About Gennady Pekhimenko

Gennady Pekhimenko is a researcher at the University of Toronto with an h-index of 35 overall and 33 since 2020. He specializes in computer architecture, systems, systems for ML, and machine learning.

His recent articles include:

Minuet: Accelerating 3D sparse convolutions on GPUs

Proteus: Preserving Model Confidentiality during Graph Optimizations

Accelerating Graph Neural Networks on Real Processing-In-Memory Systems

Arbitor: A Numerically Accurate Hardware Emulation Tool for DNN Accelerators

The synergy of speculative decoding and batching in serving large language models

Federated benchmarking of medical artificial intelligence with MedPerf

Grape: Practical and Efficient Graphed Execution for Dynamic Deep Neural Networks on GPUs

Mixing sparsity compression

Gennady Pekhimenko Information

Citations (all): 5460
Citations (since 2020): 3868
Cited by: 3011
h-index (all): 35
h-index (since 2020): 33
i10-index (all): 55
i10-index (since 2020): 51

Gennady Pekhimenko Skills & Research Interests

Computer Architecture

Systems

Systems for ML

Machine Learning

Top articles of Gennady Pekhimenko

Minuet: Accelerating 3D sparse convolutions on GPUs
Authors: Jiacheng Yang, Christina Giannoula, Jun Wu, Mostafa Elhoushi, James Gleeson, ...
Publication date: 2024/4/22

Proteus: Preserving Model Confidentiality during Graph Optimizations
Journal: arXiv preprint arXiv:2404.12512
Authors: Yubo Gao, Maryam Haghifam, Christina Giannoula, Renbo Tu, Gennady Pekhimenko, ...
Publication date: 2024/4/18

Accelerating Graph Neural Networks on Real Processing-In-Memory Systems
Journal: arXiv preprint arXiv:2402.16731
Authors: Christina Giannoula, Peiming Yang, Ivan Fernandez Vega, Jiacheng Yang, Yu Xin Li, ...
Publication date: 2024/2/26

Arbitor: A Numerically Accurate Hardware Emulation Tool for DNN Accelerators
Authors: Chenhao Jiang, Anand Jayarajan, Hao Lu, Gennady Pekhimenko
Publication date: 2023

The synergy of speculative decoding and batching in serving large language models
Journal: arXiv preprint arXiv:2310.18813
Authors: Qidong Su, Christina Giannoula, Gennady Pekhimenko
Publication date: 2023/10/28

Federated benchmarking of medical artificial intelligence with MedPerf
Journal: Nature Machine Intelligence
Authors: Alexandros Karargyris, Renato Umeton, Micah J Sheller, Alejandro Aristizabal, Johnu George, ...
Publication date: 2023/7

Grape: Practical and Efficient Graphed Execution for Dynamic Deep Neural Networks on GPUs
Authors: Bojian Zheng, Cody Hao Yu, Jie Wang, Yaoyao Ding, Yizhi Liu, ...
Publication date: 2023/10/28

Mixing sparsity compression
Publication date: 2023/3/30

Guaranteed Approximation Bounds for Mixed-Precision Neural Operators
Authors: Renbo Tu, Colin White, Jean Kossaifi, Boris Bonev, Gennady Pekhimenko, ...
Publication date: 2023/10/13

Hotline Profiler: Automatic Annotation and A Multi-Scale Timeline for Visualizing Time-Use in DNN Training
Journal: Proceedings of Machine Learning and Systems
Authors: Daniel Snider, Fanny Chevalier, Gennady Pekhimenko
Publication date: 2023/3/18

Efficient data encoding for deep neural network training
Publication date: 2023/8/1

Tilt: A time-centric approach for stream query optimization and parallelization
Authors: Anand Jayarajan, Wei Zhao, Yudi Sun, Gennady Pekhimenko
Publication date: 2023/1/27

Lightweight Frequency-Based Tiering for CXL Memory Systems
Journal: arXiv preprint arXiv:2312.04789
Authors: Kevin Song, Jiacheng Yang, Sihang Liu, Gennady Pekhimenko
Publication date: 2023/12/8

Hidet: Task-mapping programming paradigm for deep learning tensor programs
Authors: Yaoyao Ding, Cody Hao Yu, Bojian Zheng, Yizhi Liu, Yida Wang, ...
Publication date: 2023/1/27

Speeding up fourier neural operators via mixed precision
Journal: arXiv preprint arXiv:2307.15034
Authors: Colin White, Renbo Tu, Jean Kossaifi, Gennady Pekhimenko, Kamyar Azizzadenesheli, ...
Publication date: 2023/7/27

TorchProbe: Fuzzing Dynamic Deep Learning Compilers
Authors: Qidong Su, Chuqin Geng, Gennady Pekhimenko, Xujie Si
Publication date: 2023/11/21

Keynote Talk 1: Efficient DNN Training at Scale: from Algorithms to Hardware
Authors: Gennady Pekhimenko
Publication date: 2022/5/30

Tempo: Accelerating Transformer-Based Model Training through Memory Footprint Reduction
Journal: Advances in Neural Information Processing Systems
Authors: Muralidhar Andoorveedu, Zhanda Zhu, Bojian Zheng, Gennady Pekhimenko
Publication date: 2022/12/6

DietCode: Automatic optimization for dynamic tensor programs
Journal: Proceedings of Machine Learning and Systems
Authors: Bojian Zheng, Ziheng Jiang, Cody Hao Yu, Haichen Shen, Joshua Fromm, ...
Publication date: 2022/4/22

GPUPool: A Holistic Approach to Fine-Grained GPU Sharing in the Cloud
Authors: Xiaodan Serina Tan, Pavel Golikov, Nandita Vijaykumar, Gennady Pekhimenko
Publication date: 2022/10/8

Co-Authors

Phillip Gibbons
Carnegie Mellon University
H-index: 86

Todd C. Mowry
Carnegie Mellon University
H-index: 58

Saugata Ghose
University of Illinois at Urbana-Champaign
H-index: 45

Hadi Esmaeilzadeh
University of California, San Diego
H-index: 40

Samira Khan
University of Virginia
H-index: 31

Rachata Ausavarungnirun
King Mongkut's University of Technology North Bangkok
H-index: 30
