Gennady Pekhimenko
University of Toronto
H-index: 35
North America-Canada
Top articles of Gennady Pekhimenko
Title | Journal | Author(s) | Publication Date |
---|---|---|---|
Minuet: Accelerating 3D sparse convolutions on GPUs | Jiacheng Yang Christina Giannoula Jun Wu Mostafa Elhoushi James Gleeson | 2024/4/22 | |
Proteus: Preserving Model Confidentiality during Graph Optimizations | arXiv preprint arXiv:2404.12512 | Yubo Gao Maryam Haghifam Christina Giannoula Renbo Tu Gennady Pekhimenko | 2024/4/18 |
Accelerating Graph Neural Networks on Real Processing-In-Memory Systems | arXiv preprint arXiv:2402.16731 | Christina Giannoula Peiming Yang Ivan Fernandez Vega Jiacheng Yang Yu Xin Li | 2024/2/26 |
Arbitor: A Numerically Accurate Hardware Emulation Tool for {DNN} Accelerators | Chenhao Jiang Anand Jayarajan Hao Lu Gennady Pekhimenko | 2023 | |
The synergy of speculative decoding and batching in serving large language models | arXiv preprint arXiv:2310.18813 | Qidong Su Christina Giannoula Gennady Pekhimenko | 2023/10/28 |
Federated benchmarking of medical artificial intelligence with MedPerf | Nature Machine Intelligence | Alexandros Karargyris Renato Umeton Micah J Sheller Alejandro Aristizabal Johnu George | 2023/7 |
Grape: Practical and Efficient Graphed Execution for Dynamic Deep Neural Networks on GPUs | Bojian Zheng Cody Hao Yu Jie Wang Yaoyao Ding Yizhi Liu | 2023/10/28 | |
Mixing sparsity compression | 2023/3/30 | ||
Guaranteed Approximation Bounds for Mixed-Precision Neural Operators | Renbo Tu Colin White Jean Kossaifi Boris Bonev Gennady Pekhimenko | 2023/10/13 | |
Hotline Profiler: Automatic Annotation and A Multi-Scale Timeline for Visualizing Time-Use in DNN Training | Proceedings of Machine Learning and Systems | Daniel Snider Fanny Chevalier Gennady Pekhimenko | 2023/3/18 |
Efficient data encoding for deep neural network training | 2023/8/1 | ||
Tilt: A time-centric approach for stream query optimization and parallelization | Anand Jayarajan Wei Zhao Yudi Sun Gennady Pekhimenko | 2023/1/27 | |
Lightweight Frequency-Based Tiering for CXL Memory Systems | arXiv preprint arXiv:2312.04789 | Kevin Song Jiacheng Yang Sihang Liu Gennady Pekhimenko | 2023/12/8 |
Hidet: Task-mapping programming paradigm for deep learning tensor programs | Yaoyao Ding Cody Hao Yu Bojian Zheng Yizhi Liu Yida Wang | 2023/1/27 | |
Speeding up fourier neural operators via mixed precision | arXiv preprint arXiv:2307.15034 | Colin White Renbo Tu Jean Kossaifi Gennady Pekhimenko Kamyar Azizzadenesheli | 2023/7/27 |
TorchProbe: Fuzzing Dynamic Deep Learning Compilers | Qidong Su Chuqin Geng Gennady Pekhimenko Xujie Si | 2023/11/21 | |
Keynote Talk 1: Efficient DNN Training at Scale: from Algorithms to Hardware | Gennady Pekhimenko | 2022/5/30 | |
Tempo: Accelerating Transformer-Based Model Training through Memory Footprint Reduction | Advances in Neural Information Processing Systems | Muralidhar Andoorveedu Zhanda Zhu Bojian Zheng Gennady Pekhimenko | 2022/12/6 |
DietCode: Automatic optimization for dynamic tensor programs | Proceedings of Machine Learning and Systems | Bojian Zheng Ziheng Jiang Cody Hao Yu Haichen Shen Joshua Fromm | 2022/4/22 |
GPUPool: A Holistic Approach to Fine-Grained GPU Sharing in the Cloud | Xiaodan Serina Tan Pavel Golikov Nandita Vijaykumar Gennady Pekhimenko | 2022/10/8 |