Zhiru Zhang
Cornell University
H-index: 40
North America-United States
Top articles of Zhiru Zhang
Title | Journal | Author(s) | Publication Date |
---|---|---|---|
UniSparse: An Intermediate Language for General Sparse Format Customization | arXiv preprint arXiv:2403.05802 | Jie Liu Zhongyuan Zhao Zijian Ding Benjamin Brock Hongbo Rong | 2024/3/9 |
Binarized Neural Machine Translation | Advances in Neural Information Processing Systems | Yichi Zhang Ankush Garg Yuan Cao Lukasz Lew Behrooz Ghorbani | 2024/2/13 |
A Comprehensive Evaluation of FPGA-Based Spatial Acceleration of LLMs | | Hongzheng Chen Jiahao Zhang Yixiao Du Shaojie Xiang Zichao Yue | 2024/4/1 |
LibPreemptible: Enabling Fast, Adaptive, and Hardware-Assisted User-Space Scheduling | | Yueying Li Nikita Lazarev David Koufaty Tenny Yin Andy Anderson | 2024 |
Allo: A Programming Model for Composable Accelerator Design | arXiv preprint arXiv:2404.04815 | Hongzheng Chen Niansong Zhang Shaojie Xiang Zhichen Zeng Mengjia Dai | 2024/4/7 |
Trainable Fixed-Point Quantization for Deep Learning Acceleration on FPGAs | arXiv preprint arXiv:2401.17544 | Dingyi Dai Yichi Zhang Jiahao Zhang Zhanqiu Hu Yaohui Cai | 2024/1/31 |
Exploring the Limits of Semantic Image Compression at Micro-bits per Pixel | arXiv preprint arXiv:2402.13536 | Jordan Dotzel Bahaa Kotb James Dotzel Mohamed Abdelfattah Zhiru Zhang | 2024/2/21 |
Formal Verification of Source-to-Source Transformations for HLS | | Louis-Noël Pouchet Emily Tucker Niansong Zhang Hongzheng Chen Debjit Pal | 2024/4/1 |
Radial Networks: Dynamic Layer Routing for High-Performance Large Language Models | arXiv preprint arXiv:2404.04900 | Jordan Dotzel Yash Akhauri Ahmed S AbouElhamayed Carly Jiang Mohamed Abdelfattah | 2024/4/7 |
SAGMAN: Stability Analysis of Graph Neural Networks on the Manifolds | arXiv preprint arXiv:2402.08653 | Wuxinlin Cheng Chenhui Deng Ali Aghdaei Zhiru Zhang Zhuo Feng | 2024/2/13 |
Decoupled Model Schedule for Deep Learning Training | arXiv e-prints | Hongzheng Chen Cody Hao Yu Shuai Zheng Zhen Zhang Zhiru Zhang | 2023/2 |
A 28-nm 8-bit Floating-Point Tensor Core-Based Programmable CNN Training Processor With Dynamic Structured Sparsity | IEEE Journal of Solid-State Circuits | Shreyas Kolala Venkataramanaiah Jian Meng Han-Sok Suh Injune Yeo Jyotishman Saikia | 2023/5/15 |
RapidStream 2.0: Automated Parallel Implementation of Latency–Insensitive FPGA Designs Through Partial Reconfiguration | ACM Transactions on Reconfigurable Technology and Systems | Licheng Guo Pongstorn Maidee Yun Zhou Chris Lavin Eddie Hung | 2023/9/1 |
An Intermediate Language for General Sparse Format Customization | IEEE Computer Architecture Letters | Jie Liu Zhongyuan Zhao Zijian Ding Benjamin Brock Hongbo Rong | 2023/3/28 |
Comprehensive Benchmarking of Binary Neural Networks on NVM Crossbar Architectures | arXiv preprint arXiv:2308.06227 | Ruirong Huang Zichao Yue Caroline Huang Janarbek Matai Zhiru Zhang | 2023/8/11 |
Supporting a Virtual Vector Instruction Set on a Commercial Compute-in-SRAM Accelerator | IEEE Computer Architecture Letters | Courtney Golden Dan Ilan Caroline Huang Niansong Zhang Zhiru Zhang | 2023/12/11 |
A Case for Open EDA Verticals | | Zhiru Zhang Matthew Hofmann Andrew Butt | 2023/3/26 |
TAPA: a scalable task-parallel dataflow programming framework for modern FPGAs with co-optimization of HLS and physical design | ACM Transactions on Reconfigurable Technology and Systems | Licheng Guo Yuze Chi Jason Lau Linghao Song Xingyu Tian | 2023/12/5 |
FLIQS: One-Shot Mixed-Precision Floating-Point and Integer Quantization Search | arXiv preprint arXiv:2308.03290 | Jordan Dotzel Gang Wu Andrew Li Muhammad Umar Yun Ni | 2023/8/7 |
Slapo: A Schedule Language for Progressive Optimization of Large Deep Learning Model Training | arXiv preprint arXiv:2302.08005 | Hongzheng Chen Cody Hao Yu Shuai Zheng Zhen Zhang Zhiru Zhang | 2023/2/16 |