Zeke Wang
Zhejiang University
H-index: 17
Asia-China
Top articles of Zeke Wang
Title | Journal | Author(s) | Publication Date |
---|---|---|---|
DeFT: Flash Tree-attention with IO-Awareness for Efficient Tree-search-based LLM Inference | arXiv preprint arXiv:2404.00242 | Jinwei Yao Kaiqi Chen Kexun Zhang Jiaxuan You Binhang Yuan | 2024/3/30 |
Adding NVMe SSDs to Enable and Accelerate 100B Model Fine-tuning on a Single GPU | arXiv preprint arXiv:2403.06504 | Changyue Liao Mo Sun Zihan Yang Kaiqi Chen Binhang Yuan | 2024/3/11 |
Demystifying Datapath Accelerator Enhanced Off-path SmartNIC | arXiv preprint arXiv:2402.03041 | Xuzheng Chen Jie Zhang Ting Fu Yifan Shen Shu Ma | 2024/2/5 |
Understanding Routable {PCIe} Performance for Composable Infrastructures | Wentao Hou Jie Zhang Zeke Wang Ming Liu | 2024 | |
Legion: Automatically Pushing the Envelope of {Multi-GPU} System for {Billion-Scale}{GNN} Training | Jie Sun Li Su Zuocheng Shi Wenting Shen Zeke Wang | 2023 | |
PyHGL: A Python-based Hardware Generation Language Framework | arXiv preprint arXiv:2309.04859 | Jintao Sun Zeke Wang Tao Lu Wenzhi Chen | 2023/9/9 |
SSiMD: Supporting Six Signed Multiplications in a DSP Block for Low-Precision CNN on FPGAs | Qi Liu Mo Sun Jie Sun Liqiang Lu Jieru Zhao | 2023/12/12 | |
Mars: Exploiting multi-level parallelism for dnn workloads on adaptive multi-accelerator systems | Guan Shen Jieru Zhao Zeke Wang Zhe Lin Wenchao Ding | 2023/7/9 | |
BM-Store: A Transparent and High-performance Local Storage Architecture for Bare-metal Clouds Enabling Large-scale Deployment | Yiquan Chen Jiexiong Xu Chengkun Wei Yijing Wang Xin Yuan | 2023 | |
Critique of “productivity, Portability, Performance: Data-Centric Python” by SCC Team From Zhejiang University | IEEE Transactions on Parallel and Distributed Systems | Zihan Yang Yi Chen Kaiqi Chen Xingjian Qian Shaojun Xu | 2023/11/17 |
Staleness-Reduction Mini-Batch -Means | IEEE Transactions on Neural Networks and Learning Systems | Xueying Zhu Jie Sun Zhenhao He Jiantong Jiang Zeke Wang | 2023/6/16 |
SparseACC: A Generalized Linear Model Accelerator for Sparse Datasets | IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems | Jie Zhang Hongjing Huang Jie Sun Juan Gómez Luna Onur Mutlu | 2023/10/12 |
P4SGD: Programmable Switch Enhanced Model-Parallel Training on Generalized Linear Models on Distributed FPGAs | IEEE Transactions on Parallel and Distributed Systems | Hongjing Huang Yingtao Li Jie Sun Xueying Zhu Jie Zhang | 2023/6/8 |
Helios: An Efficient Out-of-core GNN Training System on Terabyte-scale Graphs with In-memory Performance | arXiv preprint arXiv:2310.00837 | Jie Sun Mo Sun Zheng Zhang Jun Xie Zuocheng Shi | 2023/10/2 |
SmartDS: Middle-Tier-centric SmartNIC Enabling Application-aware Message Split for Disaggregated Block Storage | Jie Zhang Hongjing Huang Lingjun Zhu Shu Ma Dazhong Rong | 2023/6/17 | |
Achelous: Enabling Programmability, Elasticity, and Reliability in Hyperscale Cloud Networks | Chengkun Wei Xing Li Ye Yang Xiaochong Jiang Tianyu Xu | 2023/9/10 | |
Multi-objective Meta-return Reinforcement Learning for Sequential Recommendation | Yemin Yu Kun Kuang Jiangchao Yang Zeke Wang Kunyang Jia | 2022/8/27 | |
Terminator on SkyNet: a practical DVFS attack on DNN hardware IP for UAV object detection | Junge Xu Bohan Xuan Anlin Liu Mo Sun Fan Zhang | 2022/7/10 | |
Cuzk: Accelerating zero-knowledge proof with a faster parallel multi-scalar multiplication algorithm on gpus | Cryptology ePrint Archive | Tao Lu Chengkun Wei Ruijing Yu Chaochao Chen Wenjing Fang | 2022 |
FpgaNIC: An FPGA-based Versatile 100Gb SmartNIC for GPUs | Zeke Wang Hongjing Huang Jie Zhang Fei Wu Gustavo Alonso | 2022 |