Jingwen Leng
Shanghai Jiao Tong University
H-index: 22
Asia-China
Top articles of Jingwen Leng
Title | Journal | Author(s) | Publication Date |
---|---|---|---|
JUNO: Optimizing High-Dimensional Approximate Nearest Neighbour Search with Sparsity-Aware Algorithm and Ray-Tracing Core Mapping | arXiv preprint arXiv:2312.01712 | Zihan Liu Wentao Ni Jingwen Leng Yu Feng Cong Guo | 2023/12/4 |
Amanda: Unified Instrumentation Framework for Deep Neural Networks | Yue Guan Yuxian Qiu Jingwen Leng Fan Yang Shuo Yu | 2024/4/27 | |
Towards Fast Setup and High Throughput of GPU Serverless Computing | arXiv preprint arXiv:2404.14691 | Han Zhao Weihao Cui Quan Chen Shulai Zhang Zijun Li | 2024/4/23 |
Fractal: Joint Multi-Level Sparse Pattern Tuning of Accuracy and Performance for DNN Pruning | Yue Guan Changming Yu Yangjie Zhou Jingwen Leng Chao Li | 2024/4/27 | |
Fovea Transformer: Efficient Long-Context Modeling with Structured Fine-To-Coarse Attention | Ziwei He Jian Yuan Le Zhou Jingwen Leng Bo Jiang | 2023/11/13 | |
MAGIS: Memory Optimization via Coordinated Graph Transformation and Scheduling for DNN | Renze Chen Zijian Ding Size Zheng Chengrui Zhang Jingwen Leng | 2024/4/27 | |
Accelerating Sparse DNNs Based on Tiled GEMM | arXiv preprint arXiv:2402.10876 | Cong Guo Fengchen Xue Jingwen Leng Yuxian Qiu Yue Guan | 2024/2/16 |
GMLake: Efficient and Transparent GPU Memory Defragmentation for Large-scale DNN Training with Virtual Memory Stitching | arXiv preprint arXiv:2401.08156 | Cong Guo Rui Zhang Jiale Xu Jingwen Leng Zihan Liu | 2024/1/16 |
ImaGen: A general framework for generating memory-and power-efficient image processing accelerators | Nisarg Ujjainkar Jingwen Leng Yuhao Zhu | 2023/6/17 | |
Not All Resources are Visible: Exploiting Fragmented Shadow Resources in Shared-State Scheduler Architecture | Xinkai Wang Hao He Yuancheng Li Chao Li Xiaofeng Hou | 2023/10/30 | |
Chimera: An analytical optimizing framework for effective compute-intensive operators fusion | Size Zheng Siyuan Chen Peidi Song Renze Chen Xiuhong Li | 2023/2/25 | |
Olive: Accelerating large language models via hardware-friendly outlier-victim pair quantization | Cong Guo Jiaming Tang Weiming Hu Jingwen Leng Chen Zhang | 2023/6/17 | |
Accelerating generic graph neural networks via architecture, compiler, partition method co-design | arXiv preprint arXiv:2308.08174 | Shuwen Lu Zhihui Zhang Cong Guo Jingwen Leng Yangjie Zhou | 2023/8/16 |
ugrapher: High-performance graph operator computation via unified abstraction for graph neural networks | Yangjie Zhou Jingwen Leng Yaoxu Song Shuwen Lu Mian Wang | 2023/1/27 | |
Fourier Transformer: Fast Long Range Modeling by Removing Sequence Redundancy with FFT Operator | Ziwei He Meng Yang Minwei Feng Jingcheng Yin Xinbing Wang | 2023/5/24 | |
Improving Cluster Utilization through Adaptive Resource Management for DNN and CPU Jobs Co-location | IEEE Transactions on Computers | Han Zhao Weihao Cui Quan Chen Jingwen Leng Deze Zeng | 2023/8/10 |
FIRST: Exploiting the Multi-Dimensional Attributes of Functions for Power-Aware Serverless Computing | Lu Zhang Chao Li Xinkai Wang Weiqi Feng Zheng Yu | 2023/5/15 | |
Pac: Preference-aware co-location scheduling on heterogeneous numa architectures to improve resource utilization | Pu Pang Yaoxuan Li Bo Liu Quan Chen Zhou Yu | 2023/6/21 | |
DistSim: A performance model of large-scale hybrid distributed DNN training | Guandong Lu Runzhe Chen Yakai Wang Yangjie Zhou Rui Zhang | 2023/5/9 | |
DFlow: Efficient Dataflow-based Invocation Workflow Execution for Function-as-a-Service | arXiv preprint arXiv:2306.11043 | Xiaoxiang Shi Chao Li Zijun Li Zihan Liu Dianmo Sheng | 2023/6/19 |