Xin Jin
Peking University
H-index: 32
Asia-China
Top articles of Xin Jin
Title | Journal | Author(s) | Publication Date |
---|---|---|---|
DistMind: Efficient Resource Disaggregation for Deep Learning Workloads | IEEE/ACM Transactions on Networking | Xin Jin Zhihao Bai Zhen Zhang Yibo Zhu Yinmin Zhong | 2024/1/24 |
SoCFlow: Efficient and Scalable DNN Training on SoC-Clustered Edge Servers | | Daliang Xu Mengwei Xu Chiheng Lou Li Zhang Gang Huang | 2024/4/27 |
Unison: A Parallel-Efficient and User-Transparent Network Simulation Kernel | | Songyuan Bai Hao Zheng Chen Tian Xiaoliang Wang Chang Liu | 2024/4/22 |
DistServe: Disaggregating Prefill and Decoding for Goodput-optimized Large Language Model Serving | arXiv preprint arXiv:2401.09670 | Yinmin Zhong Shengyu Liu Junda Chen Jianbo Hu Yibo Zhu | 2024/1/18 |
RAGCache: Efficient Knowledge Caching for Retrieval-Augmented Generation | arXiv preprint arXiv:2404.12457 | Chao Jin Zili Zhang Xuanlin Jiang Fangyue Liu Xin Liu | 2024/4/18 |
InternEvo: Efficient Long-sequence Large Language Model Training via Hybrid Parallelism and Redundant Sharding | arXiv preprint arXiv:2401.09149 | Qiaoling Chen Diandian Gu Guoteng Wang Xun Chen YingTong Xiong | 2024/1/17 |
A survey of resource-efficient LLM and multimodal foundation models | arXiv preprint arXiv:2401.08092 | Mengwei Xu Wangsong Yin Dongqi Cai Rongjie Yi Daliang Xu | 2024/1/16 |
LoongServe: Efficiently Serving Long-context Large Language Models with Elastic Sequence Parallelism | arXiv preprint arXiv:2404.09526 | Bingyang Wu Shengyu Liu Yinmin Zhong Peng Sun Xuanzhe Liu | 2024/4/15 |
Jolteon: Unleashing the Promise of Serverless for Serverless Workflows | | Zili Zhang Chao Jin Xin Jin | 2024 |
MegaScale: Scaling Large Language Model Training to More Than 10,000 GPUs | arXiv preprint arXiv:2402.15627 | Ziheng Jiang Haibin Lin Yinmin Zhong Qi Huang Yangrui Chen | 2024/2/23 |
ElasticFlow: An Elastic Serverless Training Platform for Distributed Deep Learning | | Diandian Gu Yihao Zhao Yinmin Zhong Yifan Xiong Zhenhua Han | 2023 |
Fast, Approximate Vector Queries on Very Large Unstructured Datasets | | Zili Zhang Chao Jin Linpeng Tang Xuanzhe Liu Xin Jin | 2023 |
FaaSLight: General Application-level Cold-start Latency Optimization for Function-as-a-Service in Serverless Computing | ACM Transactions on Software Engineering and Methodology | Xuanzhe Liu Jinfeng Wen Zhenpeng Chen Ding Li Junkai Chen | 2023/7/22 |
Klotski: Efficient and Safe Network Migration of Large Production Datacenters | | Yihao Zhao Xiaoxiang Zhang Hang Zhu Ying Zhang Zhaodong Wang | 2023/9/10 |
Niagara: Scheduling DNN Inference Services on Heterogeneous Edge Processors | | Daliang Xu Qing Li Mengwei Xu Kang Huang Gang Huang | 2023/11/20 |
Transparent GPU Sharing in Container Clouds for Deep Learning Workloads | | Bingyang Wu Zili Zhang Zhihao Bai Xuanzhe Liu Xin Jin | 2023/4 |
Rise of the planet of serverless computing: A systematic review | | Jinfeng Wen Zhenpeng Chen Xin Jin Xuanzhe Liu | 2023/7/21 |
Understanding the Micro-Behaviors of Hardware Offloaded Network Stacks with Lumina | | Zhuolong Yu Bowen Su Wei Bai Shachar Raindel Vladimir Braverman | 2023/9/10 |
Halfmoon: Log-Optimal Fault-Tolerant Stateful Serverless Computing | | Sheng Qi Xuanzhe Liu Xin Jin | 2023/10/23 |
Muxflow: Efficient and safe GPU sharing in large-scale production deep learning clusters | arXiv preprint arXiv:2303.13803 | Yihao Zhao Xin Liu Shufan Liu Xiang Li Yibo Zhu | 2023/3/24 |