
Xin Jin

Peking University

H-index: 32

Asia-China

About Xin Jin

Xin Jin is an Associate Professor at Peking University who specializes in computer systems, computer networks, and cloud computing. He has an h-index of 32 overall and 28 since 2020.

His recent articles include:

DistMind: Efficient Resource Disaggregation for Deep Learning Workloads

SoCFlow: Efficient and Scalable DNN Training on SoC-Clustered Edge Servers

Unison: A Parallel-Efficient and User-Transparent Network Simulation Kernel

DistServe: Disaggregating Prefill and Decoding for Goodput-optimized Large Language Model Serving

RAGCache: Efficient Knowledge Caching for Retrieval-Augmented Generation

InternEvo: Efficient Long-sequence Large Language Model Training via Hybrid Parallelism and Redundant Sharding

A survey of resource-efficient LLM and multimodal foundation models

LoongServe: Efficiently Serving Long-context Large Language Models with Elastic Sequence Parallelism

Xin Jin Information

University	Peking University
Position	Associate Professor
Citations (all)	4907
Citations (since 2020)	3695
Cited By	2367
h-index (all)	32
h-index (since 2020)	28
i10-index (all)	55
i10-index (since 2020)	50

Xin Jin Skills & Research Interests

Computer Systems

Computer Networks

Cloud Computing

Top articles of Xin Jin

Title	Journal	Author(s)	Publication Date
DistMind: Efficient Resource Disaggregation for Deep Learning Workloads	IEEE/ACM Transactions on Networking	Xin Jin Zhihao Bai Zhen Zhang Yibo Zhu Yinmin Zhong ...	2024/1/24
SoCFlow: Efficient and Scalable DNN Training on SoC-Clustered Edge Servers		Daliang Xu Mengwei Xu Chiheng Lou Li Zhang Gang Huang ...	2024/4/27
Unison: A Parallel-Efficient and User-Transparent Network Simulation Kernel		Songyuan Bai Hao Zheng Chen Tian Xiaoliang Wang Chang Liu ...	2024/4/22
DistServe: Disaggregating Prefill and Decoding for Goodput-optimized Large Language Model Serving	arXiv preprint arXiv:2401.09670	Yinmin Zhong Shengyu Liu Junda Chen Jianbo Hu Yibo Zhu ...	2024/1/18
RAGCache: Efficient Knowledge Caching for Retrieval-Augmented Generation	arXiv preprint arXiv:2404.12457	Chao Jin Zili Zhang Xuanlin Jiang Fangyue Liu Xin Liu ...	2024/4/18
InternEvo: Efficient Long-sequence Large Language Model Training via Hybrid Parallelism and Redundant Sharding	arXiv preprint arXiv:2401.09149	Qiaoling Chen Diandian Gu Guoteng Wang Xun Chen YingTong Xiong ...	2024/1/17
A survey of resource-efficient LLM and multimodal foundation models	arXiv preprint arXiv:2401.08092	Mengwei Xu Wangsong Yin Dongqi Cai Rongjie Yi Daliang Xu ...	2024/1/16
LoongServe: Efficiently Serving Long-context Large Language Models with Elastic Sequence Parallelism	arXiv preprint arXiv:2404.09526	Bingyang Wu Shengyu Liu Yinmin Zhong Peng Sun Xuanzhe Liu ...	2024/4/15
Jolteon: Unleashing the Promise of Serverless for Serverless Workflows		Zili Zhang Chao Jin Xin Jin	2024
MegaScale: Scaling Large Language Model Training to More Than 10,000 GPUs	arXiv preprint arXiv:2402.15627	Ziheng Jiang Haibin Lin Yinmin Zhong Qi Huang Yangrui Chen ...	2024/2/23
ElasticFlow: An Elastic Serverless Training Platform for Distributed Deep Learning		Diandian Gu Yihao Zhao Yinmin Zhong Yifan Xiong Zhenhua Han ...	2023
Fast, Approximate Vector Queries on Very Large Unstructured Datasets		Zili Zhang Chao Jin Linpeng Tang Xuanzhe Liu Xin Jin	2023
FaaSLight: General Application-level Cold-start Latency Optimization for Function-as-a-Service in Serverless Computing	ACM Transactions on Software Engineering and Methodology	Xuanzhe Liu Jinfeng Wen Zhenpeng Chen Ding Li Junkai Chen ...	2023/7/22
Klotski: Efficient and Safe Network Migration of Large Production Datacenters		Yihao Zhao Xiaoxiang Zhang Hang Zhu Ying Zhang Zhaodong Wang ...	2023/9/10
Niagara: Scheduling DNN Inference Services on Heterogeneous Edge Processors		Daliang Xu Qing Li Mengwei Xu Kang Huang Gang Huang ...	2023/11/20
Transparent GPU Sharing in Container Clouds for Deep Learning Workloads		Bingyang Wu Zili Zhang Zhihao Bai Xuanzhe Liu Xin Jin	2023/4
Rise of the planet of serverless computing: A systematic review		Jinfeng Wen Zhenpeng Chen Xin Jin Xuanzhe Liu	2023/7/21
Understanding the Micro-Behaviors of Hardware Offloaded Network Stacks with Lumina		Zhuolong Yu Bowen Su Wei Bai Shachar Raindel Vladimir Braverman ...	2023/9/10
Halfmoon: Log-Optimal Fault-Tolerant Stateful Serverless Computing		Sheng Qi Xuanzhe Liu Xin Jin	2023/10/23
MuxFlow: Efficient and safe GPU sharing in large-scale production deep learning clusters	arXiv preprint arXiv:2303.13803	Yihao Zhao Xin Liu Shufan Liu Xiang Li Yibo Zhu ...	2023/3/24