ProfessorsProfessors of University of California, BerkeleyYakun Sophia Shao

Yakun Sophia Shao

University of California, Berkeley

H-index: 25

North America-United States

About Yakun Sophia Shao

Yakun Sophia Shao, With an exceptional h-index of 25 and a recent h-index of 23 (since 2020), a distinguished researcher at University of California, Berkeley, specializes in the field of Computer Architecture, VLSI.

His recent articles reflect a diverse array of research interests and contributions to the field:

Deep neural network accelerator with fine-grained parallelism discovery

KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization

DOSA: Differentiable Model-Based One-Loop Search for DNN Accelerators

Cdpu: Co-designing compression and decompression processing units for hyperscale systems

RETROSPECTIVE: Aladdin: a Pre-RTL, Power-Performance Accelerator Simulator Enabling Large Design Space Exploration of Customized Architectures

Full stack optimization of transformer inference: a survey

Cost of Divergence in Ray Tracing: Performance Characterization on CPU and GPU

MoCA: Memory-centric, adaptive execution for multi-tenant deep neural networks

Yakun Sophia Shao Information

University	University of California, Berkeley
Position	Assistant Professor
Citations(all)	3256
Citations(since 2020)	2746
Cited By	1136
hIndex(all)	25
hIndex(since 2020)	23
i10Index(all)	32
i10Index(since 2020)	31
Email	Access Email
University Profile Page	University of California, Berkeley
Google Scholar	View Google Scholar Profile

Yakun Sophia Shao Skills & Research Interests

Computer Architecture

VLSI

Top articles of Yakun Sophia Shao

Title	Journal	Author(s)	Publication Date
Deep neural network accelerator with fine-grained parallelism discovery			2019/12/5
KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization	arXiv preprint arXiv:2401.18079	Coleman Hooper Sehoon Kim Hiva Mohammadzadeh Michael W Mahoney Yakun Sophia Shao ...	2024/1/31
DOSA: Differentiable Model-Based One-Loop Search for DNN Accelerators		Charles Hong Qijing Huang Grace Dinh Mahesh Subedar Yakun Sophia Shao	2023
Cdpu: Co-designing compression and decompression processing units for hyperscale systems		Sagar Karandikar Aniruddha N Udipi Junsun Choi Joonho Whangbo Jerry Zhao ...	2023/6/17
RETROSPECTIVE: Aladdin: a Pre-RTL, Power-Performance Accelerator Simulator Enabling Large Design Space Exploration of Customized Architectures		Yakun Sophia Shao Brandon Reagen Gu-Yeon Wei David Brooks	2023
Full stack optimization of transformer inference: a survey	arXiv preprint arXiv:2302.14017	Sehoon Kim Coleman Hooper Thanakul Wattanawong Minwoo Kang Ruohan Yan ...	2023/2/27
Cost of Divergence in Ray Tracing: Performance Characterization on CPU and GPU		Hansung Kim Angie Wang Sizhuo Zhang Yakun Sophia Shao	2023
MoCA: Memory-centric, adaptive execution for multi-tenant deep neural networks		Seah Kim Hasan Genc Vadim Vadimovich Nikiforov Krste Asanović Borivoje Nikolić ...	2023/2/25
Scalable multi-die deep learning system			2023/9/26
Enabling Scalable Heterogeneous Hardware Integration Co-simulation with Socket IPC		Ruohan Yan Zekai Lin Hansung Kim John Kubiatowicz Yakun Sophia Shao	2023
Code Transpilation for Hardware Accelerators	arXiv preprint arXiv:2308.06410	Yuto Nishida Sahil Bhatia Shadaj Laddad Hasan Genc Yakun Sophia Shao ...	2023/8/11
AuRORA: Virtualized Accelerator Orchestration for Multi-Tenant Workloads		Seah Kim Jerry Zhao Krste Asanović Borivoje Nikolić Yakun Sophia Shao	2023
Rosé: A hardware-software co-simulation infrastructure enabling pre-silicon full-stack robotics soc evaluation		Dima Nikiforov Shengjun Chris Dong Chengyi Lux Zhang Seah Kim Borivoje Nikolic ...	2023/6/17
Efficient Neural Network Accelerator Dataflows			2022/3/10
Accelerating General-Purpose Linear Algebra on DNN Accelerators		Alon Amid Hasan Genc Jerry Zhao Krste Asanovic Borivoje Nikolic ...	2022
Learning A Continuous and Reconstructible Latent Space for Hardware Accelerator Design		Qijing Huang Charles Hong John Wawrzynek Mahesh Subedar Yakun Sophia Shao	2022
Efficient emotion recognition using hyperdimensional computing with combinatorial channel encoding and cellular automata	Brain informatics	Alisha Menon Anirudh Natarajan Reva Agashe Daniel Sun Melvin Aristio ...	2022/12
Research infrastructures for hardware accelerators		Yakun Sophia Shao David Brooks	2022/5/31
CoSA: Scheduling by Constrained Optimization for Spatial Accelerators		Qijing Huang Minwoo Kang Grace Dinh Thomas Norell Aravind Kalaiah ...	2021/6
Simba: scaling deep-learning inference with chiplet-based architecture	Communications of the ACM	Yakun Sophia Shao Jason Cemons Rangharajan Venkatesan Brian Zimmer Matthew Fojtik ...	2021/5/24