Yakun Sophia Shao

Yakun Sophia Shao

University of California, Berkeley

H-index: 25

North America-United States

About Yakun Sophia Shao

Yakun Sophia Shao, With an exceptional h-index of 25 and a recent h-index of 23 (since 2020), a distinguished researcher at University of California, Berkeley, specializes in the field of Computer Architecture, VLSI.

His recent articles reflect a diverse array of research interests and contributions to the field:

Deep neural network accelerator with fine-grained parallelism discovery

KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization

DOSA: Differentiable Model-Based One-Loop Search for DNN Accelerators

Cdpu: Co-designing compression and decompression processing units for hyperscale systems

RETROSPECTIVE: Aladdin: a Pre-RTL, Power-Performance Accelerator Simulator Enabling Large Design Space Exploration of Customized Architectures

Full stack optimization of transformer inference: a survey

Cost of Divergence in Ray Tracing: Performance Characterization on CPU and GPU

MoCA: Memory-centric, adaptive execution for multi-tenant deep neural networks

Yakun Sophia Shao Information

University

Position

Assistant Professor

Citations(all)

3256

Citations(since 2020)

2746

Cited By

1136

hIndex(all)

25

hIndex(since 2020)

23

i10Index(all)

32

i10Index(since 2020)

31

Email

University Profile Page

University of California, Berkeley

Google Scholar

View Google Scholar Profile

Yakun Sophia Shao Skills & Research Interests

Computer Architecture

VLSI

Top articles of Yakun Sophia Shao

Title

Journal

Author(s)

Publication Date

Deep neural network accelerator with fine-grained parallelism discovery

2019/12/5

KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization

arXiv preprint arXiv:2401.18079

Coleman Hooper

Sehoon Kim

Hiva Mohammadzadeh

Michael W Mahoney

Yakun Sophia Shao

...

2024/1/31

DOSA: Differentiable Model-Based One-Loop Search for DNN Accelerators

Charles Hong

Qijing Huang

Grace Dinh

Mahesh Subedar

Yakun Sophia Shao

2023

Cdpu: Co-designing compression and decompression processing units for hyperscale systems

Sagar Karandikar

Aniruddha N Udipi

Junsun Choi

Joonho Whangbo

Jerry Zhao

...

2023/6/17

RETROSPECTIVE: Aladdin: a Pre-RTL, Power-Performance Accelerator Simulator Enabling Large Design Space Exploration of Customized Architectures

Yakun Sophia Shao

Brandon Reagen

Gu-Yeon Wei

David Brooks

2023

Full stack optimization of transformer inference: a survey

arXiv preprint arXiv:2302.14017

Sehoon Kim

Coleman Hooper

Thanakul Wattanawong

Minwoo Kang

Ruohan Yan

...

2023/2/27

Cost of Divergence in Ray Tracing: Performance Characterization on CPU and GPU

Hansung Kim

Angie Wang

Sizhuo Zhang

Yakun Sophia Shao

2023

MoCA: Memory-centric, adaptive execution for multi-tenant deep neural networks

Seah Kim

Hasan Genc

Vadim Vadimovich Nikiforov

Krste Asanović

Borivoje Nikolić

...

2023/2/25

Scalable multi-die deep learning system

2023/9/26

Enabling Scalable Heterogeneous Hardware Integration Co-simulation with Socket IPC

Ruohan Yan

Zekai Lin

Hansung Kim

John Kubiatowicz

Yakun Sophia Shao

2023

Code Transpilation for Hardware Accelerators

arXiv preprint arXiv:2308.06410

Yuto Nishida

Sahil Bhatia

Shadaj Laddad

Hasan Genc

Yakun Sophia Shao

...

2023/8/11

AuRORA: Virtualized Accelerator Orchestration for Multi-Tenant Workloads

Seah Kim

Jerry Zhao

Krste Asanović

Borivoje Nikolić

Yakun Sophia Shao

2023

Rosé: A hardware-software co-simulation infrastructure enabling pre-silicon full-stack robotics soc evaluation

Dima Nikiforov

Shengjun Chris Dong

Chengyi Lux Zhang

Seah Kim

Borivoje Nikolic

...

2023/6/17

Efficient Neural Network Accelerator Dataflows

2022/3/10

Accelerating General-Purpose Linear Algebra on DNN Accelerators

Alon Amid

Hasan Genc

Jerry Zhao

Krste Asanovic

Borivoje Nikolic

...

2022

Learning A Continuous and Reconstructible Latent Space for Hardware Accelerator Design

Qijing Huang

Charles Hong

John Wawrzynek

Mahesh Subedar

Yakun Sophia Shao

2022

Efficient emotion recognition using hyperdimensional computing with combinatorial channel encoding and cellular automata

Brain informatics

Alisha Menon

Anirudh Natarajan

Reva Agashe

Daniel Sun

Melvin Aristio

...

2022/12

Research infrastructures for hardware accelerators

Yakun Sophia Shao

David Brooks

2022/5/31

CoSA: Scheduling by Constrained Optimization for Spatial Accelerators

Qijing Huang

Minwoo Kang

Grace Dinh

Thomas Norell

Aravind Kalaiah

...

2021/6

Simba: scaling deep-learning inference with chiplet-based architecture

Communications of the ACM

Yakun Sophia Shao

Jason Cemons

Rangharajan Venkatesan

Brian Zimmer

Matthew Fojtik

...

2021/5/24

See List of Professors in Yakun Sophia Shao University(University of California, Berkeley)

Co-Authors

H-index: 107
William Dally

William Dally

Stanford University

H-index: 71
Joel Emer

Joel Emer

Massachusetts Institute of Technology

H-index: 66
David Brooks

David Brooks

Harvard University

H-index: 59
Gu-Yeon Wei

Gu-Yeon Wei

Harvard University

H-index: 23
Brandon Reagen

Brandon Reagen

New York University

H-index: 11
Bob Adolf

Bob Adolf

Harvard University

academic-engine