ProfessorsProfessors of Johns Hopkins UniversityJiatong Shi (史嘉彤)

Jiatong Shi (史嘉彤)

Johns Hopkins University

H-index: 15

North America-United States

About Jiatong Shi (史嘉彤)

Jiatong Shi (史嘉彤), With an exceptional h-index of 15 and a recent h-index of 15 (since 2020), a distinguished researcher at Johns Hopkins University, specializes in the field of Speech Processing, Speech Recognition, Music Processing.

His recent articles reflect a diverse array of research interests and contributions to the field:

Singing Voice Data Scaling-up: An Introduction to ACE-Opencpop and KiSing-v2

A Large-Scale Evaluation of Speech Foundation Models

OWSM v3. 1: Better and Faster Open Whisper-Style Speech Models based on E-Branchformer

Hubertopic: Enhancing Semantic Representation of Hubert Through Self-Supervision Utilizing Topic Model

ESPnet-SPK: full pipeline speaker embedding toolkit with reproducible recipes, self-supervised front-ends, and off-the-shelf models

Exploring speech recognition, translation, and understanding with discrete speech units: A comparative study

Dynamic-superb: Towards a dynamic, collaborative, and comprehensive instruction-tuning benchmark for speech

Audiogpt: Understanding and generating speech, music, sound, and talking head

Jiatong Shi (史嘉彤) Information

University	Johns Hopkins University
Position	The
Citations(all)	1490
Citations(since 2020)	1490
Cited By	15
hIndex(all)	15
hIndex(since 2020)	15
i10Index(all)	22
i10Index(since 2020)	22
Email	Access Email
University Profile Page	Johns Hopkins University
Google Scholar	View Google Scholar Profile

Jiatong Shi (史嘉彤) Skills & Research Interests

Speech Processing

Speech Recognition

Music Processing

Top articles of Jiatong Shi (史嘉彤)

Title	Journal	Author(s)	Publication Date
Singing Voice Data Scaling-up: An Introduction to ACE-Opencpop and KiSing-v2	arXiv preprint arXiv:2401.17619	Jiatong Shi Yueqian Lin Xinyi Bai Keyi Zhang Yuning Wu ...	2024/1/31
A Large-Scale Evaluation of Speech Foundation Models	IEEE/ACM Transactions on Audio, Speech, and Language Processing	Shu-wen Yang Heng-Jui Chang Zili Huang Andy T Liu Cheng-I Lai ...	2024/4/16
OWSM v3. 1: Better and Faster Open Whisper-Style Speech Models based on E-Branchformer	arXiv preprint arXiv:2401.16658	Yifan Peng Jinchuan Tian William Chen Siddhant Arora Brian Yan ...	2024/1/30
Hubertopic: Enhancing Semantic Representation of Hubert Through Self-Supervision Utilizing Topic Model		Takashi Maekaku Jiatong Shi Xuankai Chang Yuya Fujita Shinji Watanabe	2024/4/14
ESPnet-SPK: full pipeline speaker embedding toolkit with reproducible recipes, self-supervised front-ends, and off-the-shelf models	arXiv preprint arXiv:2401.17230	Jee-weon Jung Wangyou Zhang Jiatong Shi Zakaria Aldeneh Takuya Higuchi ...	2024/1/30
Exploring speech recognition, translation, and understanding with discrete speech units: A comparative study		Xuankai Chang Brian Yan Kwanghee Choi Jee-Weon Jung Yichen Lu ...	2024/4/14
Dynamic-superb: Towards a dynamic, collaborative, and comprehensive instruction-tuning benchmark for speech		Chien-yu Huang Ke-Han Lu Shih-Heng Wang Chi-Yuan Hsiao Chun-Yi Kuan ...	2024/4/14
Audiogpt: Understanding and generating speech, music, sound, and talking head	Proceedings of the AAAI Conference on Artificial Intelligence	Rongjie Huang Mingze Li Dongchao Yang Jiatong Shi Xuankai Chang ...	2024/3/24
ESPnet-ST-v2: Multipurpose Spoken Language Translation Toolkit	arXiv preprint arXiv:2304.04596	Brian Yan Jiatong Shi Yun Tang Hirofumi Inaguma Yifan Peng ...	2023/4/10
Improving cascaded unsupervised speech translation with denoising back-translation	arXiv preprint arXiv:2305.07455	Yu-Kuan Fu Liang-Hsuan Tseng Jiatong Shi Chen-An Li Tsu-Yuan Hsu ...	2023/5/12
Phoneix: Acoustic Feature Processing Strategy for Enhanced Singing Pronunciation With Phoneme Distribution Predictor		Yuning Wu Jiatong Shi Tao Qian Dongji Gao Qin Jin	2023/6/4
Reproducing whisper-style training using an open-source toolkit and publicly available data		Yifan Peng Jinchuan Tian Brian Yan Dan Berrebbi Xuankai Chang ...	2023/12/16
Joint prediction and denoising for large-scale multilingual self-supervised learning	arXiv preprint arXiv:2309.15317	William Chen Jiatong Shi Brian Yan Dan Berrebbi Wangyou Zhang ...	2023/9/26
SUPERB@ SLT 2022: Challenge on generalization and efficiency of self-supervised speech representation learning		Tzu-hsun Feng Annie Dong Ching-Feng Yeh Shu-wen Yang Tzu-Quan Lin ...	2023/1/9
Improving massively multilingual ASR with auxiliary CTC objectives		William Chen Brian Yan Jiatong Shi Yifan Peng Soumi Maiti ...	2023/6/4
The singing voice conversion challenge 2023		Wen-Chin Huang Lester Phillip Violeta Songxiang Liu Jiatong Shi Tomoki Toda	2023/12/16
A Systematic Exploration of Joint-training for Singing Voice Synthesis	arXiv preprint arXiv:2308.02867	Yuning Wu Yifeng Yu Jiatong Shi Tao Qian Qin Jin	2023/8/5
On compressing sequences for self-supervised speech models		Yen Meng Hsuan-Jui Chen Jiatong Shi Shinji Watanabe Paola Garcia ...	2023/1/9
EFFUSE: Efficient self-supervised feature fusion for E2E ASR in multilingual and low resource scenarios	arXiv preprint arXiv:2310.03938	Tejes Srivastava Jiatong Shi William Chen Shinji Watanabe	2023/10/5
EURO: ESPnet unsupervised asr open-source toolkit		Dongji Gao Jiatong Shi Shun-Po Chuang Leibny Paola Garcia Hung-yi Lee ...	2023/6/4