Jiatong Shi (史嘉彤)

Johns Hopkins University

H-index: 15

North America-United States

About Jiatong Shi (史嘉彤)

Jiatong Shi (史嘉彤) is a researcher at Johns Hopkins University who specializes in speech processing, speech recognition, and music processing. He has an h-index of 15 overall and 15 since 2020.

His recent articles reflect a diverse array of research interests and contributions to the field:

Singing Voice Data Scaling-up: An Introduction to ACE-Opencpop and KiSing-v2

A Large-Scale Evaluation of Speech Foundation Models

OWSM v3.1: Better and Faster Open Whisper-Style Speech Models based on E-Branchformer

HuBERTopic: Enhancing Semantic Representation of HuBERT Through Self-Supervision Utilizing Topic Model

ESPnet-SPK: full pipeline speaker embedding toolkit with reproducible recipes, self-supervised front-ends, and off-the-shelf models

Exploring speech recognition, translation, and understanding with discrete speech units: A comparative study

Dynamic-SUPERB: Towards a dynamic, collaborative, and comprehensive instruction-tuning benchmark for speech

AudioGPT: Understanding and generating speech, music, sound, and talking head

Jiatong Shi (史嘉彤) Information

University

Position

The

Citations (all): 1490

Citations (since 2020): 1490

Cited By: 15

h-index (all): 15

h-index (since 2020): 15

i10-index (all): 22

i10-index (since 2020): 22

Email

University Profile Page

Johns Hopkins University

Jiatong Shi (史嘉彤) Skills & Research Interests

Speech Processing

Speech Recognition

Music Processing

Top articles of Jiatong Shi (史嘉彤)

Title: Singing Voice Data Scaling-up: An Introduction to ACE-Opencpop and KiSing-v2
Journal: arXiv preprint arXiv:2401.17619
Author(s): Jiatong Shi, Yueqian Lin, Xinyi Bai, Keyi Zhang, Yuning Wu, ...
Publication Date: 2024/1/31

Title: A Large-Scale Evaluation of Speech Foundation Models
Journal: IEEE/ACM Transactions on Audio, Speech, and Language Processing
Author(s): Shu-wen Yang, Heng-Jui Chang, Zili Huang, Andy T Liu, Cheng-I Lai, ...
Publication Date: 2024/4/16

Title: OWSM v3.1: Better and Faster Open Whisper-Style Speech Models based on E-Branchformer
Journal: arXiv preprint arXiv:2401.16658
Author(s): Yifan Peng, Jinchuan Tian, William Chen, Siddhant Arora, Brian Yan, ...
Publication Date: 2024/1/30

Title: HuBERTopic: Enhancing Semantic Representation of HuBERT Through Self-Supervision Utilizing Topic Model
Author(s): Takashi Maekaku, Jiatong Shi, Xuankai Chang, Yuya Fujita, Shinji Watanabe
Publication Date: 2024/4/14

Title: ESPnet-SPK: full pipeline speaker embedding toolkit with reproducible recipes, self-supervised front-ends, and off-the-shelf models
Journal: arXiv preprint arXiv:2401.17230
Author(s): Jee-weon Jung, Wangyou Zhang, Jiatong Shi, Zakaria Aldeneh, Takuya Higuchi, ...
Publication Date: 2024/1/30

Title: Exploring speech recognition, translation, and understanding with discrete speech units: A comparative study
Author(s): Xuankai Chang, Brian Yan, Kwanghee Choi, Jee-Weon Jung, Yichen Lu, ...
Publication Date: 2024/4/14

Title: Dynamic-SUPERB: Towards a dynamic, collaborative, and comprehensive instruction-tuning benchmark for speech
Author(s): Chien-yu Huang, Ke-Han Lu, Shih-Heng Wang, Chi-Yuan Hsiao, Chun-Yi Kuan, ...
Publication Date: 2024/4/14

Title: AudioGPT: Understanding and generating speech, music, sound, and talking head
Journal: Proceedings of the AAAI Conference on Artificial Intelligence
Author(s): Rongjie Huang, Mingze Li, Dongchao Yang, Jiatong Shi, Xuankai Chang, ...
Publication Date: 2024/3/24

Title: ESPnet-ST-v2: Multipurpose Spoken Language Translation Toolkit
Journal: arXiv preprint arXiv:2304.04596
Author(s): Brian Yan, Jiatong Shi, Yun Tang, Hirofumi Inaguma, Yifan Peng, ...
Publication Date: 2023/4/10

Title: Improving cascaded unsupervised speech translation with denoising back-translation
Journal: arXiv preprint arXiv:2305.07455
Author(s): Yu-Kuan Fu, Liang-Hsuan Tseng, Jiatong Shi, Chen-An Li, Tsu-Yuan Hsu, ...
Publication Date: 2023/5/12

Title: Phoneix: Acoustic Feature Processing Strategy for Enhanced Singing Pronunciation With Phoneme Distribution Predictor
Author(s): Yuning Wu, Jiatong Shi, Tao Qian, Dongji Gao, Qin Jin
Publication Date: 2023/6/4

Title: Reproducing Whisper-style training using an open-source toolkit and publicly available data
Author(s): Yifan Peng, Jinchuan Tian, Brian Yan, Dan Berrebbi, Xuankai Chang, ...
Publication Date: 2023/12/16

Title: Joint prediction and denoising for large-scale multilingual self-supervised learning
Journal: arXiv preprint arXiv:2309.15317
Author(s): William Chen, Jiatong Shi, Brian Yan, Dan Berrebbi, Wangyou Zhang, ...
Publication Date: 2023/9/26

Title: SUPERB @ SLT 2022: Challenge on generalization and efficiency of self-supervised speech representation learning
Author(s): Tzu-hsun Feng, Annie Dong, Ching-Feng Yeh, Shu-wen Yang, Tzu-Quan Lin, ...
Publication Date: 2023/1/9

Title: Improving massively multilingual ASR with auxiliary CTC objectives
Author(s): William Chen, Brian Yan, Jiatong Shi, Yifan Peng, Soumi Maiti, ...
Publication Date: 2023/6/4

Title: The singing voice conversion challenge 2023
Author(s): Wen-Chin Huang, Lester Phillip Violeta, Songxiang Liu, Jiatong Shi, Tomoki Toda
Publication Date: 2023/12/16

Title: A Systematic Exploration of Joint-training for Singing Voice Synthesis
Journal: arXiv preprint arXiv:2308.02867
Author(s): Yuning Wu, Yifeng Yu, Jiatong Shi, Tao Qian, Qin Jin
Publication Date: 2023/8/5

Title: On compressing sequences for self-supervised speech models
Author(s): Yen Meng, Hsuan-Jui Chen, Jiatong Shi, Shinji Watanabe, Paola Garcia, ...
Publication Date: 2023/1/9

Title: EFFUSE: Efficient self-supervised feature fusion for E2E ASR in multilingual and low resource scenarios
Journal: arXiv preprint arXiv:2310.03938
Author(s): Tejes Srivastava, Jiatong Shi, William Chen, Shinji Watanabe
Publication Date: 2023/10/5

Title: EURO: ESPnet unsupervised ASR open-source toolkit
Author(s): Dongji Gao, Jiatong Shi, Shun-Po Chuang, Leibny Paola Garcia, Hung-yi Lee, ...
Publication Date: 2023/6/4

Co-Authors

H-index: 74
Shinji Watanabe

Carnegie Mellon University

H-index: 47
Hung-yi Lee

National Taiwan University

H-index: 31
Tomoki Hayashi

Nagoya University

H-index: 23
Xuankai Chang

Carnegie Mellon University

H-index: 18
Hirofumi Inaguma

Kyoto University

H-index: 13
Peter Wu

Carnegie Mellon University
