Jiatong Shi (史嘉彤)
Johns Hopkins University
H-index: 15
North America-United States
Top articles of Jiatong Shi (史嘉彤)
Title | Journal | Author(s) | Publication Date |
---|---|---|---|
Singing Voice Data Scaling-up: An Introduction to ACE-Opencpop and KiSing-v2 | arXiv preprint arXiv:2401.17619 | Jiatong Shi Yueqian Lin Xinyi Bai Keyi Zhang Yuning Wu | 2024/1/31 |
A Large-Scale Evaluation of Speech Foundation Models | IEEE/ACM Transactions on Audio, Speech, and Language Processing | Shu-wen Yang Heng-Jui Chang Zili Huang Andy T Liu Cheng-I Lai | 2024/4/16 |
OWSM v3. 1: Better and Faster Open Whisper-Style Speech Models based on E-Branchformer | arXiv preprint arXiv:2401.16658 | Yifan Peng Jinchuan Tian William Chen Siddhant Arora Brian Yan | 2024/1/30 |
Hubertopic: Enhancing Semantic Representation of Hubert Through Self-Supervision Utilizing Topic Model | Takashi Maekaku Jiatong Shi Xuankai Chang Yuya Fujita Shinji Watanabe | 2024/4/14 | |
ESPnet-SPK: full pipeline speaker embedding toolkit with reproducible recipes, self-supervised front-ends, and off-the-shelf models | arXiv preprint arXiv:2401.17230 | Jee-weon Jung Wangyou Zhang Jiatong Shi Zakaria Aldeneh Takuya Higuchi | 2024/1/30 |
Exploring speech recognition, translation, and understanding with discrete speech units: A comparative study | Xuankai Chang Brian Yan Kwanghee Choi Jee-Weon Jung Yichen Lu | 2024/4/14 | |
Dynamic-superb: Towards a dynamic, collaborative, and comprehensive instruction-tuning benchmark for speech | Chien-yu Huang Ke-Han Lu Shih-Heng Wang Chi-Yuan Hsiao Chun-Yi Kuan | 2024/4/14 | |
Audiogpt: Understanding and generating speech, music, sound, and talking head | Proceedings of the AAAI Conference on Artificial Intelligence | Rongjie Huang Mingze Li Dongchao Yang Jiatong Shi Xuankai Chang | 2024/3/24 |
ESPnet-ST-v2: Multipurpose Spoken Language Translation Toolkit | arXiv preprint arXiv:2304.04596 | Brian Yan Jiatong Shi Yun Tang Hirofumi Inaguma Yifan Peng | 2023/4/10 |
Improving cascaded unsupervised speech translation with denoising back-translation | arXiv preprint arXiv:2305.07455 | Yu-Kuan Fu Liang-Hsuan Tseng Jiatong Shi Chen-An Li Tsu-Yuan Hsu | 2023/5/12 |
Phoneix: Acoustic Feature Processing Strategy for Enhanced Singing Pronunciation With Phoneme Distribution Predictor | Yuning Wu Jiatong Shi Tao Qian Dongji Gao Qin Jin | 2023/6/4 | |
Reproducing whisper-style training using an open-source toolkit and publicly available data | Yifan Peng Jinchuan Tian Brian Yan Dan Berrebbi Xuankai Chang | 2023/12/16 | |
Joint prediction and denoising for large-scale multilingual self-supervised learning | arXiv preprint arXiv:2309.15317 | William Chen Jiatong Shi Brian Yan Dan Berrebbi Wangyou Zhang | 2023/9/26 |
SUPERB@ SLT 2022: Challenge on generalization and efficiency of self-supervised speech representation learning | Tzu-hsun Feng Annie Dong Ching-Feng Yeh Shu-wen Yang Tzu-Quan Lin | 2023/1/9 | |
Improving massively multilingual ASR with auxiliary CTC objectives | William Chen Brian Yan Jiatong Shi Yifan Peng Soumi Maiti | 2023/6/4 | |
The singing voice conversion challenge 2023 | Wen-Chin Huang Lester Phillip Violeta Songxiang Liu Jiatong Shi Tomoki Toda | 2023/12/16 | |
A Systematic Exploration of Joint-training for Singing Voice Synthesis | arXiv preprint arXiv:2308.02867 | Yuning Wu Yifeng Yu Jiatong Shi Tao Qian Qin Jin | 2023/8/5 |
On compressing sequences for self-supervised speech models | Yen Meng Hsuan-Jui Chen Jiatong Shi Shinji Watanabe Paola Garcia | 2023/1/9 | |
EFFUSE: Efficient self-supervised feature fusion for E2E ASR in multilingual and low resource scenarios | arXiv preprint arXiv:2310.03938 | Tejes Srivastava Jiatong Shi William Chen Shinji Watanabe | 2023/10/5 |
EURO: ESPnet unsupervised asr open-source toolkit | Dongji Gao Jiatong Shi Shun-Po Chuang Leibny Paola Garcia Hung-yi Lee | 2023/6/4 |