Xinjian Li
Carnegie Mellon University
H-index: 12
North America-United States
Top articles of Xinjian Li
Title | Journal | Author(s) | Publication Date |
---|---|---|---|
Text-Inductive Graphone-Based Language Adaptation for Low-Resource Speech Synthesis | IEEE/ACM Transactions on Audio, Speech, and Language Processing | Takaaki Saeki Soumi Maiti Xinjian Li Shinji Watanabe Shinnosuke Takamichi | 2024/2/23 |
Learning to speak from text: Zero-shot multilingual text-to-speech with unsupervised text pretraining | arXiv preprint arXiv:2301.12596 | Takaaki Saeki Soumi Maiti Xinjian Li Shinji Watanabe Shinnosuke Takamichi | 2023/1/30 |
Yodas: Youtube-Oriented Dataset for Audio and Speech | | Xinjian Li Shinnosuke Takamichi Takaaki Saeki William Chen Sayaka Shiota | 2023/12/16 |
Reproducing whisper-style training using an open-source toolkit and publicly available data | | Yifan Peng Jinchuan Tian Brian Yan Dan Berrebbi Xuankai Chang | 2023/12/16 |
CMU’s IWSLT 2023 simultaneous speech translation system | | Brian Yan Jiatong Shi Soumi Maiti William Chen Xinjian Li | 2023/7 |
Phone Inventories and Recognition for Every Language | | Xinjian Li Florian Metze David R Mortensen Alan W Black Shinji Watanabe | 2022 |
Textless Direct Speech-to-Speech Translation with Discrete Speech Representation | ICASSP 2023 | Xinjian Li Ye Jia Chung-Cheng Chiu | 2022/10/31 |
ASR2K: Speech Recognition for Around 2000 Languages without Audio | Interspeech 2022 | Xinjian Li Florian Metze David R Mortensen Alan W Black Shinji Watanabe | 2022/9/6 |
On adversarial robustness of large-scale audio visual learning | | Juncheng B Li Shuhui Qu Xinjian Li Po-Yao Bernie Huang Florian Metze | 2022/5/23 |
Zero-shot learning for grapheme to phoneme conversion with language ensemble | | Xinjian Li Florian Metze David R Mortensen Shinji Watanabe Alan W Black | 2022/5 |
Phoneme Recognition through Fine Tuning of Phonetic Representations: a Case Study on Luhya Language Varieties | Interspeech 2021 | Kathleen Siminyu Xinjian Li Antonios Anastasopoulos David Mortensen Michael R Marlo | 2021/4/4 |
On Prosody Modeling for ASR+TTS based Voice Conversion | ASRU 2021 | Wen-Chin Huang Tomoki Hayashi Xinjian Li Shinji Watanabe Tomoki Toda | 2021/7/20 |
Tusom2021: A Phonetically Transcribed Speech Dataset from an Endangered Language for Universal Phone Recognition Experiments | Interspeech 2021 | David R Mortensen Jordan Picone Xinjian Li Kathleen Siminyu | 2021/4/2 |
Multilingual phonetic dataset for low resource speech recognition | | Xinjian Li David R Mortensen Florian Metze Alan W Black | 2021/6/6 |
Hierarchical Phone Recognition with Compositional Phonetics | Proc. Interspeech 2021 | Xinjian Li Juncheng Li Florian Metze Alan W Black | 2021 |
Phone distribution estimation for low resource languages | | Xinjian Li Juncheng Li Jiali Yao Alan W Black Florian Metze | 2021/6/6 |
Acoustics based intent recognition using discovered phonetic units for low resource languages | | Akshat Gupta Xinjian Li Sai Krishna Rallabandi Alan W Black | 2021/6/6 |
A summary of the first workshop on language technology for language documentation and revitalization | arXiv preprint arXiv:2004.13203 | Graham Neubig Shruti Rijhwani Alexis Palmer Jordan MacKenzie Hilaria Cruz | 2020/4/27 |
AlloVera: a multilingual allophone database | | David R Mortensen Xinjian Li Patrick Littell Alexis Michaud Shruti Rijhwani | 2020/4/17 |
Universal phone recognition with a multilingual allophone system | | Xinjian Li Siddharth Dalmia Juncheng Li Matthew Lee Patrick Littell | 2020/2/26 |