Wen-Chin Huang
Nagoya University
H-index: 20
Asia-Japan
Top articles of Wen-Chin Huang
A Large-Scale Evaluation of Speech Foundation Models
IEEE/ACM Transactions on Audio, Speech, and Language Processing
2024/4/16
Electrolaryngeal Speech Intelligibility Enhancement through Robust Linguistic Encoders
2024/4/14
A review on subjective and objective evaluation of synthetic speech
2024
A Comparative Study of Voice Conversion Models With Large-Scale Speech and Singing Data: The T13 Systems for the Singing Voice Conversion Challenge 2023
2023/12/16
Wen-Chin Huang
H-Index: 9
Tomoki Toda
H-Index: 34
The VoiceMOS Challenge 2023: zero-shot subjective speech quality prediction for multiple domains
2023/12/16
Wen-Chin Huang
H-Index: 9
Tomoki Toda
H-Index: 34
The singing voice conversion challenge 2023
2023/12/16
Evaluating Methods for Ground-Truth-Free Foreign Accent Conversion
2023/10/31
Wen-Chin Huang
H-Index: 9
Tomoki Toda
H-Index: 34
AAS-VC: On the Generalization Ability of Automatic Alignment Search based Non-autoregressive Sequence-to-sequence Voice Conversion
arXiv preprint arXiv:2309.07598
2023/9/14
A holistic cascade system, benchmark, and human evaluation protocol for expressive speech-to-speech translation
2023/6/4
Intermediate Fine-Tuning Using Imperfect Synthetic Speech for Improving Electrolaryngeal Speech Recognition
2023/6/4
Generalization ability of mos prediction networks
2022/5/23
Wen-Chin Huang
H-Index: 9
Tomoki Toda
H-Index: 34
Investigating self-supervised pretraining frameworks for pathological speech recognition
arXiv preprint arXiv:2203.15431
2022/3/29
Wen-Chin Huang
H-Index: 9
Tomoki Toda
H-Index: 34
The voicemos challenge 2022
arXiv preprint arXiv:2203.11389
2022/3/21
Wen-Chin Huang
H-Index: 9
Tomoki Toda
H-Index: 34
SUPERB-SG: Enhanced speech processing universal PERformance benchmark for semantic and generative capabilities
arXiv preprint arXiv:2203.06849
2022/3/14
A comparative study of self-supervised speech representation based voice conversion
CASSP 2023 Satellite Workshop: SASB 2023: Self-Supervision in Audio, Speech and Beyond
2023/8/20
End-to-end binaural speech synthesis
arXiv preprint arXiv:2207.03697
2022/7/8
Ldnet: Unified listener dependent modeling in mos prediction for synthetic speech
2022/5/23
Wen-Chin Huang
H-Index: 9
Tomoki Toda
H-Index: 34
Towards identity preserving normal to dysarthric voice conversion
2022/5/23
S3prl-vc: Open-source voice conversion framework with self-supervised speech representations
2022/5/23
Wen-Chin Huang
H-Index: 9
Tomoki Hayashi
H-Index: 19
Hung-Yi Lee
H-Index: 21
Shinji Watanabe
H-Index: 45
Tomoki Toda
H-Index: 34
Investigation of Text-to-Speech-based Synthetic Parallel Data for Sequence-to-Sequence Non-Parallel Voice Conversion
2021/12/14