Wen-Chin Huang at Nagoya University

University	Nagoya University
Citations(all)	1636
Citations(since 2020)	1632
Cited By	213
hIndex(all)	20
hIndex(since 2020)	20
i10Index(all)	25
i10Index(since 2020)	25

A Large-Scale Evaluation of Speech Foundation Models

IEEE/ACM Transactions on Audio, Speech, and Language Processing

2024/4/16

Heng-Jui Chang

H-Index: 0

Haibin Wu

H-Index: 6

Jiatong Shi

H-Index: 2

Xuankai Chang

H-Index: 11

Wen-Chin Huang

H-Index: 9

Po-Han Chi

H-Index: 3

Yung-Sung Chuang

H-Index: 2

Wei-Cheng Tseng

H-Index: 1

Abdelrahman Mohamed

H-Index: 2

Shinji Watanabe

H-Index: 45

Hung-Yi Lee

H-Index: 21

Electrolaryngeal Speech Intelligibility Enhancement through Robust Linguistic Encoders

2024/4/14

Wen-Chin Huang

H-Index: 9

Ding Ma

H-Index: 7

Kazuhiro Kobayashi

H-Index: 16

Tomoki Toda

H-Index: 34

A review on subjective and objective evaluation of synthetic speech

2024

A Comparative Study of Voice Conversion Models With Large-Scale Speech and Singing Data: The T13 Systems for the Singing Voice Conversion Challenge 2023

2023/12/16

Wen-Chin Huang

H-Index: 9

Tomoki Toda

H-Index: 34

The VoiceMOS Challenge 2023: zero-shot subjective speech quality prediction for multiple domains

2023/12/16

Wen-Chin Huang

H-Index: 9

Tomoki Toda

H-Index: 34

The Singing Voice Conversion Challenge 2023

2023/12/16

Wen-Chin Huang

H-Index: 9

Jiatong Shi

H-Index: 2

Tomoki Toda

H-Index: 34

Evaluating Methods for Ground-Truth-Free Foreign Accent Conversion

2023/10/31

Wen-Chin Huang

H-Index: 9

Tomoki Toda

H-Index: 34

AAS-VC: On the Generalization Ability of Automatic Alignment Search based Non-autoregressive Sequence-to-sequence Voice Conversion

arXiv preprint arXiv:2309.07598

2023/9/14

Wen-Chin Huang

H-Index: 9

Kazuhiro Kobayashi

H-Index: 16

Tomoki Toda

H-Index: 34

A holistic cascade system, benchmark, and human evaluation protocol for expressive speech-to-speech translation

2023/6/4

Wen-Chin Huang

H-Index: 9

Elizabeth Salesky

H-Index: 10

Ann Lee

H-Index: 8

Peng-Jen Chen

H-Index: 9

Intermediate Fine-Tuning Using Imperfect Synthetic Speech for Improving Electrolaryngeal Speech Recognition

2023/6/4

Ding Ma

H-Index: 7

Wen-Chin Huang

H-Index: 9

Tomoki Toda

H-Index: 34

Generalization ability of MOS prediction networks

2022/5/23

Wen-Chin Huang

H-Index: 9

Tomoki Toda

H-Index: 34

Investigating self-supervised pretraining frameworks for pathological speech recognition

arXiv preprint arXiv:2203.15431

2022/3/29

Wen-Chin Huang

H-Index: 9

Tomoki Toda

H-Index: 34

The VoiceMOS Challenge 2022

arXiv preprint arXiv:2203.11389

2022/3/21

Wen-Chin Huang

H-Index: 9

Tomoki Toda

H-Index: 34

SUPERB-SG: Enhanced speech processing universal PERformance benchmark for semantic and generative capabilities

arXiv preprint arXiv:2203.06849

2022/3/14

Heng-Jui Chang

H-Index: 0

Wen-Chin Huang

H-Index: 9

Cheng-I Jeff Lai

H-Index: 6

Jiatong Shi

H-Index: 2

Xuankai Chang

H-Index: 11

Shinji Watanabe

H-Index: 45

Abdelrahman Mohamed

H-Index: 2

Hung-Yi Lee

H-Index: 21

A comparative study of self-supervised speech representation based voice conversion

ICASSP 2023 Satellite Workshop: SASB 2023: Self-Supervision in Audio, Speech and Beyond

2023/8/20

Siyang Wang

H-Index: 2

Gustav Eje Henter

H-Index: 15

Joakim Gustafson

H-Index: 14

End-to-end binaural speech synthesis

arXiv preprint arXiv:2207.03697

2022/7/8

Dejan Markovic

H-Index: 33

LDNet: Unified listener dependent modeling in MOS prediction for synthetic speech

2022/5/23

Wen-Chin Huang

H-Index: 9

Tomoki Toda

H-Index: 34

Towards identity preserving normal to dysarthric voice conversion

2022/5/23

Wen-Chin Huang

H-Index: 9

Odette Scharenborg

H-Index: 17

Tomoki Toda

H-Index: 34

S3PRL-VC: Open-source voice conversion framework with self-supervised speech representations

2022/5/23

Wen-Chin Huang

H-Index: 9

Tomoki Hayashi

H-Index: 19

Hung-Yi Lee

H-Index: 21

Shinji Watanabe

H-Index: 45

Tomoki Toda

H-Index: 34

Investigation of Text-to-Speech-based Synthetic Parallel Data for Sequence-to-Sequence Non-Parallel Voice Conversion

2021/12/14

Ding Ma

H-Index: 7

Wen-Chin Huang

H-Index: 9

Tomoki Toda

H-Index: 34
