Shuai Wang
Shanghai Jiao Tong University
H-index: 22
Asia-China
Top articles of Shuai Wang
Title | Journal | Author(s) | Publication Date |
---|---|---|---|
Audio-Visual Active Speaker Extraction for Sparsely Overlapped Multi-Talker Speech | | Junjie Li Ruijie Tao Zexu Pan Meng Ge Shuai Wang | 2024/4/14 |
The X-LANCE Technical Report for Interspeech 2024 Speech Processing Using Discrete Speech Unit Challenge | arXiv preprint arXiv:2404.06079 | Yiwei Guo Chenrun Wang Yifan Yang Hankun Wang Ziyang Ma | 2024/4/9 |
Robust Cross-Domain Speaker Verification with Multi-Level Domain Adapters | | Wen Huang Bing Han Shuai Wang Zhengyang Chen Yanmin Qian | 2024/4/14 |
AutoPrep: An Automatic Preprocessing Framework for In-The-Wild Speech Data | arXiv preprint arXiv:2309.13905 | Jianwei Yu Hangting Chen Yanyao Bian Xiang Li Yi Luo | 2023/9/25 |
Attention-based encoder-decoder end-to-end neural diarization with embedding enhancer | IEEE/ACM Transactions on Audio, Speech, and Language Processing | Zhengyang Chen Bing Han Shuai Wang Yanmin Qian | 2024/2/16 |
VALL-T: Decoder-Only Generative Transducer for Robust and Decoding-Controllable Text-to-Speech | arXiv preprint arXiv:2401.14321 | Chenpeng Du Yiwei Guo Hankun Wang Yifan Yang Zhikang Niu | 2024/1/25 |
DualVC 2: Dynamic masked convolution for unified streaming and non-streaming voice conversion | | Ziqian Ning Yuepeng Jiang Pengcheng Zhu Shuai Wang Jixun Yao | 2024/4/14 |
Advancing Speaker Embedding Learning: Wespeaker Toolkit for Research and Production | Available at SSRN 4748855 | Shuai Wang Zhengyang Chen Bing Han Hongji Wang Chengdong Liang | 2024 |
Leveraging in-the-wild Data for Effective Self-supervised Pretraining in Speaker Recognition | | Shuai Wang Qibing Bai Qi Liu Jianwei Yu Zhengyang Chen | 2024/4/14 |
Attention-based Encoder-Decoder Network for End-to-End Neural Speaker Diarization with Target Speaker Attractor | arXiv preprint arXiv:2305.10704 | Zhengyang Chen Bing Han Shuai Wang Yanmin Qian | 2023/5/18 |
USED: Universal Speaker Extraction and Diarization | arXiv preprint arXiv:2309.10674 | Junyi Ao Mehmet Sinan Yıldırım Meng Ge Shuai Wang Ruijie Tao | 2023/9/19 |
Adversarial Speaker Disentanglement Using Unannotated External Data for Self-supervised Representation Based Voice Conversion | | Xintao Zhao Shuai Wang Yang Chao Zhiyong Wu Helen Meng | 2023/7/10 |
Wespeaker baselines for VoxSRC2023 | arXiv preprint arXiv:2306.15161 | Shuai Wang Chengdong Liang Xu Xiang Bing Han Zhengyang Chen | 2023/6/27 |
UniCATS: A Unified Context-Aware Text-to-Speech Framework with Contextual VQ-Diffusion and Vocoding | Proceedings of the AAAI Conference on Artificial Intelligence | Chenpeng Du Yiwei Guo Feiyu Shen Zhijun Liu Zheng Liang | 2024/3/24 |
Wespeaker: A research and production oriented speaker embedding learning toolkit | | Hongji Wang Chengdong Liang Shuai Wang Zhengyang Chen Binbin Zhang | 2023/6/4 |
DualVC: Dual-mode Voice Conversion using Intra-model Knowledge Distillation and Hybrid Predictive Coding | arXiv preprint arXiv:2305.12425 | Ziqian Ning Yuepeng Jiang Pengcheng Zhu Jixun Yao Shuai Wang | 2023/5/21 |
Context-aware Multimodal Fusion for Emotion Recognition. | | Jinchao Li Shuai Wang Yang Chao Xunying Liu Helen Meng | 2022 |
On the Importance of Different Frequency Bins for Speaker Verification | | Aiwen Deng Shuai Wang Wenxiong Kang Feiqi Deng | 2022/5/23 |
Self-Knowledge Distillation via Feature Enhancement for Speaker Verification | | Bei Liu Haoyu Wang Zhengyang Chen Shuai Wang Yanmin Qian | 2022/5/23 |
DF-ResNet: Boosting Speaker Verification Performance with Depth-First Design. | | Bei Liu Zhengyang Chen Shuai Wang Haoyu Wang Bing Han | 2022 |