Shuai Wang

Shuai Wang

Shanghai Jiao Tong University

H-index: 22

Asia-China

About Shuai Wang

Shuai Wang, With an exceptional h-index of 22 and a recent h-index of 22 (since 2020), a distinguished researcher at Shanghai Jiao Tong University, specializes in the field of speaker recognition, deep learning, speech processing.

His recent articles reflect a diverse array of research interests and contributions to the field:

Audio-Visual Active Speaker Extraction for Sparsely Overlapped Multi-Talker Speech

The X-LANCE Technical Report for Interspeech 2024 Speech Processing Using Discrete Speech Unit Challenge

Robust Cross-Domain Speaker Verification with Multi-Level Domain Adapters

AutoPrep: An Automatic Preprocessing Framework for In-The-Wild Speech Data

Attention-based encoder-decoder end-to-end neural diarization with embedding enhancer

VALL-T: Decoder-Only Generative Transducer for Robust and Decoding-Controllable Text-to-Speech

Dualvc 2: Dynamic masked convolution for unified streaming and non-streaming voice conversion

Advancing Speaker Embedding Learning: Wespeaker Toolkit for Research and Production

Shuai Wang Information

University

Position

___

Citations(all)

1492

Citations(since 2020)

1460

Cited By

458

hIndex(all)

22

hIndex(since 2020)

22

i10Index(all)

35

i10Index(since 2020)

35

Email

University Profile Page

Shanghai Jiao Tong University

Google Scholar

View Google Scholar Profile

Shuai Wang Skills & Research Interests

speaker recognition

deep learning

speech processing

Top articles of Shuai Wang

Title

Journal

Author(s)

Publication Date

Audio-Visual Active Speaker Extraction for Sparsely Overlapped Multi-Talker Speech

Junjie Li

Ruijie Tao

Zexu Pan

Meng Ge

Shuai Wang

...

2024/4/14

The X-LANCE Technical Report for Interspeech 2024 Speech Processing Using Discrete Speech Unit Challenge

arXiv preprint arXiv:2404.06079

Yiwei Guo

Chenrun Wang

Yifan Yang

Hankun Wang

Ziyang Ma

...

2024/4/9

Robust Cross-Domain Speaker Verification with Multi-Level Domain Adapters

Wen Huang

Bing Han

Shuai Wang

Zhengyang Chen

Yanmin Qian

2024/4/14

AutoPrep: An Automatic Preprocessing Framework for In-The-Wild Speech Data

arXiv preprint arXiv:2309.13905

Jianwei Yu

Hangting Chen

Yanyao Bian

Xiang Li

Yi Luo

...

2023/9/25

Attention-based encoder-decoder end-to-end neural diarization with embedding enhancer

IEEE/ACM Transactions on Audio, Speech, and Language Processing

Zhengyang Chen

Bing Han

Shuai Wang

Yanmin Qian

2024/2/16

VALL-T: Decoder-Only Generative Transducer for Robust and Decoding-Controllable Text-to-Speech

arXiv preprint arXiv:2401.14321

Chenpeng Du

Yiwei Guo

Hankun Wang

Yifan Yang

Zhikang Niu

...

2024/1/25

Dualvc 2: Dynamic masked convolution for unified streaming and non-streaming voice conversion

Ziqian Ning

Yuepeng Jiang

Pengcheng Zhu

Shuai Wang

Jixun Yao

...

2024/4/14

Advancing Speaker Embedding Learning: Wespeaker Toolkit for Research and Production

Available at SSRN 4748855

Shuai Wang

Zhengyang Chen

Bing Han

Hongji Wang

Chengdong Liang

...

2024

Leveraging in-the-wild Data for Effective Self-supervised Pretraining in Speaker Recognition

Shuai Wang

Qibing Bai

Qi Liu

Jianwei Yu

Zhengyang Chen

...

2024/4/14

Attention-based Encoder-Decoder Network for End-to-End Neural Speaker Diarization with Target Speaker Attractor

arXiv preprint arXiv:2305.10704

Zhengyang Chen

Bing Han

Shuai Wang

Yanmin Qian

2023/5/18

USED: Universal Speaker Extraction and Diarization

arXiv preprint arXiv:2309.10674

Junyi Ao

Mehmet Sinan Yıldırım

Meng Ge

Shuai Wang

Ruijie Tao

...

2023/9/19

Adversarial Speaker Disentanglement Using Unannotated External Data for Self-supervised Representation Based Voice Conversion

Xintao Zhao

Shuai Wang

Yang Chao

Zhiyong Wu

Helen Meng

2023/7/10

Wespeaker baselines for VoxSRC2023

arXiv preprint arXiv:2306.15161

Shuai Wang

Chengdong Liang

Xu Xiang

Bing Han

Zhengyang Chen

...

2023/6/27

UniCATS: A Unified Context-Aware Text-to-Speech Framework with Contextual VQ-Diffusion and Vocoding

Proceedings of the AAAI Conference on Artificial Intelligence

Chenpeng Du

Yiwei Guo

Feiyu Shen

Zhijun Liu

Zheng Liang

...

2024/3/24

Wespeaker: A research and production oriented speaker embedding learning toolkit

Hongji Wang

Chengdong Liang

Shuai Wang

Zhengyang Chen

Binbin Zhang

...

2023/6/4

DualVC: Dual-mode Voice Conversion using Intra-model Knowledge Distillation and Hybrid Predictive Coding

arXiv preprint arXiv:2305.12425

Ziqian Ning

Yuepeng Jiang

Pengcheng Zhu

Jixun Yao

Shuai Wang

...

2023/5/21

Context-aware Multimodal Fusion for Emotion Recognition.

Jinchao Li

Shuai Wang

Yang Chao

Xunying Liu

Helen Meng

2022

On the Importance of Different Frequency Bins for Speaker Verification

Aiwen Deng

Shuai Wang

Wenxiong Kang

Feiqi Deng

2022/5/23

Self-Knowledge Distillation via Feature Enhancement for Speaker Verification

Bei Liu

Haoyu Wang

Zhengyang Chen

Shuai Wang

Yanmin Qian

2022/5/23

DF-ResNet: Boosting Speaker Verification Performance with Depth-First Design.

Bei Liu

Zhengyang Chen

Shuai Wang

Haoyu Wang

Bing Han

...

2022

See List of Professors in Shuai Wang University(Shanghai Jiao Tong University)

Co-Authors

H-index: 76
Haizhou Li

Haizhou Li

National University of Singapore

H-index: 60
Lukas Burget

Lukas Burget

Vysoké ucení technické v Brne

H-index: 49
Kai Yu(俞凯)

Kai Yu(俞凯)

Shanghai Jiao Tong University

H-index: 47
Jan Cernocky

Jan Cernocky

Vysoké ucení technické v Brne

H-index: 40
Yanmin Qian

Yanmin Qian

Shanghai Jiao Tong University

H-index: 33
Oldřich Plchot

Oldřich Plchot

Vysoké ucení technické v Brne

academic-engine