Xixin Wu
University of Cambridge
H-index: 19
Europe-United Kingdom
Top articles of Xixin Wu
Title | Journal | Author(s) | Publication Date |
---|---|---|---|
Cross-Speaker Encoding Network for Multi-Talker Speech Recognition | arXiv preprint arXiv:2401.04152 | Jiawen Kang Lingwei Meng Mingyu Cui Haohan Guo Xixin Wu | 2024/1/8 |
Stylespeech: Self-supervised style enhancing with vq-vae-based pre-training for expressive audiobook speech synthesis | | Xueyuan Chen Xi Wang Shaofei Zhang Lei He Zhiyong Wu | 2024/4/14 |
Improving Language Model-Based Zero-Shot Text-to-Speech Synthesis with Multi-Scale Acoustic Prompts | | Shun Lei Yixuan Zhou Liyang Chen Dan Luo Zhiyong Wu | 2024/4/14 |
SimCalib: Graph Neural Network Calibration based on Similarity between Nodes | Proceedings of the AAAI Conference on Artificial Intelligence | Boshi Tang Zhiyong Wu Xixin Wu Qiaochu Huang Jun Chen | 2024/3/24 |
Exploiting Audio-Visual Features with Pretrained AV-HuBERT for Multi-Modal Dysarthric Speech Reconstruction | arXiv preprint arXiv:2401.17796 | Xueyuan Chen Yuejiao Wang Xixin Wu Disong Wang Zhiyong Wu | 2024/1/31 |
UNIT-DSR: Dysarthric Speech Reconstruction System Using Speech Unit Normalization | arXiv preprint arXiv:2401.14664 | Yuejiao Wang Xixin Wu Disong Wang Lingwei Meng Helen Meng | 2024/1/26 |
Unifying One-Shot Voice Conversion and Cloning with Disentangled Speech Representations | | Hui Lu Xixin Wu Haohan Guo Songxiang Liu Zhiyong Wu | 2024/4/14 |
A sidecar separator can convert a single-talker speech recognition system to a multi-talker one | | Lingwei Meng Jiawen Kang Mingyu Cui Yuejiao Wang Xixin Wu | 2023/6/4 |
MSStyleTTS: Multi-Scale Style Modeling with Hierarchical Context Information for Expressive Speech Synthesis | IEEE/ACM Transactions on Audio, Speech, and Language Processing | Shun Lei Yixuan Zhou Liyang Chen Zhiyong Wu Xixin Wu | 2023/8/2 |
Natural language embedded programs for hybrid language symbolic reasoning | arXiv preprint arXiv:2309.10814 | Tianhua Zhang Jiaxin Ge Hongyin Luo Yung-Sung Chuang Mingye Gao | 2023/9/19 |
Disentangled Speech Representation Learning for One-Shot Cross-Lingual Voice Conversion Using β-VAE | | Hui Lu Disong Wang Xixin Wu Zhiyong Wu Xunying Liu | 2023/1/9 |
Unified modeling of multi-talker overlapped speech recognition and diarization with a sidecar separator | arXiv preprint arXiv:2305.16263 | Lingwei Meng Jiawen Kang Mingyu Cui Haibin Wu Xixin Wu | 2023/5/25 |
Integrated and enhanced pipeline system to support spoken language analytics for screening neurocognitive disorders | Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH | Helen Meng Brian Mak Man-Wai Mak Helene Fung Xianmin Gong | 2023/8 |
Hiformer: Sequence Modeling Networks with Hierarchical Attention Mechanisms | IEEE/ACM Transactions on Audio, Speech, and Language Processing | Xixin Wu Hui Lu Kun Li Zhiyong Wu Xunying Liu | 2023/9/8 |
Search augmented instruction learning | | Hongyin Luo Tianhua Zhang Yung-Sung Chuang Yuan Gong Yoon Kim | 2023/12 |
Sail: Search-augmented instruction learning | arXiv preprint arXiv:2305.15225 | Hongyin Luo Yung-Sung Chuang Yuan Gong Tianhua Zhang Yoon Kim | 2023/5/24 |
ConvRGX: Recognition, Generation, and Extraction for Self-trained Conversational Question Answering | | Tianhua Zhang Liping Tang Wei Fang Hongyin Luo Xixin Wu | 2023/7 |
QS-TTS: towards semi-supervised text-to-speech synthesis via vector-quantized self-supervised speech representation learning | arXiv preprint arXiv:2309.00126 | Haohan Guo Fenglong Xie Jiawen Kang Yujia Xiao Xixin Wu | 2023/8/31 |
Injecting linguistic knowledge into BERT for Dialogue State Tracking | arXiv preprint arXiv:2311.15623 | Xiaohan Feng Xixin Wu Helen Meng | 2023/11/27 |
MSMC-TTS: Multi-stage multi-codebook VQ-VAE based neural TTS | IEEE/ACM Transactions on Audio, Speech, and Language Processing | Haohan Guo Fenglong Xie Xixin Wu Frank K Soong Helen Meng | 2023/5/2 |