Yanmin Qian
Shanghai Jiao Tong University
H-index: 40
Asia-China
Top articles of Yanmin Qian
Title | Journal | Author(s) | Publication Date |
---|---|---|---|
One-shot sensitivity-aware mixed sparsity pruning for large language models | Hang Shao Bei Liu Yanmin Qian | 2024/4/14 | |
Exploring Large Scale Pre-Trained Models for Robust Machine Anomalous Sound Detection | Bing Han Zhiqiang Lv Anbai Jiang Wen Huang Zhengyang Chen | 2024/4/14 | |
Leveraging in-the-wild Data for Effective Self-supervised Pretraining in Speaker Recognition | Shuai Wang Qibing Bai Qi Liu Jianwei Yu Zhengyang Chen | 2024/4/14 | |
Generation-Based Target Speech Extraction with Speech Discretization and Vocoder | Linfeng Yu Wangyou Zhang Chenpeng Du Leying Zhang Zheng Liang | 2024/4/14 | |
Unified Cross-Modal Attention: Robust Audio-Visual Speech Recognition and Beyond | IEEE/ACM Transactions on Audio, Speech, and Language Processing | Jiahong Li Chenda Li Yifei Wu Yanmin Qian | 2024/3/18 |
Robust Cross-Domain Speaker Verification with Multi-Level Domain Adapters | Wen Huang Bing Han Shuai Wang Zhengyang Chen Yanmin Qian | 2024/4/14 | |
Attention-based encoder-decoder end-to-end neural diarization with embedding enhancer | IEEE/ACM Transactions on Audio, Speech, and Language Processing | Zhengyang Chen Bing Han Shuai Wang Yanmin Qian | 2024/2/16 |
Improving Design of Input Condition Invariant Speech Enhancement | IEEE ICASSP 2024 | Wangyou Zhang* Jee-weon Jung* Shinji Watanabe Yanmin Qian | 2024/4/14 |
Comsl: A composite speech-language model for end-to-end speech-to-text translation | Advances in Neural Information Processing Systems | Chenyang Le Yao Qian Long Zhou Shujie Liu Yanmin Qian | 2024/2/13 |
Prompt-driven target speech diarization | Yidi Jiang Zhengyang Chen Ruijie Tao Liqun Deng Yanmin Qian | 2024/4/14 | |
GSTalker: Real-time Audio-Driven Talking Face Generation via Deformable Gaussian Splatting | arXiv preprint arXiv:2404.19040 | Bo Chen Shoukang Hu Qi Chen Chenpeng Du Ran Yi | 2024/4/29 |
Advanced Long-Content Speech Recognition with Factorized Neural Transducer | IEEE/ACM Transactions on Audio, Speech, and Language Processing | Xun Gong Yu Wu Jinyu Li Shujie Liu Rui Zhao | 2024/1/12 |
Predictive SkiM: Contrastive predictive coding for low-latency online speech separation | Chenda Li Yifei Wu Yanmin Qian | 2023/6/4 | |
Weakly-supervised speech pre-training: A case study on target speech recognition | arXiv preprint arXiv:2305.16286 | Wangyou Zhang Yanmin Qian | 2023/5/25 |
Improving dino-based self-supervised speaker verification with progressive cluster-aware training | Bing Han Wen Huang Zhengyang Chen Yanmin Qian | 2023/6/4 | |
Diffusion Conditional Expectation Model for Efficient and Robust Target Speech Extraction | arXiv preprint arXiv:2309.13874 | Leying Zhang Yao Qian Linfeng Yu Heming Wang Xinkai Wang | 2023/9/25 |
Toward universal speech enhancement for diverse input conditions | Wangyou Zhang Kohei Saijo Zhong-Qiu Wang Shinji Watanabe Yanmin Qian | 2023/12/16 | |
Universal Cross-Lingual Data Generation for Low Resource ASR | IEEE/ACM Transactions on Audio, Speech, and Language Processing | Wei Wang Yanmin Qian | 2023/12/22 |
End-to-end multi-speaker ASR with independent vector analysis | Robin Scheibler Wangyou Zhang Xuankai Chang Shinji Watanabe Yanmin Qian | 2023/1/9 | |
Whisper-KDQ: A Lightweight Whisper via Guided Knowledge Distillation and Quantization for Efficient ASR | arXiv preprint arXiv:2305.10788 | Hang Shao Wei Wang Bei Liu Xun Gong Haoyu Wang | 2023/5/18 |