Sanjeev Khudanpur
Johns Hopkins University
H-index: 64
North America-United States
Top articles of Sanjeev Khudanpur
Title | Journal | Author(s) | Publication Date |
---|---|---|---|
Less Peaky and More Accurate CTC Forced Alignment by Label Priors | Ruizhe Huang Xiaohui Zhang Zhaoheng Ni Li Sun Moto Hira | 2024/4/14 | |
Enhancing code-switching speech recognition with interactive language biases | Hexin Liu Leibny Paola Garcia Xiangyu Zhang Andy WH Khong Sanjeev Khudanpur | 2024/4/14 | |
Enhancing End-to-End Conversational Speech Translation Through Target Language Context Utilization | Amir Hussein Brian Yan Antonios Anastasopoulos Shinji Watanabe Sanjeev Khudanpur | 2024/4/14 | |
Speech collage: code-switched audio generation by collaging monolingual corpora | Amir Hussein Dorsa Zeinali Ondřej Klejch Matthew Wiesner Brian Yan | 2024/4/14 | |
On Speaker Attribution with SURT | arXiv preprint arXiv:2401.15676 | Desh Raj Matthew Wiesner Matthew Maciejewski Leibny Paola Garcia-Perera Daniel Povey | 2024/1/28 |
ConEC: Earnings call dataset with real-world contexts for benchmarking contextual speech recognition | Ruizhe Huang Mahsa Yarmohammad Jan Trmal Jing Liu Desh Raj | 2024 | |
MERLIon CCS Challenge: A English-Mandarin code-switching child-directed speech corpus for language identification and diarization | Interspeech 2023 | Victoria YH Chua Hexin Liu Leibny Paola Garcia Perera Fei Ting Woon Jinyi Wong | 2023/5/30 |
EURO: ESPnet unsupervised asr open-source toolkit | Dongji Gao Jiatong Shi Shun-Po Chuang Leibny Paola Garcia Hung-yi Lee | 2023/6/4 | |
JHU IWSLT 2023 Multilingual Speech Translation System Description | Henry Li Xinyuan Neha Verma Bismarck Bamfo Odoom Ujvala Pradeep Matthew Wiesner | 2023/7 | |
Joint Energy-Based Model for Robust Speech Classification System Against Dirty-Label Backdoor Poisoning Attacks | Martin Sustek Sonal Joshi Henry Li Thomas Thebaud Jesús Villalba | 2023/12/16 | |
Investigating model performance in language identification: beyond simple error statistics | Interspeech 2023 | Suzy J Styles Victoria YH Chua Fei Ting Woon Hexin Liu Leibny Paola Garcia Perera | 2023/5/30 |
Adapting self-supervised models to multi-talker speech recognition using speaker embeddings | Zili Huang Desh Raj Paola García Sanjeev Khudanpur | 2023/6/4 | |
JHU IWSLT 2023 Dialect Speech Translation System Description | Amir Hussein Cihan Xiao Neha Verma Thomas Thebaud Matthew Wiesner | 2023/7 | |
Clustering Unsupervised Representations as Defense Against Poisoning Attacks on Speech Commands Classification System | Thomas Thebaud Sonal Joshi Henry Li Martin Sustek Jesús Villalba | 2023/12/16 | |
Textual data augmentation for Arabic-English code-switching speech recognition | Amir Hussein Shammur Absar Chowdhury Ahmed Abdelali Najim Dehak Ahmed Ali | 2023/1/9 | |
Reducing language confusion for code-switching speech recognition with token-level language diarization | Hexin Liu Haihua Xu Leibny Paola Garcia Andy WH Khong Yi He | 2023/6/4 | |
The chime-7 dasr challenge: Distant meeting transcription with multiple devices in diverse scenarios | arXiv preprint arXiv:2306.13734 | Samuele Cornell Matthew Wiesner Shinji Watanabe Desh Raj Xuankai Chang | 2023/6/23 |
Learning From Flawed Data: Weakly Supervised Automatic Speech Recognition | Dongji Gao Hainan Xu Desh Raj Leibny Paola Garcia Perera Daniel Povey | 2023/12/16 | |
A dilemma of ground truth in noisy speech separation and an to lessen the of data | COMPUTER SPEECH AND LANGUAGE | Matthew Maciejewski Jing Shi Shinji Watanabe Sanjeev Khudanpur | 2023/1/1 |
Bypass temporal classification: Weakly supervised automatic speech recognition with imperfect transcripts | arXiv preprint arXiv:2306.01031 | Dongji Gao Matthew Wiesner Hainan Xu Leibny Paola Garcia Daniel Povey | 2023/6/1 |