Siddhant Arora
Carnegie Mellon University
H-index: 9
North America-United States
Top articles of Siddhant Arora
Phoneme-aware Encoding for Prefix-tree-based Contextual ASR
2024/4/14
Siddhant Arora
H-Index: 3
Shinji Watanabe
H-Index: 45
Semi-Autoregressive Streaming ASR with Label Context
2024/4/14
Siddhant Arora
H-Index: 3
Shinji Watanabe
H-Index: 45
Dynamic-superb: Towards a dynamic, collaborative, and comprehensive instruction-tuning benchmark for speech
2024/4/14
TMT: Tri-Modal Translation between Speech, Image, and Text by Processing Different Modalities as Different Languages
arXiv preprint arXiv:2402.16021
2024/2/25
OWSM v3. 1: Better and Faster Open Whisper-Style Speech Models based on E-Branchformer
arXiv preprint arXiv:2401.16658
2024/1/30
The Pipeline System of ASR and NLU with MLM-based data Augmentation Toward Stop Low-Resource Challenge
2023/6/4
Siddhant Arora
H-Index: 3
Shih-Lun Wu
H-Index: 2
Yifan Peng
H-Index: 0
Brian Yan
H-Index: 0
Shinji Watanabe
H-Index: 45
Joint modelling of spoken language understanding tasks with integrated dialog history
2023/6/4
Streaming joint speech recognition and disfluency detection
2023/6/4
Siddhant Arora
H-Index: 3
Shinji Watanabe
H-Index: 45
Tensor decomposition for minimization of E2E SLU model toward on-device processing
arXiv preprint arXiv:2306.01247
2023/6/2
Siddhant Arora
H-Index: 3
Shih-Lun Wu
H-Index: 2
Yifan Peng
H-Index: 0
Brian Yan
H-Index: 0
Shinji Watanabe
H-Index: 45
A comparative study on E-branchformer vs conformer in speech recognition, translation, and understanding tasks
arXiv preprint arXiv:2305.11073
2023/5/18
Yifan Peng
H-Index: 0
Brian Yan
H-Index: 0
Siddhant Arora
H-Index: 3
William Chen
H-Index: 4
Shinji Watanabe
H-Index: 45
A study on the integration of pre-trained ssl, asr, lm and slu models for spoken language understanding
2023/1/9
Teaching Old DB Neu (ral) Tricks: Learning Embeddings on Multi-tabular Databases
2023/1/4
Garima Gaur
H-Index: 1
Rajat Singh
H-Index: 3
Siddhant Arora
H-Index: 3
Vinayak Gupta
H-Index: 2
Srikanta Bedathur
H-Index: 14
Espnet-Summ: Introducing a Novel Large Dataset, Toolkit, and a Cross-Corpora Evaluation of Speech Summarization Systems
2023/12/16
Reproducing whisper-style training using an open-source toolkit and publicly available data
2023/12/16
Universlu: Universal spoken language understanding for diverse classification and sequence generation tasks with a single network
arXiv preprint arXiv:2310.02973
2023/10/4
Siddhant Arora
H-Index: 3
Jee-Weon Jung
H-Index: 11
Yifan Peng
H-Index: 0
Roshan Sharma
H-Index: 3
Shinji Watanabe
H-Index: 45
Decoder-only architecture for speech recognition with ctc prompts and text data augmentation
arXiv preprint arXiv:2309.08876
2023/9/16
Siddhant Arora
H-Index: 3
Shinji Watanabe
H-Index: 45
Integration of Frame-and Label-synchronous Beam Search for Streaming Encoder-decoder Speech Recognition
arXiv preprint arXiv:2307.12767
2023/7/24
Siddhant Arora
H-Index: 3
Shinji Watanabe
H-Index: 45
Integrating pretrained ASR and LM to perform sequence generation for spoken language understanding
arXiv preprint arXiv:2307.11005
2023/7/20
BASS: Block-wise Adaptation for Speech Summarization
arXiv preprint arXiv:2307.08217
2023/7/17
Cmu’s iwslt 2023 simultaneous speech translation system
2023/7