Siddhant Arora at Carnegie Mellon University

University	Carnegie Mellon University
Position	Graduate Student
Citations(all)	317
Citations(since 2020)	317
Cited By	17
hIndex(all)	9
hIndex(since 2020)	9
i10Index(all)	9
i10Index(since 2020)	9
Email	Access Email
University Profile Page	Carnegie Mellon University
Google Scholar	View Google Scholar Profile

Phoneme-aware Encoding for Prefix-tree-based Contextual ASR

2024/4/14

Siddhant Arora

H-Index: 3

Shinji Watanabe

H-Index: 45

Semi-Autoregressive Streaming ASR with Label Context

2024/4/14

Siddhant Arora

H-Index: 3

Shinji Watanabe

H-Index: 45

Dynamic-superb: Towards a dynamic, collaborative, and comprehensive instruction-tuning benchmark for speech

2024/4/14

Chien-Yu Huang

H-Index: 1

Haibin Wu

H-Index: 6

Siddhant Arora

H-Index: 3

Kai-Wei Chang

H-Index: 3

Jiatong Shi

H-Index: 2

Yifan Peng

H-Index: 0

Roshan Sharma

H-Index: 3

Shinji Watanabe

H-Index: 45

Hung-Yi Lee

H-Index: 21

TMT: Tri-Modal Translation between Speech, Image, and Text by Processing Different Modalities as Different Languages

arXiv preprint arXiv:2402.16021

2024/2/25

Minsu Kim

H-Index: 2

Jee-Weon Jung

H-Index: 11

Siddhant Arora

H-Index: 3

Xuankai Chang

H-Index: 11

Shinji Watanabe

H-Index: 45

Yong Man Ro

H-Index: 32

OWSM v3. 1: Better and Faster Open Whisper-Style Speech Models based on E-Branchformer

arXiv preprint arXiv:2401.16658

2024/1/30

Yifan Peng

H-Index: 0

William Chen

H-Index: 4

Siddhant Arora

H-Index: 3

Brian Yan

H-Index: 0

Muhammad Shakeel

H-Index: 4

Kwanghee Choi

H-Index: 1

Jiatong Shi

H-Index: 2

Xuankai Chang

H-Index: 11

Jee-Weon Jung

H-Index: 11

Shinji Watanabe

H-Index: 45

The Pipeline System of ASR and NLU with MLM-based data Augmentation Toward Stop Low-Resource Challenge

2023/6/4

Siddhant Arora

H-Index: 3

Shih-Lun Wu

H-Index: 2

Yifan Peng

H-Index: 0

Brian Yan

H-Index: 0

Shinji Watanabe

H-Index: 45

Joint modelling of spoken language understanding tasks with integrated dialog history

2023/6/4

Siddhant Arora

H-Index: 3

Brian Yan

H-Index: 0

Shinji Watanabe

H-Index: 45

Streaming joint speech recognition and disfluency detection

2023/6/4

Siddhant Arora

H-Index: 3

Shinji Watanabe

H-Index: 45

Tensor decomposition for minimization of E2E SLU model toward on-device processing

arXiv preprint arXiv:2306.01247

2023/6/2

Siddhant Arora

H-Index: 3

Shih-Lun Wu

H-Index: 2

Yifan Peng

H-Index: 0

Brian Yan

H-Index: 0

Shinji Watanabe

H-Index: 45

A comparative study on E-branchformer vs conformer in speech recognition, translation, and understanding tasks

arXiv preprint arXiv:2305.11073

2023/5/18

Yifan Peng

H-Index: 0

Brian Yan

H-Index: 0

Siddhant Arora

H-Index: 3

William Chen

H-Index: 4

Shinji Watanabe

H-Index: 45

A study on the integration of pre-trained ssl, asr, lm and slu models for spoken language understanding

2023/1/9

Yifan Peng

H-Index: 0

Siddhant Arora

H-Index: 3

Yosuke Higuchi

H-Index: 3

Karthik Ganesan

H-Index: 3

Siddharth Dalmia

H-Index: 7

Xuankai Chang

H-Index: 11

Shinji Watanabe

H-Index: 45

Teaching Old DB Neu (ral) Tricks: Learning Embeddings on Multi-tabular Databases

2023/1/4

Garima Gaur

H-Index: 1

Rajat Singh

H-Index: 3

Siddhant Arora

H-Index: 3

Vinayak Gupta

H-Index: 2

Srikanta Bedathur

H-Index: 14

Espnet-Summ: Introducing a Novel Large Dataset, Toolkit, and a Cross-Corpora Evaluation of Speech Summarization Systems

2023/12/16

Roshan Sharma

H-Index: 3

William Chen

H-Index: 4

Ruchira Sharma

H-Index: 4

Siddhant Arora

H-Index: 3

Shinji Watanabe

H-Index: 45

Rita Singh

H-Index: 37

Bhiksha Raj

H-Index: 40

Reproducing whisper-style training using an open-source toolkit and publicly available data

2023/12/16

Yifan Peng

H-Index: 0

Brian Yan

H-Index: 0

Xuankai Chang

H-Index: 11

Xinjian Li

H-Index: 5

Jiatong Shi

H-Index: 2

Siddhant Arora

H-Index: 3

William Chen

H-Index: 4

Roshan Sharma

H-Index: 3

Wangyou Zhang

H-Index: 5

Muhammad Shakeel

H-Index: 4

Jee-Weon Jung

H-Index: 11

Shinji Watanabe

H-Index: 45

Universlu: Universal spoken language understanding for diverse classification and sequence generation tasks with a single network

arXiv preprint arXiv:2310.02973

2023/10/4

Siddhant Arora

H-Index: 3

Jee-Weon Jung

H-Index: 11

Yifan Peng

H-Index: 0

Roshan Sharma

H-Index: 3

Shinji Watanabe

H-Index: 45

Decoder-only architecture for speech recognition with ctc prompts and text data augmentation

arXiv preprint arXiv:2309.08876

2023/9/16

Siddhant Arora

H-Index: 3

Shinji Watanabe

H-Index: 45

Integration of Frame-and Label-synchronous Beam Search for Streaming Encoder-decoder Speech Recognition

arXiv preprint arXiv:2307.12767

2023/7/24

Siddhant Arora

H-Index: 3

Shinji Watanabe

H-Index: 45

Integrating pretrained ASR and LM to perform sequence generation for spoken language understanding

arXiv preprint arXiv:2307.11005

2023/7/20

Siddhant Arora

H-Index: 3

Brian Yan

H-Index: 0

Shinji Watanabe

H-Index: 45

BASS: Block-wise Adaptation for Speech Summarization

arXiv preprint arXiv:2307.08217

2023/7/17

Roshan Sharma

H-Index: 3

Kenneth Zheng

H-Index: 3

Siddhant Arora

H-Index: 3

Shinji Watanabe

H-Index: 45

Rita Singh

H-Index: 37

Bhiksha Raj

H-Index: 40

Cmu’s iwslt 2023 simultaneous speech translation system

2023/7

Brian Yan

H-Index: 0

Jiatong Shi

H-Index: 2

William Chen

H-Index: 4

Xinjian Li

H-Index: 5

Yifan Peng

H-Index: 0

Siddhant Arora

H-Index: 3

Shinji Watanabe

H-Index: 45

Siddhant Arora

Carnegie Mellon University

About Siddhant Arora

Siddhant Arora Information

Siddhant Arora Skills & Research Interests

Top articles of Siddhant Arora

Phoneme-aware Encoding for Prefix-tree-based Contextual ASR

Siddhant Arora

Shinji Watanabe

Semi-Autoregressive Streaming ASR with Label Context

Siddhant Arora

Shinji Watanabe

Dynamic-superb: Towards a dynamic, collaborative, and comprehensive instruction-tuning benchmark for speech

Chien-Yu Huang

Haibin Wu

Siddhant Arora

Kai-Wei Chang

Jiatong Shi

Yifan Peng

Roshan Sharma

Shinji Watanabe

Hung-Yi Lee

TMT: Tri-Modal Translation between Speech, Image, and Text by Processing Different Modalities as Different Languages

Minsu Kim

Jee-Weon Jung

Siddhant Arora

Xuankai Chang

Shinji Watanabe

Yong Man Ro

OWSM v3. 1: Better and Faster Open Whisper-Style Speech Models based on E-Branchformer

Yifan Peng

William Chen

Siddhant Arora

Brian Yan

Muhammad Shakeel

Kwanghee Choi

Jiatong Shi

Xuankai Chang

Jee-Weon Jung

Shinji Watanabe

The Pipeline System of ASR and NLU with MLM-based data Augmentation Toward Stop Low-Resource Challenge

Siddhant Arora

Shih-Lun Wu

Yifan Peng

Brian Yan

Shinji Watanabe

Joint modelling of spoken language understanding tasks with integrated dialog history

Siddhant Arora

Brian Yan

Shinji Watanabe

Streaming joint speech recognition and disfluency detection

Siddhant Arora

Shinji Watanabe

Tensor decomposition for minimization of E2E SLU model toward on-device processing

Siddhant Arora

Shih-Lun Wu

Yifan Peng

Brian Yan

Shinji Watanabe

A comparative study on E-branchformer vs conformer in speech recognition, translation, and understanding tasks

Yifan Peng

Brian Yan

Siddhant Arora

William Chen

Shinji Watanabe

A study on the integration of pre-trained ssl, asr, lm and slu models for spoken language understanding

Yifan Peng

Siddhant Arora

Yosuke Higuchi

Karthik Ganesan

Siddharth Dalmia

Xuankai Chang

Shinji Watanabe

Teaching Old DB Neu (ral) Tricks: Learning Embeddings on Multi-tabular Databases

Garima Gaur

Rajat Singh

Siddhant Arora

Vinayak Gupta

Srikanta Bedathur

Espnet-Summ: Introducing a Novel Large Dataset, Toolkit, and a Cross-Corpora Evaluation of Speech Summarization Systems