Siddhant Arora

Carnegie Mellon University

H-index: 9

North America-United States

About Siddhant Arora

Siddhant Arora is a researcher at Carnegie Mellon University with an h-index of 9, both overall and since 2020. He specializes in machine learning, speech processing, and natural language processing.

His recent articles include:

Phoneme-aware Encoding for Prefix-tree-based Contextual ASR

Semi-Autoregressive Streaming ASR with Label Context

Dynamic-SUPERB: Towards a dynamic, collaborative, and comprehensive instruction-tuning benchmark for speech

TMT: Tri-Modal Translation between Speech, Image, and Text by Processing Different Modalities as Different Languages

OWSM v3.1: Better and Faster Open Whisper-Style Speech Models based on E-Branchformer

The Pipeline System of ASR and NLU with MLM-based data Augmentation Toward Stop Low-Resource Challenge

Joint modelling of spoken language understanding tasks with integrated dialog history

Streaming joint speech recognition and disfluency detection

Siddhant Arora Information

University: Carnegie Mellon University

Position: Graduate Student

Citations (all): 317

Citations (since 2020): 317

Cited by: 17

h-index (all): 9

h-index (since 2020): 9

i10-index (all): 9

i10-index (since 2020): 9


Siddhant Arora Skills & Research Interests

Machine Learning

Speech Processing

Natural language processing

Top articles of Siddhant Arora

Phoneme-aware Encoding for Prefix-tree-based Contextual ASR

2024/4/14

Siddhant Arora

H-Index: 3

Shinji Watanabe

H-Index: 45

Semi-Autoregressive Streaming ASR with Label Context

2024/4/14

Siddhant Arora

H-Index: 3

Shinji Watanabe

H-Index: 45

Dynamic-SUPERB: Towards a dynamic, collaborative, and comprehensive instruction-tuning benchmark for speech

2024/4/14

TMT: Tri-Modal Translation between Speech, Image, and Text by Processing Different Modalities as Different Languages

arXiv preprint arXiv:2402.16021

2024/2/25

OWSM v3.1: Better and Faster Open Whisper-Style Speech Models based on E-Branchformer

arXiv preprint arXiv:2401.16658

2024/1/30

The Pipeline System of ASR and NLU with MLM-based data Augmentation Toward Stop Low-Resource Challenge

2023/6/4

Joint modelling of spoken language understanding tasks with integrated dialog history

2023/6/4

Streaming joint speech recognition and disfluency detection

2023/6/4

Siddhant Arora

H-Index: 3

Shinji Watanabe

H-Index: 45

Tensor decomposition for minimization of E2E SLU model toward on-device processing

arXiv preprint arXiv:2306.01247

2023/6/2

A comparative study on E-Branchformer vs Conformer in speech recognition, translation, and understanding tasks

arXiv preprint arXiv:2305.11073

2023/5/18

A study on the integration of pre-trained SSL, ASR, LM and SLU models for spoken language understanding

2023/1/9

Teaching Old DB Neu(ral) Tricks: Learning Embeddings on Multi-tabular Databases

2023/1/4

ESPnet-SUMM: Introducing a Novel Large Dataset, Toolkit, and a Cross-Corpora Evaluation of Speech Summarization Systems

2023/12/16

UniverSLU: Universal spoken language understanding for diverse classification and sequence generation tasks with a single network

arXiv preprint arXiv:2310.02973

2023/10/4

Decoder-only architecture for speech recognition with CTC prompts and text data augmentation

arXiv preprint arXiv:2309.08876

2023/9/16

Siddhant Arora

H-Index: 3

Shinji Watanabe

H-Index: 45

Integration of Frame- and Label-synchronous Beam Search for Streaming Encoder-decoder Speech Recognition

arXiv preprint arXiv:2307.12767

2023/7/24

Siddhant Arora

H-Index: 3

Shinji Watanabe

H-Index: 45

Integrating pretrained ASR and LM to perform sequence generation for spoken language understanding

arXiv preprint arXiv:2307.11005

2023/7/20

BASS: Block-wise Adaptation for Speech Summarization

arXiv preprint arXiv:2307.08217

2023/7/17

CMU’s IWSLT 2023 simultaneous speech translation system

2023/7
