Xinjian Li

Carnegie Mellon University

H-index: 12

North America-United States

About Xinjian Li

Xinjian Li is a researcher at Carnegie Mellon University who specializes in speech recognition. He has an h-index of 12, both overall and since 2020.

His recent articles include:

Text-Inductive Graphone-Based Language Adaptation for Low-Resource Speech Synthesis

Learning to speak from text: Zero-shot multilingual text-to-speech with unsupervised text pretraining

Yodas: Youtube-Oriented Dataset for Audio and Speech

Reproducing Whisper-style training using an open-source toolkit and publicly available data

CMU’s IWSLT 2023 simultaneous speech translation system

Phone Inventories and Recognition for Every Language

Textless Direct Speech-to-Speech Translation with Discrete Speech Representation

ASR2K: Speech Recognition for Around 2000 Languages without Audio

Xinjian Li Information

Citations (all): 463

Citations (since 2020): 456

Cited by: 91

h-index (all): 12

h-index (since 2020): 12

i10-index (all): 14

i10-index (since 2020): 14

Xinjian Li Skills & Research Interests

speech recognition

Top articles of Xinjian Li

Title: Text-Inductive Graphone-Based Language Adaptation for Low-Resource Speech Synthesis
Journal: IEEE/ACM Transactions on Audio, Speech, and Language Processing
Author(s): Takaaki Saeki, Soumi Maiti, Xinjian Li, Shinji Watanabe, Shinnosuke Takamichi, ...
Publication Date: 2024/2/23

Title: Learning to speak from text: Zero-shot multilingual text-to-speech with unsupervised text pretraining
Journal: arXiv preprint arXiv:2301.12596
Author(s): Takaaki Saeki, Soumi Maiti, Xinjian Li, Shinji Watanabe, Shinnosuke Takamichi, ...
Publication Date: 2023/1/30

Title: Yodas: Youtube-Oriented Dataset for Audio and Speech
Author(s): Xinjian Li, Shinnosuke Takamichi, Takaaki Saeki, William Chen, Sayaka Shiota, ...
Publication Date: 2023/12/16

Title: Reproducing Whisper-style training using an open-source toolkit and publicly available data
Author(s): Yifan Peng, Jinchuan Tian, Brian Yan, Dan Berrebbi, Xuankai Chang, ...
Publication Date: 2023/12/16

Title: CMU’s IWSLT 2023 simultaneous speech translation system
Author(s): Brian Yan, Jiatong Shi, Soumi Maiti, William Chen, Xinjian Li, ...
Publication Date: 2023/7

Title: Phone Inventories and Recognition for Every Language
Author(s): Xinjian Li, Florian Metze, David R Mortensen, Alan W Black, Shinji Watanabe
Publication Date: 2022

Title: Textless Direct Speech-to-Speech Translation with Discrete Speech Representation
Journal: ICASSP 2023
Author(s): Xinjian Li, Ye Jia, Chung-Cheng Chiu
Publication Date: 2022/10/31

Title: ASR2K: Speech Recognition for Around 2000 Languages without Audio
Journal: Interspeech 2022
Author(s): Xinjian Li, Florian Metze, David R Mortensen, Alan W Black, Shinji Watanabe
Publication Date: 2022/9/6

Title: On adversarial robustness of large-scale audio visual learning
Author(s): Juncheng B Li, Shuhui Qu, Xinjian Li, Po-Yao Bernie Huang, Florian Metze
Publication Date: 2022/5/23

Title: Zero-shot learning for grapheme to phoneme conversion with language ensemble
Author(s): Xinjian Li, Florian Metze, David R Mortensen, Shinji Watanabe, Alan W Black
Publication Date: 2022/5

Title: Phoneme Recognition through Fine Tuning of Phonetic Representations: a Case Study on Luhya Language Varieties
Journal: Interspeech 2021
Author(s): Kathleen Siminyu, Xinjian Li, Antonios Anastasopoulos, David Mortensen, Michael R Marlo, ...
Publication Date: 2021/4/4

Title: On Prosody Modeling for ASR+TTS based Voice Conversion
Journal: ASRU 2021
Author(s): Wen-Chin Huang, Tomoki Hayashi, Xinjian Li, Shinji Watanabe, Tomoki Toda
Publication Date: 2021/7/20

Title: Tusom2021: A Phonetically Transcribed Speech Dataset from an Endangered Language for Universal Phone Recognition Experiments
Journal: Interspeech 2021
Author(s): David R Mortensen, Jordan Picone, Xinjian Li, Kathleen Siminyu
Publication Date: 2021/4/2

Title: Multilingual phonetic dataset for low resource speech recognition
Author(s): Xinjian Li, David R Mortensen, Florian Metze, Alan W Black
Publication Date: 2021/6/6

Title: Hierarchical Phone Recognition with Compositional Phonetics
Journal: Proc. Interspeech 2021
Author(s): Xinjian Li, Juncheng Li, Florian Metze, Alan W Black
Publication Date: 2021

Title: Phone distribution estimation for low resource languages
Author(s): Xinjian Li, Juncheng Li, Jiali Yao, Alan W Black, Florian Metze
Publication Date: 2021/6/6

Title: Acoustics based intent recognition using discovered phonetic units for low resource languages
Author(s): Akshat Gupta, Xinjian Li, Sai Krishna Rallabandi, Alan W Black
Publication Date: 2021/6/6

Title: A summary of the first workshop on language technology for language documentation and revitalization
Journal: arXiv preprint arXiv:2004.13203
Author(s): Graham Neubig, Shruti Rijhwani, Alexis Palmer, Jordan MacKenzie, Hilaria Cruz, ...
Publication Date: 2020/4/27

Title: AlloVera: a multilingual allophone database
Author(s): David R Mortensen, Xinjian Li, Patrick Littell, Alexis Michaud, Shruti Rijhwani, ...
Publication Date: 2020/4/17

Title: Universal phone recognition with a multilingual allophone system
Author(s): Xinjian Li, Siddharth Dalmia, Juncheng Li, Matthew Lee, Patrick Littell, ...
Publication Date: 2020/2/26

Co-Authors

Graham Neubig, Carnegie Mellon University, H-index: 82

Alan W Black, Carnegie Mellon University, H-index: 78

Antonis Anastasopoulos, George Mason University, H-index: 29

Siddharth Dalmia, Carnegie Mellon University, H-index: 18

David Mortensen, Carnegie Mellon University, H-index: 17

Billy li (Juncheng), Carnegie Mellon University, H-index: 15
