Mitesh Khapra

About Mitesh Khapra

Mitesh Khapra is a researcher at the Indian Institute of Technology Madras who specializes in Natural Language Processing and Machine Learning. He has an h-index of 34 overall and 32 since 2020.

His recent articles include:

An Empirical Analysis of In-context Learning Abilities of LLMs for MT

A Comprehensive Analysis of Adapter Efficiency

IndicLLMSuite: A Blueprint for Creating Pre-training and Fine-Tuning Datasets for Indian Languages

IndicVoices: Towards building an Inclusive Multilingual Speech Dataset for Indian Languages

Airavata: Introducing Hindi Instruction-tuned LLM

Vistaar: Diverse Benchmarks and Training Sets for Indian Language ASR

Towards building text-to-speech systems for the next billion users

Effectiveness of mining audio and text pairs from public data for improving ASR systems for low-resource languages

Mitesh Khapra Information

University

Position

___

Citations (all): 4862
Citations (since 2020): 3831
Cited By: 2080
h-index (all): 34
h-index (since 2020): 32
i10-index (all): 80
i10-index (since 2020): 58

University Profile Page: Indian Institute of Technology Madras

Mitesh Khapra Skills & Research Interests

Natural Language Processing

Machine Learning

Top articles of Mitesh Khapra

Title: An Empirical Analysis of In-context Learning Abilities of LLMs for MT
Journal: arXiv preprint arXiv:2401.12097
Author(s): Pranjal A Chitale, Jay Gala, Varun Gumma, Mitesh M Khapra, Raj Dabre
Publication Date: 2024/1/22

Title: A Comprehensive Analysis of Adapter Efficiency
Author(s): Nandini Mundra, Sumanth Doddapaneni, Raj Dabre, Anoop Kunchukuttan, Ratish Puduppully, ...
Publication Date: 2024/1/4

Title: IndicLLMSuite: A Blueprint for Creating Pre-training and Fine-Tuning Datasets for Indian Languages
Journal: arXiv preprint arXiv:2403.06350
Author(s): Mohammed Safi Ur Rahman Khan, Priyam Mehta, Ananth Sankar, Umashankar Kumaravelan, Sumanth Doddapaneni, ...
Publication Date: 2024/3/11

Title: IndicVoices: Towards building an Inclusive Multilingual Speech Dataset for Indian Languages
Journal: arXiv preprint arXiv:2403.01926
Author(s): Tahir Javed, Janki Atul Nawale, Eldho Ittan George, Sakshi Joshi, Kaushal Santosh Bhogale, ...
Publication Date: 2024/3/4

Title: Airavata: Introducing Hindi Instruction-tuned LLM
Journal: arXiv preprint arXiv:2401.15006
Author(s): Jay Gala, Thanmay Jayakumar, Jaavid Aktar Husain, Mohammed Safi Ur Rahman Khan, Diptesh Kanojia, ...
Publication Date: 2024/1/26

Title: Vistaar: Diverse Benchmarks and Training Sets for Indian Language ASR
Journal: arXiv preprint arXiv:2305.15386
Author(s): Kaushal Santosh Bhogale, Sai Sundaresan, Abhigyan Raman, Tahir Javed, Mitesh M Khapra, ...
Publication Date: 2023/5/24

Title: Towards building text-to-speech systems for the next billion users
Author(s): Gokul Karthik Kumar, SV Praveen, Pratyush Kumar, Mitesh M Khapra, Karthik Nandakumar
Publication Date: 2023/6/4

Title: Effectiveness of mining audio and text pairs from public data for improving ASR systems for low-resource languages
Author(s): Kaushal Bhogale, Abhigyan Raman, Tahir Javed, Sumanth Doddapaneni, Anoop Kunchukuttan, ...
Publication Date: 2023/6/4

Title: Aksharantar: Open Indic-language transliteration datasets and models for the next billion users
Author(s): Yash Madhani, Sushane Parthan, Priyanka Bedekar, Gokul Nc, Ruchi Khapra, ...
Publication Date: 2023/12

Title: Bhasha-Abhijnaanam: Native-script and romanized Language Identification for 22 Indic languages
Journal: arXiv preprint arXiv:2305.15814
Author(s): Yash Madhani, Mitesh M Khapra, Anoop Kunchukuttan
Publication Date: 2023/5/25

Title: A survey of adversarial defenses and robustness in NLP
Author(s): Shreya Goyal, Sumanth Doddapaneni, Mitesh M Khapra, Balaraman Ravindran
Publication Date: 2023/7/17

Title: Svarah: Evaluating English ASR Systems on Indian Accents
Journal: arXiv preprint arXiv:2305.15760
Author(s): Tahir Javed, Sakshi Joshi, Vignesh Nagarajan, Sai Sundaresan, Janki Nawale, ...
Publication Date: 2023/5/25

Title: IndicMT Eval: A Dataset to Meta-Evaluate Machine Translation Metrics for Indian Languages
Author(s): Tanay Dixit, Vignesh Nagarajan, Anoop Kunchukuttan, Pratyush Kumar, Mitesh M Khapra, ...
Publication Date: 2023/7

Title: IndicTrans2: Towards high-quality and accessible machine translation models for all 22 scheduled Indian languages
Journal: arXiv preprint arXiv:2305.16307
Author(s): Jay Gala, Pranjal A Chitale, Raghavan AK, Sumanth Doddapaneni, Varun Gumma, ...
Publication Date: 2023/5/25

Title: IndicSUPERB: A speech processing universal performance benchmark for Indian languages
Journal: Proceedings of the AAAI Conference on Artificial Intelligence
Author(s): Tahir Javed, Kaushal Bhogale, Abhigyan Raman, Pratyush Kumar, Anoop Kunchukuttan, ...
Publication Date: 2023/6/26

Title: Towards building ASR systems for the next billion users
Journal: Proceedings of the AAAI Conference on Artificial Intelligence
Author(s): Tahir Javed, Sumanth Doddapaneni, Abhigyan Raman, Kaushal Santosh Bhogale, Gowtham Ramesh, ...
Publication Date: 2022/6/28

Title: A survey of evaluation metrics used for NLG systems
Author(s): Ananya B Sai, Akash Kumar Mohankumar, Mitesh M Khapra
Publication Date: 2022/1/18

Title: Aksharantar: Towards building open transliteration tools for the next billion users
Journal: arXiv preprint arXiv:2205.03018
Author(s): Yash Madhani, Sushane Parthan, Priyanka Bedekar, Ruchi Khapra, Vivek Seshadri, ...
Publication Date: 2022/5/6

Title: IndicNLG benchmark: Multilingual datasets for diverse NLG tasks in Indic languages
Journal: arXiv preprint arXiv:2203.05437
Author(s): Aman Kumar, Himani Shrotriya, Prachi Sahu, Raj Dabre, Ratish Puduppully, ...
Publication Date: 2022/3/10

Title: Scaling Graph Propagation Kernels for Predictive Learning
Journal: Frontiers in Big Data
Author(s): Priyesh Vijayan, Yash Chandak, Mitesh M Khapra, Srinivasan Parthasarathy, Balaraman Ravindran
Publication Date: 2022/4/8


Co-Authors

Dr. Pushpak Bhattacharyya (Indian Institute of Technology Bombay), H-index: 55

Balaraman Ravindran (Indian Institute of Technology Madras), H-index: 43

Pratyush Kumar (Indian Institute of Technology Madras), H-index: 26

Preksha Nema (Indian Institute of Technology Madras), H-index: 11

Lena Dankin (Tel Aviv University), H-index: 11

Janarthanan Rajendran (University of Michigan), H-index: 10
