Shivam Mehta at Kungliga Tekniska högskolan

University	Kungliga Tekniska högskolan
Position	___
Citations(all)	55
Citations(since 2020)	54
Cited By	1
hIndex(all)	5
hIndex(since 2020)	5
i10Index(all)	1
i10Index(since 2020)	1
Email	Access Email
University Profile Page	Kungliga Tekniska högskolan
Google Scholar	View Google Scholar Profile

Fake it to make it: Using synthetic data to remedy the data shortage in joint multimodal speech-and-gesture synthesis

arXiv preprint arXiv:2404.19622

2024/4/30

Shivam Mehta

H-Index: 1

Anna Deichler

H-Index: 1

Jim O'Regan

H-Index: 3

Birger Moëll

H-Index: 2

Gustav Eje Henter

H-Index: 15

Simon Alexanderson

H-Index: 9

Unified speech and gesture synthesis using flow matching

2024/4/14

Shivam Mehta

H-Index: 1

Ruibo Tu

H-Index: 3

Simon Alexanderson

H-Index: 9

Gustav Eje Henter

H-Index: 15

OverFlow: Putting flows on top of neural transducers for better TTS

Proc. INTERSPEECH 2023

2023

Shivam Mehta

H-Index: 1

Ambika Kirkland

H-Index: 2

Gustav Eje Henter

H-Index: 15

Diffusion-based co-speech gesture generation using joint text and audio representation

2023/10/9

Anna Deichler

H-Index: 1

Shivam Mehta

H-Index: 1

Simon Alexanderson

H-Index: 9

Matcha-TTS: A fast TTS architecture with conditional flow matching

2024/4/14

Shivam Mehta

H-Index: 1

Ruibo Tu

H-Index: 3

Gustav Eje Henter

H-Index: 15

Stuck in the MOS pit: A critical analysis of MOS test methodology in TTS evaluation

2023/6/15

Ambika Kirkland

H-Index: 2

Shivam Mehta

H-Index: 1

Gustav Eje Henter

H-Index: 15

Joakim Gustafson

H-Index: 14

Diff-TTSG: Denoising probabilistic integrated speech and gesture synthesis

arXiv preprint arXiv:2306.09417

2023/6/15

Shivam Mehta

H-Index: 1

Siyang Wang

H-Index: 2

Simon Alexanderson

H-Index: 9

Gustav Eje Henter

H-Index: 15

Prosody-controllable spontaneous TTS with neural HMMs

2023/6/4

Shivam Mehta

H-Index: 1

Gustav Eje Henter

H-Index: 15

Joakim Gustafson

H-Index: 14

Stereotypical nationality representations in HRI: perspectives from international young adults

Frontiers in Robotics and AI

2023

Ronald Cumbal

H-Index: 1

Shivam Mehta

H-Index: 1

Olov Engwall

H-Index: 12

Neural HMMs are all you need (for high-quality attention-free TTS)

2022/5/23

Shivam Mehta

H-Index: 1

Gustav Eje Henter

H-Index: 15

Learning fast with fewer data samples using Neural HMMs

2022

Shivam Mehta

H-Index: 1

Gustav Eje Henter

H-Index: 15

Speech data augmentation for improving phoneme transcriptions of aphasic speech using wav2vec 2.0 for the psst challenge

2022/6

Birger Moëll

H-Index: 2

Jim O'Regan

H-Index: 3

Shivam Mehta

H-Index: 1

Ambika Kirkland

H-Index: 2

Joakim Gustafson

H-Index: 14

Spontaneous Neural HMM TTS with Prosodic Feature Modification

2022

Shivam Mehta

H-Index: 1

Gustav Eje Henter

H-Index: 15

Ambika Kirkland

H-Index: 2

Birger Moëll

H-Index: 2

Jim O'Regan

H-Index: 3

Finding the Blank with Sequence Labeling for English Learning

2020/10/27

Shivam Mehta

H-Index: 1

Ivan Smetannikov

H-Index: 4

Shivam Mehta

Kungliga Tekniska högskolan

About Shivam Mehta

Shivam Mehta Information

Shivam Mehta Skills & Research Interests

Top articles of Shivam Mehta

Fake it to make it: Using synthetic data to remedy the data shortage in joint multimodal speech-and-gesture synthesis

Shivam Mehta

Anna Deichler

Jim O'Regan

Birger Moëll

Gustav Eje Henter

Simon Alexanderson

Unified speech and gesture synthesis using flow matching

Shivam Mehta

Ruibo Tu

Simon Alexanderson

Gustav Eje Henter

OverFlow: Putting flows on top of neural transducers for better TTS

Shivam Mehta

Ambika Kirkland

Gustav Eje Henter

Diffusion-based co-speech gesture generation using joint text and audio representation

Anna Deichler

Shivam Mehta

Simon Alexanderson

Matcha-TTS: A fast TTS architecture with conditional flow matching

Shivam Mehta

Ruibo Tu

Gustav Eje Henter

Stuck in the MOS pit: A critical analysis of MOS test methodology in TTS evaluation

Ambika Kirkland

Shivam Mehta

Gustav Eje Henter

Joakim Gustafson

Diff-TTSG: Denoising probabilistic integrated speech and gesture synthesis

Shivam Mehta

Siyang Wang

Simon Alexanderson

Gustav Eje Henter

Prosody-controllable spontaneous TTS with neural HMMs

Shivam Mehta

Gustav Eje Henter

Joakim Gustafson

Stereotypical nationality representations in HRI: perspectives from international young adults

Ronald Cumbal

Shivam Mehta

Olov Engwall

Neural HMMs are all you need (for high-quality attention-free TTS)

Shivam Mehta

Gustav Eje Henter

Learning fast with fewer data samples using Neural HMMs

Shivam Mehta

Gustav Eje Henter

Speech data augmentation for improving phoneme transcriptions of aphasic speech using wav2vec 2.0 for the psst challenge

Birger Moëll

Jim O'Regan

Shivam Mehta

Ambika Kirkland

Joakim Gustafson

Spontaneous Neural HMM TTS with Prosodic Feature Modification

Shivam Mehta

Gustav Eje Henter

Ambika Kirkland

Birger Moëll

Jim O'Regan

Finding the Blank with Sequence Labeling for English Learning

Shivam Mehta

Ivan Smetannikov

Co-Authors

joakim gustafson

Olov Engwall

Gustav Eje Henter

Dr. Eva Szekely

Dr. Gaurav Raj

Simon Alexanderson