Devi Parikh

Devi Parikh

Georgia Institute of Technology

H-index: 84

North America-United States

About Devi Parikh

Devi Parikh, With an exceptional h-index of 84 and a recent h-index of 72 (since 2020), a distinguished researcher at Georgia Institute of Technology, specializes in the field of Generative AI, AI for Creativity, Computer Vision, Natural Language Processing.

His recent articles reflect a diverse array of research interests and contributions to the field:

Generating audio files from text input

Video Editing via Factorized Diffusion Distillation

Make-an-animation: Large-scale text-conditional 3d human motion generation

Emu video: Factorizing text-to-video generation by explicit image conditioning

Spatext: Spatio-textual representation for controllable image generation

Emu edit: Precise image editing via recognition and generation tasks

Emu: Enhancing image generation models using photogenic needles in a haystack

Text-conditional contextualized avatars for zero-shot personalization

Devi Parikh Information

University

Position

Facebook AI Research

Citations(all)

64402

Citations(since 2020)

54810

Cited By

25222

hIndex(all)

84

hIndex(since 2020)

72

i10Index(all)

180

i10Index(since 2020)

150

Email

University Profile Page

Georgia Institute of Technology

Google Scholar

View Google Scholar Profile

Devi Parikh Skills & Research Interests

Generative AI

AI for Creativity

Computer Vision

Natural Language Processing

Top articles of Devi Parikh

Title

Journal

Author(s)

Publication Date

Generating audio files from text input

2024/4/4

Video Editing via Factorized Diffusion Distillation

arXiv preprint arXiv:2403.09334

Uriel Singer

Amit Zohar

Yuval Kirstain

Shelly Sheynin

Adam Polyak

...

2024/3/14

Make-an-animation: Large-scale text-conditional 3d human motion generation

Samaneh Azadi

Akbar Shah

Thomas Hayes

Devi Parikh

Sonal Gupta

2023

Emu video: Factorizing text-to-video generation by explicit image conditioning

arXiv preprint arXiv:2311.10709

Rohit Girdhar

Mannat Singh

Andrew Brown

Quentin Duval

Samaneh Azadi

...

2023/11/17

Spatext: Spatio-textual representation for controllable image generation

Omri Avrahami

Thomas Hayes

Oran Gafni

Sonal Gupta

Yaniv Taigman

...

2023

Emu edit: Precise image editing via recognition and generation tasks

arXiv preprint arXiv:2311.10089

Shelly Sheynin

Adam Polyak

Uriel Singer

Yuval Kirstain

Amit Zohar

...

2023/11/16

Emu: Enhancing image generation models using photogenic needles in a haystack

arXiv preprint arXiv:2309.15807

Xiaoliang Dai

Ji Hou

Chih-Yao Ma

Sam Tsai

Jialiang Wang

...

2023/9/27

Text-conditional contextualized avatars for zero-shot personalization

arXiv preprint arXiv:2304.07410

Samaneh Azadi

Thomas Hayes

Akbar Shah

Guan Pang

Devi Parikh

...

2023/4/14

Text-to-4d dynamic scene generation

arXiv preprint arXiv:2301.11280

Uriel Singer

Shelly Sheynin

Adam Polyak

Oron Ashual

Iurii Makarov

...

2023/1/26

Make-a-video: Text-to-video generation without text-video data

arXiv preprint arXiv:2209.14792

Uriel Singer

Adam Polyak

Thomas Hayes

Xi Yin

Jie An

...

2022/9/29

Task-Specific Text Generation Based On Multimodal Inputs

2022/7/14

Mugen: A playground for video-audio-text multimodal understanding and generation

Thomas Hayes

Songyang Zhang

Xi Yin

Guan Pang

Sasha Sheng

...

2022/10/23

Episodic memory question answering

Samyak Datta

Sameer Dharur

Vincent Cartillier

Ruta Desai

Mukul Khanna

...

2022

Long video generation with time-agnostic vqgan and time-sensitive transformer

Songwei Ge

Thomas Hayes

Harry Yang

Xi Yin

Guan Pang

...

2022/10/23

Make-a-scene: Scene-based text-to-image generation with human priors

Oran Gafni

Adam Polyak

Oron Ashual

Shelly Sheynin

Devi Parikh

...

2022/10/23

Audiogen: Textually guided audio generation

arXiv preprint arXiv:2209.15352

Felix Kreuk

Gabriel Synnaeve

Adam Polyak

Uriel Singer

Alexandre Défossez

...

2022/9/30

The Open Catalyst Challenge 2021: Competition Report.

Abhishek Das

Muhammed Shuaibi

Aini Palizhati

Siddharth Goyal

Aditya Grover

...

2021

Visual conceptual blending with large-scale language and vision models

arXiv preprint arXiv:2106.14127

Songwei Ge

Devi Parikh

2021/6/27

Telling creative stories using generative visual aids

arXiv preprint arXiv:2110.14810

Safinah Ali

Devi Parikh

2021/10/27

Vx2text: End-to-end learning of video-based text generation from multimodal inputs

Xudong Lin

Gedas Bertasius

Jue Wang

Shih-Fu Chang

Devi Parikh

...

2021

See List of Professors in Devi Parikh University(Georgia Institute of Technology)

Co-Authors

H-index: 121
Jiebo Luo

Jiebo Luo

University of Rochester

H-index: 73
Tsuhan Chen

Tsuhan Chen

National University of Singapore

H-index: 13
Harsh Agrawal

Harsh Agrawal

Georgia Institute of Technology

H-index: 9
Prithvijit Chattopadhyay

Prithvijit Chattopadhyay

Georgia Institute of Technology

academic-engine