Devi Parikh
Georgia Institute of Technology
H-index: 84
North America-United States
Top articles of Devi Parikh
Title | Journal | Author(s) | Publication Date |
---|---|---|---|
Generating audio files from text input | 2024/4/4 | ||
Video Editing via Factorized Diffusion Distillation | arXiv preprint arXiv:2403.09334 | Uriel Singer Amit Zohar Yuval Kirstain Shelly Sheynin Adam Polyak | 2024/3/14 |
Make-an-animation: Large-scale text-conditional 3d human motion generation | Samaneh Azadi Akbar Shah Thomas Hayes Devi Parikh Sonal Gupta | 2023 | |
Emu video: Factorizing text-to-video generation by explicit image conditioning | arXiv preprint arXiv:2311.10709 | Rohit Girdhar Mannat Singh Andrew Brown Quentin Duval Samaneh Azadi | 2023/11/17 |
Spatext: Spatio-textual representation for controllable image generation | Omri Avrahami Thomas Hayes Oran Gafni Sonal Gupta Yaniv Taigman | 2023 | |
Emu edit: Precise image editing via recognition and generation tasks | arXiv preprint arXiv:2311.10089 | Shelly Sheynin Adam Polyak Uriel Singer Yuval Kirstain Amit Zohar | 2023/11/16 |
Emu: Enhancing image generation models using photogenic needles in a haystack | arXiv preprint arXiv:2309.15807 | Xiaoliang Dai Ji Hou Chih-Yao Ma Sam Tsai Jialiang Wang | 2023/9/27 |
Text-conditional contextualized avatars for zero-shot personalization | arXiv preprint arXiv:2304.07410 | Samaneh Azadi Thomas Hayes Akbar Shah Guan Pang Devi Parikh | 2023/4/14 |
Text-to-4d dynamic scene generation | arXiv preprint arXiv:2301.11280 | Uriel Singer Shelly Sheynin Adam Polyak Oron Ashual Iurii Makarov | 2023/1/26 |
Make-a-video: Text-to-video generation without text-video data | arXiv preprint arXiv:2209.14792 | Uriel Singer Adam Polyak Thomas Hayes Xi Yin Jie An | 2022/9/29 |
Task-Specific Text Generation Based On Multimodal Inputs | 2022/7/14 | ||
Mugen: A playground for video-audio-text multimodal understanding and generation | Thomas Hayes Songyang Zhang Xi Yin Guan Pang Sasha Sheng | 2022/10/23 | |
Episodic memory question answering | Samyak Datta Sameer Dharur Vincent Cartillier Ruta Desai Mukul Khanna | 2022 | |
Long video generation with time-agnostic vqgan and time-sensitive transformer | Songwei Ge Thomas Hayes Harry Yang Xi Yin Guan Pang | 2022/10/23 | |
Make-a-scene: Scene-based text-to-image generation with human priors | Oran Gafni Adam Polyak Oron Ashual Shelly Sheynin Devi Parikh | 2022/10/23 | |
Audiogen: Textually guided audio generation | arXiv preprint arXiv:2209.15352 | Felix Kreuk Gabriel Synnaeve Adam Polyak Uriel Singer Alexandre Défossez | 2022/9/30 |
The Open Catalyst Challenge 2021: Competition Report. | Abhishek Das Muhammed Shuaibi Aini Palizhati Siddharth Goyal Aditya Grover | 2021 | |
Visual conceptual blending with large-scale language and vision models | arXiv preprint arXiv:2106.14127 | Songwei Ge Devi Parikh | 2021/6/27 |
Telling creative stories using generative visual aids | arXiv preprint arXiv:2110.14810 | Safinah Ali Devi Parikh | 2021/10/27 |
Vx2text: End-to-end learning of video-based text generation from multimodal inputs | Xudong Lin Gedas Bertasius Jue Wang Shih-Fu Chang Devi Parikh | 2021 |