Ruslan Salakhutdinov
Carnegie Mellon University
H-index: 115
North America-United States
Top articles of Ruslan Salakhutdinov
Title | Journal | Author(s) | Publication Date |
---|---|---|---|
Automated Black-box Prompt Engineering for Personalized Text-to-Image Generation | arXiv preprint arXiv:2403.19103 | Yutong He Alexander Robey Naoki Murata Yiding Jiang Joshua Williams | 2024/3/28 |
SPRING: Studying Papers and Reasoning to play Games | Yue Wu So Yeon Min Shrimai Prabhumoye Yonatan Bisk Ruslan Salakhutdinov | 2023 | |
Inference via Interpolation: Contrastive Representations Provably Enable Planning and Inference | arXiv preprint arXiv:2403.04082 | Benjamin Eysenbach Vivek Myers Ruslan Salakhutdinov Sergey Levine | 2024/3/6 |
Factorized contrastive learning: Going beyond multi-view redundancy | Advances in Neural Information Processing Systems | Paul Pu Liang Zihao Deng Martin Q Ma James Y Zou Louis-Philippe Morency | 2024/2/13 |
Automatic question-answer generation for long-tail knowledge | arXiv preprint arXiv:2403.01382 | Rohan Kumar Youngmin Kim Sunitha Ravi Haitian Sun Christos Faloutsos | 2024/3/3 |
Generating images with multimodal language models | Advances in Neural Information Processing Systems | Jing Yu Koh Daniel Fried Russ R Salakhutdinov | 2024/2/13 |
OmniACT: A Dataset and Benchmark for Enabling Multimodal Generalist Autonomous Agents for Desktop and Web | arXiv preprint arXiv:2402.17553 | Raghav Kapoor Yash Parag Butala Melisa Russak Jing Yu Koh Kiran Kamble | 2024/2/27 |
Stylus: Automatic Adapter Selection for Diffusion Models | arXiv preprint arXiv:2404.18928 | Michael Luo Justin Wong Brandon Trabucco Yanping Huang Joseph E Gonzalez | 2024/4/29 |
Visualwebarena: Evaluating multimodal agents on realistic visual web tasks | arXiv preprint arXiv:2401.13649 | Jing Yu Koh Robert Lo Lawrence Jang Vikram Duvvur Ming Chong Lim | 2024/1/24 |
Quantifying & Modeling Multimodal Interactions: An Information Decomposition Framework | Advances in Neural Information Processing Systems | Paul Pu Liang Yun Cheng Xiang Fan Chun Kai Ling Suzanne Nie | 2024/2/13 |
AgentKit: Flow Engineering with Graphs, not Coding | arXiv preprint arXiv:2404.11483 | Yue Wu Yewen Fan So Yeon Min Shrimai Prabhumoye Stephen McAleer | 2024/4/17 |
Imitating task and motion planning with visuomotor transformers | arXiv preprint arXiv:2305.16309 | Murtaza Dalal Ajay Mandlekar Caelan Garrett Ankur Handa Ruslan Salakhutdinov | 2023/5/25 |
Effective data augmentation with diffusion models | arXiv preprint arXiv:2302.07944 | Brandon Trabucco Kyle Doherty Max Gurinas Ruslan Salakhutdinov | 2023/2/7 |
Grounding language models to images for multimodal inputs and outputs | Jing Yu Koh Ruslan Salakhutdinov Daniel Fried | 2023/7/3 | |
Self-Supervised Object Goal Navigation with In-Situ Finetuning | So Yeon Min Yao-Hung Hubert Tsai Wei Ding Ali Farhadi Ruslan Salakhutdinov | 2023/10/1 | |
Stabilizing Contrastive RL: Techniques for Robotic Goal Reaching from Offline Data | Chongyi Zheng Benjamin Eysenbach Homer Rich Walke Patrick Yin Kuan Fang | 2023/10/13 | |
Manifold preserving guided diffusion | arXiv preprint arXiv:2311.16424 | Yutong He Naoki Murata Chieh-Hsin Lai Yuhta Takida Toshimitsu Uesaka | 2023/11/28 |
Quantifying & modeling feature interactions: An information decomposition framework | arXiv e-prints | Paul Pu Liang Yun Cheng Xiang Fan Chun Kai Ling Suzanne Nie | 2023/2 |
Spring: Gpt-4 out-performs rl algorithms by studying papers and reasoning | arXiv preprint arXiv:2305.15486 | Yue Wu So Yeon Min Shrimai Prabhumoye Yonatan Bisk Ruslan Salakhutdinov | 2023/5/24 |
Localized text-to-image generation for free via cross attention control | arXiv preprint arXiv:2306.14636 | Yutong He Ruslan Salakhutdinov J Zico Kolter | 2023/6/26 |