Mohamed Elhoseiny, Ph.D.
King Abdullah University of Science and Technology
H-index: 36
Asia-Saudi Arabia
Top articles of Mohamed Elhoseiny, Ph.D.
Title | Journal | Author(s) | Publication Date |
---|---|---|---|
A Hybrid Graph Network for Complex Activity Detection in Video | Salman Khan Izzeddin Teeti Andrew Bradley Mohamed Elhoseiny Fabio Cuzzolin | 2024 | |
Continual Learning on a Diet: Learning from Sparsely Labeled Streams Under Constrained Computation | arXiv preprint arXiv:2404.12766 | Wenxuan Zhang Youssef Mohamed Bernard Ghanem Philip HS Torr Adel Bibi | 2024/4/19 |
MiniGPT4-Video: Advancing Multimodal LLMs for Video Understanding with Interleaved Visual-Textual Tokens | arXiv preprint arXiv:2404.03413 | Kirolos Ataallah Xiaoqian Shen Eslam Abdelrahman Essam Sleiman Deyao Zhu | 2024/4/4 |
ImageCaptioner2: Image Captioner for Image Captioning Bias Amplification Assessment | Proceedings of the AAAI Conference on Artificial Intelligence | Eslam Abdelrahman Pengzhan Sun Li Erran Li Mohamed Elhoseiny | 2024/3/24 |
AI Art Neural Constellation: Revealing the Collective and Contrastive State of AI-Generated and Human Art | arXiv preprint arXiv:2402.02453 | Faizan Farooq Khan Diana Kim Divyansh Jha Youssef Mohamed Hanna H Chang | 2024/2/4 |
Efficiently Disentangle Causal Representations | Yuanpeng Li Joel Hestness Mohamed Elhoseiny Liang Zhao Kenneth Church | 2024/1/8 | |
Mammalnet: A large-scale video benchmark for mammal recognition and behavior understanding | Jun Chen Ming Hu Darren J Coker Michael L Berumen Blair Costelloe | 2023 | |
OxfordTVG-HIC: Can Machine Make Humorous Captions from Images? | Runjia Li Shuyang Sun Mohamed Elhoseiny Philip Torr | 2023 | |
Minigpt-4: Enhancing vision-language understanding with advanced large language models | arXiv preprint arXiv:2304.10592 (ICLR2024 paper) | Deyao Zhu Jun Chen Xiaoqian Shen Xiang Li Mohamed Elhoseiny | 2023/4/20 |
Continual Zero-Shot Learning through Semantically Guided Generative Random Walks | Wenxuan Zhang Paul Janson Kai Yi Ivan Skorokhodov Mohamed Elhoseiny | 2023 | |
3DCoMPaT: An improved Large-scale 3D Vision Dataset for Compositional Recognition | arXiv preprint arXiv:2310.18511 | Habib Slim Xiang Li Yuchen Li Mahmoud Ahmed Mohamed Ayman | 2023/10/27 |
Vision-CAIR/ChatCaptioner: Official Repository of ChatCaptioner | Jun Chen Deyao Zhu Kilichbek Haydarov Xiang Li Mohamed Elhoseiny | 2023/3/10 | |
Uni3DL: Unified Model for 3D and Language Understanding | Xiang Li Jian Ding Zhaoyang Chen Mohamed Elhoseiny | 2023/12/5 | |
ImageCaptioner: Image Captioner for Image Captioning Bias Amplification Assessment | arXiv preprint arXiv:2304.04874 | Eslam Mohamed Bakr Pengzhan Sun Li Erran Li Mohamed Elhoseiny | 2023/4/10 |
Exploring Open-Vocabulary Semantic Segmentation without Human Labels | arXiv preprint arXiv:2306.00450 | Jun Chen Deyao Zhu Guocheng Qian Bernard Ghanem Zhicheng Yan | 2023/6/1 |
Aberration-aware depth-from-focus | IEEE Transactions on Pattern Analysis and Machine Intelligence | Xinge Yang Qiang Fu Mohamed Elhoseiny Wolfgang Heidrich | 2023/8/4 |
Minigpt-v2: large language model as a unified interface for vision-language multi-task learning | arXiv preprint arXiv:2310.09478 | Jun Chen Deyao Zhu Xiaoqian Shen Xiang Li Zechun Liu | 2023/10 |
Guiding online reinforcement learning with action-free offline pretraining | arXiv preprint arXiv:2301.12876 | Deyao Zhu Yuhui Wang Jürgen Schmidhuber Mohamed Elhoseiny | 2023/1/30 |
Video chatcaptioner: Towards the enriched spatiotemporal descriptions | arXiv preprint arXiv:2304.04227 | Jun Chen Deyao Zhu Kilichbek Haydarov Xiang Li Mohamed Elhoseiny | 2023/4/9 |
StoryGPT-V: Large Language Models as Consistent Story Visualizers | Xiaoqian Shen Mohamed Elhoseiny | 2023/12/4 |