ProfessorsProfessors of King Abdullah University of Science and TechnologyMohamed Elhoseiny, Ph.D.

Mohamed Elhoseiny, Ph.D.

King Abdullah University of Science and Technology

H-index: 36

Asia-Saudi Arabia

About Mohamed Elhoseiny, Ph.D.

Mohamed Elhoseiny, Ph.D., With an exceptional h-index of 36 and a recent h-index of 33 (since 2020), a distinguished researcher at King Abdullah University of Science and Technology, specializes in the field of Zero-Shot Learning, Few-Shot Learning, Computer Vision, Computational Creativity, Vision and Language.

His recent articles reflect a diverse array of research interests and contributions to the field:

A Hybrid Graph Network for Complex Activity Detection in Video

Continual Learning on a Diet: Learning from Sparsely Labeled Streams Under Constrained Computation

MiniGPT4-Video: Advancing Multimodal LLMs for Video Understanding with Interleaved Visual-Textual Tokens

ImageCaptioner2: Image Captioner for Image Captioning Bias Amplification Assessment

AI Art Neural Constellation: Revealing the Collective and Contrastive State of AI-Generated and Human Art

Efficiently Disentangle Causal Representations

Mammalnet: A large-scale video benchmark for mammal recognition and behavior understanding

OxfordTVG-HIC: Can Machine Make Humorous Captions from Images?

Mohamed Elhoseiny, Ph.D. Information

University	King Abdullah University of Science and Technology
Position	Assistant Professor (hiring postdoc & grad
Citations(all)	9237
Citations(since 2020)	8501
Cited By	2688
hIndex(all)	36
hIndex(since 2020)	33
i10Index(all)	55
i10Index(since 2020)	52
Email	Access Email
University Profile Page	King Abdullah University of Science and Technology
Google Scholar	View Google Scholar Profile

Mohamed Elhoseiny, Ph.D. Skills & Research Interests

Zero-Shot Learning

Few-Shot Learning

Computer Vision

Computational Creativity

Vision and Language

Top articles of Mohamed Elhoseiny, Ph.D.

Title	Journal	Author(s)	Publication Date
A Hybrid Graph Network for Complex Activity Detection in Video		Salman Khan Izzeddin Teeti Andrew Bradley Mohamed Elhoseiny Fabio Cuzzolin	2024
Continual Learning on a Diet: Learning from Sparsely Labeled Streams Under Constrained Computation	arXiv preprint arXiv:2404.12766	Wenxuan Zhang Youssef Mohamed Bernard Ghanem Philip HS Torr Adel Bibi ...	2024/4/19
MiniGPT4-Video: Advancing Multimodal LLMs for Video Understanding with Interleaved Visual-Textual Tokens	arXiv preprint arXiv:2404.03413	Kirolos Ataallah Xiaoqian Shen Eslam Abdelrahman Essam Sleiman Deyao Zhu ...	2024/4/4
ImageCaptioner2: Image Captioner for Image Captioning Bias Amplification Assessment	Proceedings of the AAAI Conference on Artificial Intelligence	Eslam Abdelrahman Pengzhan Sun Li Erran Li Mohamed Elhoseiny	2024/3/24
AI Art Neural Constellation: Revealing the Collective and Contrastive State of AI-Generated and Human Art	arXiv preprint arXiv:2402.02453	Faizan Farooq Khan Diana Kim Divyansh Jha Youssef Mohamed Hanna H Chang ...	2024/2/4
Efficiently Disentangle Causal Representations		Yuanpeng Li Joel Hestness Mohamed Elhoseiny Liang Zhao Kenneth Church	2024/1/8
Mammalnet: A large-scale video benchmark for mammal recognition and behavior understanding		Jun Chen Ming Hu Darren J Coker Michael L Berumen Blair Costelloe ...	2023
OxfordTVG-HIC: Can Machine Make Humorous Captions from Images?		Runjia Li Shuyang Sun Mohamed Elhoseiny Philip Torr	2023
Minigpt-4: Enhancing vision-language understanding with advanced large language models	arXiv preprint arXiv:2304.10592 (ICLR2024 paper)	Deyao Zhu Jun Chen Xiaoqian Shen Xiang Li Mohamed Elhoseiny	2023/4/20
Continual Zero-Shot Learning through Semantically Guided Generative Random Walks		Wenxuan Zhang Paul Janson Kai Yi Ivan Skorokhodov Mohamed Elhoseiny	2023
3DCoMPaT: An improved Large-scale 3D Vision Dataset for Compositional Recognition	arXiv preprint arXiv:2310.18511	Habib Slim Xiang Li Yuchen Li Mahmoud Ahmed Mohamed Ayman ...	2023/10/27
Vision-CAIR/ChatCaptioner: Official Repository of ChatCaptioner		Jun Chen Deyao Zhu Kilichbek Haydarov Xiang Li Mohamed Elhoseiny	2023/3/10
Uni3DL: Unified Model for 3D and Language Understanding		Xiang Li Jian Ding Zhaoyang Chen Mohamed Elhoseiny	2023/12/5
ImageCaptioner: Image Captioner for Image Captioning Bias Amplification Assessment	arXiv preprint arXiv:2304.04874	Eslam Mohamed Bakr Pengzhan Sun Li Erran Li Mohamed Elhoseiny	2023/4/10
Exploring Open-Vocabulary Semantic Segmentation without Human Labels	arXiv preprint arXiv:2306.00450	Jun Chen Deyao Zhu Guocheng Qian Bernard Ghanem Zhicheng Yan ...	2023/6/1
Aberration-aware depth-from-focus	IEEE Transactions on Pattern Analysis and Machine Intelligence	Xinge Yang Qiang Fu Mohamed Elhoseiny Wolfgang Heidrich	2023/8/4
Minigpt-v2: large language model as a unified interface for vision-language multi-task learning	arXiv preprint arXiv:2310.09478	Jun Chen Deyao Zhu Xiaoqian Shen Xiang Li Zechun Liu ...	2023/10
Guiding online reinforcement learning with action-free offline pretraining	arXiv preprint arXiv:2301.12876	Deyao Zhu Yuhui Wang Jürgen Schmidhuber Mohamed Elhoseiny	2023/1/30
Video chatcaptioner: Towards the enriched spatiotemporal descriptions	arXiv preprint arXiv:2304.04227	Jun Chen Deyao Zhu Kilichbek Haydarov Xiang Li Mohamed Elhoseiny	2023/4/9
StoryGPT-V: Large Language Models as Consistent Story Visualizers		Xiaoqian Shen Mohamed Elhoseiny	2023/12/4