Mohamed Elhoseiny, Ph.D.

About Mohamed Elhoseiny, Ph.D.

Mohamed Elhoseiny, Ph.D., With an exceptional h-index of 36 and a recent h-index of 33 (since 2020), a distinguished researcher at King Abdullah University of Science and Technology, specializes in the field of Zero-Shot Learning, Few-Shot Learning, Computer Vision, Computational Creativity, Vision and Language.

His recent articles reflect a diverse array of research interests and contributions to the field:

A Hybrid Graph Network for Complex Activity Detection in Video

Continual Learning on a Diet: Learning from Sparsely Labeled Streams Under Constrained Computation

MiniGPT4-Video: Advancing Multimodal LLMs for Video Understanding with Interleaved Visual-Textual Tokens

ImageCaptioner2: Image Captioner for Image Captioning Bias Amplification Assessment

AI Art Neural Constellation: Revealing the Collective and Contrastive State of AI-Generated and Human Art

Efficiently Disentangle Causal Representations

Mammalnet: A large-scale video benchmark for mammal recognition and behavior understanding

OxfordTVG-HIC: Can Machine Make Humorous Captions from Images?

Mohamed Elhoseiny, Ph.D. Information

University

Position

Assistant Professor (hiring postdoc & grad

Citations(all)

9237

Citations(since 2020)

8501

Cited By

2688

hIndex(all)

36

hIndex(since 2020)

33

i10Index(all)

55

i10Index(since 2020)

52

Email

University Profile Page

King Abdullah University of Science and Technology

Google Scholar

View Google Scholar Profile

Mohamed Elhoseiny, Ph.D. Skills & Research Interests

Zero-Shot Learning

Few-Shot Learning

Computer Vision

Computational Creativity

Vision and Language

Top articles of Mohamed Elhoseiny, Ph.D.

Title

Journal

Author(s)

Publication Date

A Hybrid Graph Network for Complex Activity Detection in Video

Salman Khan

Izzeddin Teeti

Andrew Bradley

Mohamed Elhoseiny

Fabio Cuzzolin

2024

Continual Learning on a Diet: Learning from Sparsely Labeled Streams Under Constrained Computation

arXiv preprint arXiv:2404.12766

Wenxuan Zhang

Youssef Mohamed

Bernard Ghanem

Philip HS Torr

Adel Bibi

...

2024/4/19

MiniGPT4-Video: Advancing Multimodal LLMs for Video Understanding with Interleaved Visual-Textual Tokens

arXiv preprint arXiv:2404.03413

Kirolos Ataallah

Xiaoqian Shen

Eslam Abdelrahman

Essam Sleiman

Deyao Zhu

...

2024/4/4

ImageCaptioner2: Image Captioner for Image Captioning Bias Amplification Assessment

Proceedings of the AAAI Conference on Artificial Intelligence

Eslam Abdelrahman

Pengzhan Sun

Li Erran Li

Mohamed Elhoseiny

2024/3/24

AI Art Neural Constellation: Revealing the Collective and Contrastive State of AI-Generated and Human Art

arXiv preprint arXiv:2402.02453

Faizan Farooq Khan

Diana Kim

Divyansh Jha

Youssef Mohamed

Hanna H Chang

...

2024/2/4

Efficiently Disentangle Causal Representations

Yuanpeng Li

Joel Hestness

Mohamed Elhoseiny

Liang Zhao

Kenneth Church

2024/1/8

Mammalnet: A large-scale video benchmark for mammal recognition and behavior understanding

Jun Chen

Ming Hu

Darren J Coker

Michael L Berumen

Blair Costelloe

...

2023

OxfordTVG-HIC: Can Machine Make Humorous Captions from Images?

Runjia Li

Shuyang Sun

Mohamed Elhoseiny

Philip Torr

2023

Minigpt-4: Enhancing vision-language understanding with advanced large language models

arXiv preprint arXiv:2304.10592 (ICLR2024 paper)

Deyao Zhu

Jun Chen

Xiaoqian Shen

Xiang Li

Mohamed Elhoseiny

2023/4/20

Continual Zero-Shot Learning through Semantically Guided Generative Random Walks

Wenxuan Zhang

Paul Janson

Kai Yi

Ivan Skorokhodov

Mohamed Elhoseiny

2023

3DCoMPaT: An improved Large-scale 3D Vision Dataset for Compositional Recognition

arXiv preprint arXiv:2310.18511

Habib Slim

Xiang Li

Yuchen Li

Mahmoud Ahmed

Mohamed Ayman

...

2023/10/27

Vision-CAIR/ChatCaptioner: Official Repository of ChatCaptioner

Jun Chen

Deyao Zhu

Kilichbek Haydarov

Xiang Li

Mohamed Elhoseiny

2023/3/10

Uni3DL: Unified Model for 3D and Language Understanding

Xiang Li

Jian Ding

Zhaoyang Chen

Mohamed Elhoseiny

2023/12/5

ImageCaptioner: Image Captioner for Image Captioning Bias Amplification Assessment

arXiv preprint arXiv:2304.04874

Eslam Mohamed Bakr

Pengzhan Sun

Li Erran Li

Mohamed Elhoseiny

2023/4/10

Exploring Open-Vocabulary Semantic Segmentation without Human Labels

arXiv preprint arXiv:2306.00450

Jun Chen

Deyao Zhu

Guocheng Qian

Bernard Ghanem

Zhicheng Yan

...

2023/6/1

Aberration-aware depth-from-focus

IEEE Transactions on Pattern Analysis and Machine Intelligence

Xinge Yang

Qiang Fu

Mohamed Elhoseiny

Wolfgang Heidrich

2023/8/4

Minigpt-v2: large language model as a unified interface for vision-language multi-task learning

arXiv preprint arXiv:2310.09478

Jun Chen

Deyao Zhu

Xiaoqian Shen

Xiang Li

Zechun Liu

...

2023/10

Guiding online reinforcement learning with action-free offline pretraining

arXiv preprint arXiv:2301.12876

Deyao Zhu

Yuhui Wang

Jürgen Schmidhuber

Mohamed Elhoseiny

2023/1/30

Video chatcaptioner: Towards the enriched spatiotemporal descriptions

arXiv preprint arXiv:2304.04227

Jun Chen

Deyao Zhu

Kilichbek Haydarov

Xiang Li

Mohamed Elhoseiny

2023/4/9

StoryGPT-V: Large Language Models as Consistent Story Visualizers

Xiaoqian Shen

Mohamed Elhoseiny

2023/12/4

See List of Professors in Mohamed Elhoseiny, Ph.D. University(King Abdullah University of Science and Technology)

Co-Authors

H-index: 131
Philip Torr

Philip Torr

University of Oxford

H-index: 77
Tinne Tuytelaars

Tinne Tuytelaars

Katholieke Universiteit Leuven

H-index: 56
Ahmed Elgammal

Ahmed Elgammal

Rutgers, The State University of New Jersey

H-index: 27
Boyang "Albert" Li

Boyang "Albert" Li

Nanyang Technological University

H-index: 14
Panos Achlioptas

Panos Achlioptas

Stanford University

H-index: 13
Ivan Skorokhodov

Ivan Skorokhodov

Moscow Institute of Physics and Technology

academic-engine