Lorenzo Baraldi
Università degli Studi di Modena e Reggio Emilia
H-index: 28
Europe-Italy
Top articles of Lorenzo Baraldi
Wiki-LLaVA: Hierarchical Retrieval-Augmented Generation for Multimodal LLMs
2024
AIGeN: An Adversarial Approach for Instruction Generation in VLN
arXiv preprint arXiv:2404.10054
2024/4/15
Training-Free Open-Vocabulary Segmentation with Offline Diffusion-Augmented Prototype Generation
arXiv preprint arXiv:2404.06542
2024/4/9
Roberto Amoroso
H-Index: 0
Marcella Cornia
H-Index: 13
Lorenzo Baraldi
H-Index: 17
Rita Cucchiara
H-Index: 40
Are Learnable Prompts the Right Way of Prompting? Adapting Vision-and-Language Models with Memory Optimization
IEEE Intelligent Systems
2024
The (R) Evolution of Multimodal Large Language Models: A Survey
2024/2/19
Towards Retrieval-Augmented Architectures for Image Captioning
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING, COMMUNICATIONS AND APPLICATIONS
2024
What's Outside the Intersection? Fine-Grained Error Analysis for Semantic Segmentation Beyond IoU
2024
Roberto Amoroso
H-Index: 0
Lorenzo Baraldi
H-Index: 17
Rita Cucchiara
H-Index: 40
Volker Tresp
H-Index: 36
Matthias Schubert
H-Index: 17
FOSSIL: Free Open-Vocabulary Semantic Segmentation Through Synthetic References Retrieval
2024
Mapping High-level Semantic Regions in Indoor Environments without Object Recognition
2024
Roberto Bigazzi
H-Index: 0
Lorenzo Baraldi
H-Index: 17
Shreyas Kousik
H-Index: 7
Rita Cucchiara
H-Index: 40
Marco Pavone
H-Index: 38
Sharing Cultural Heritage—The Case of the Lodovico Media Library
Multimodal Technologies and Interaction
2023/12/5
Lorenzo Baraldi
H-Index: 17
Generating More Pertinent Captions by Leveraging Semantics and Style on Multi-Source Datasets
International Journal of Computer Vision
2023/12/5
Fully-attentive iterative networks for region-based controllable image and video captioning
Computer Vision and Image Understanding
2023/12/1
Removing NSFW Concepts from Vision-and-Language Models for Text-to-Image Retrieval and Generation
arXiv preprint arXiv:2311.16254
2023/11/27
Samuele Poppi
H-Index: 0
Marcella Cornia
H-Index: 13
Lorenzo Baraldi
H-Index: 17
Rita Cucchiara
H-Index: 40
Let's ViCE! Mimicking Human Cognitive Behavior in Image Generation Evaluation
2023/10/26
Synthcap: Augmenting transformers with synthetic data for image captioning
2023
Enhancing Open-Vocabulary Semantic Segmentation with Prototype Retrieval
2023/9/5
Towards Explainable Navigation and Recounting
2023/9/5
Unveiling the impact of image transformations on deepfake detection: An experimental analysis
2023/9/5
Lorenzo Baraldi
H-Index: 17
Samuele Poppi
H-Index: 0
Marcella Cornia
H-Index: 13
Lorenzo Baraldi
H-Index: 17
Rita Cucchiara
H-Index: 40
Evaluating synthetic pre-Training for handwriting processing tasks
Pattern Recognition Letters
2023/8/1
Learning to Mask and Permute Visual Tokens for Vision Transformer Pre-Training
arXiv preprint arXiv:2306.07346
2023/6/12
Lorenzo Baraldi
H-Index: 17
Roberto Amoroso
H-Index: 0
Marcella Cornia
H-Index: 13
Rita Cucchiara
H-Index: 40