Yu-Gang Jiang
Fudan University
H-index: 78
Asia-China
Top articles of Yu-Gang Jiang
LRANet: Towards Accurate and Efficient Scene Text Detection with Low-Rank Approximation Network
Proceedings of the AAAI Conference on Artificial Intelligence
2024/3/24
Instance-aware multi-camera 3D object detection with structural priors mining and self-boosting learning
Proceedings of the AAAI Conference on Artificial Intelligence
2024/3/24
Yang Jiao
H-Index: 1
Shaoxiang Chen
H-Index: 7
Jingjing Chen
H-Index: 4
Lin Ma
H-Index: 23
Yu-Gang Jiang
H-Index: 45
NuScenes-QA: A multi-modal visual question answering benchmark for autonomous driving scenario
Proceedings of the AAAI Conference on Artificial Intelligence
2024/3/24
FDGaussian: Fast Gaussian splatting from single image via geometric-aware diffusion model
arXiv preprint arXiv:2403.10242
2024/3/15
Zuxuan Wu
H-Index: 22
Yu-Gang Jiang
H-Index: 45
Lumen: Unleashing Versatile Vision-Centric Capabilities of Large Multimodal Models
arXiv preprint arXiv:2403.07304
2024/3/12
Yang Jiao
H-Index: 1
Shaoxiang Chen
H-Index: 7
Jingjing Chen
H-Index: 4
Lin Ma
H-Index: 23
Yu-Gang Jiang
H-Index: 45
Doubly Abductive Counterfactual Inference for Text-based Image Editing
arXiv preprint arXiv:2403.02981
2024/3/5
Learning from Rich Semantics and Coarse Locations for Long-tailed Object Detection
arXiv preprint arXiv:2310.12152
2023/10/18
Multi-prompt alignment for multi-source unsupervised domain adaptation
arXiv preprint arXiv:2209.15210
2022/9/30
CDistNet: Perceiving multi-domain character distance for robust text recognition
International Journal of Computer Vision
2024/2
Instruction-Guided Scene Text Recognition
arXiv preprint arXiv:2401.17851
2024/1/31
Zhineng Chen
H-Index: 13
Yu-Gang Jiang
H-Index: 45
MouSi: Poly-Visual-Expert Vision-Language Models
arXiv preprint arXiv:2401.17221
2024/1/30
Building an Open-Vocabulary Video CLIP Model With Better Architectures, Optimization and Data
arXiv preprint arXiv:2310.05010
2023/10/8
Identity-Driven Multimedia Forgery Detection via Reference Assistance
arXiv preprint arXiv:2401.11764
2024/1/22
GaussianBody: Clothed Human Reconstruction via 3D Gaussian Splatting
arXiv preprint arXiv:2401.09720
2024/1/18
Secrets of RLHF in large language models Part II: Reward modeling
arXiv preprint arXiv:2401.06080
2024/1/11
PoseAnimate: Zero-shot high fidelity pose controllable character animation
arXiv preprint arXiv:2404.13680
2024/4/21
Eyes Can Deceive: Benchmarking Counterfactual Reasoning Abilities of Multi-modal Large Language Models
arXiv preprint arXiv:2404.12966
2024/4/19
The Dog Walking Theory: Rethinking Convergence in Federated Learning
arXiv preprint arXiv:2404.11888
2024/4/18
Learning to Rank Patches for Unbiased Image Redundancy Reduction
arXiv preprint arXiv:2404.00680
2024/3/31
Yang Luo
H-Index: 3
Zhineng Chen
H-Index: 13
Peng Zhou
H-Index: 12
Zuxuan Wu
H-Index: 22
Yu-Gang Jiang
H-Index: 45
OmniViD: A Generative Framework for Universal Video Understanding
2024/3/26