Zhaoyang Zeng
Sun Yat-Sen University
H-index: 12
Asia-China
Top articles of Zhaoyang Zeng
Title | Journal | Author(s) | Publication Date |
---|---|---|---|
Grounded SAM: Assembling open-world models for diverse visual tasks | arXiv preprint arXiv:2401.14159 | Tianhe Ren Shilong Liu Ailing Zeng Jing Lin Kunchang Li | 2024/1/25 |
T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy | arXiv preprint arXiv:2403.14610 | Qing Jiang Feng Li Zhaoyang Zeng Tianhe Ren Shilong Liu | 2024/3/21 |
TAPTR: Tracking Any Point with Transformers as Detection | arXiv preprint arXiv:2403.13042 | Hongyang Li Hao Zhang Shilong Liu Zhaoyang Zeng Tianhe Ren | 2024/3/19 |
DFA3D: 3D Deformable Attention For 2D-to-3D Feature Lifting | | Hongyang Li Hao Zhang Zhaoyang Zeng Shilong Liu Feng Li | 2023 |
T-Rex: Counting by Visual Prompting | arXiv preprint arXiv:2311.13596 | Qing Jiang Feng Li Tianhe Ren Shilong Liu Zhaoyang Zeng | 2023/11/22 |
Detection transformer with stable matching | | Shilong Liu Tianhe Ren Jiayu Chen Zhaoyang Zeng Hao Zhang | 2023 |
SMP Challenge: An Overview and Analysis of Social Media Prediction Challenge | | Bo Wu Peiye Liu Wen-Huang Cheng Bei Liu Zhaoyang Zeng | 2023/10/26 |
detrex: Benchmarking detection transformers | arXiv preprint arXiv:2306.07265 | Tianhe Ren Shilong Liu Feng Li Hao Zhang Ailing Zeng | 2023/6/12 |
A strong and reproducible object detector with only public datasets | arXiv preprint arXiv:2304.13027 | Tianhe Ren Jianwei Yang Shilong Liu Ailing Zeng Feng Li | 2023/4/25 |
Grounding DINO: Marrying DINO with grounded pre-training for open-set object detection | arXiv preprint arXiv:2303.05499 | Shilong Liu Zhaoyang Zeng Tianhe Ren Feng Li Hao Zhang | 2023/3/9 |
Stay in Grid: Improving Video Captioning via Fully Grid-Level Representation | IEEE Transactions on Circuits and Systems for Video Technology | Mingkang Tang Zhanyu Wang Zhaoyang Zeng Xiu Li Luping Zhou | 2022/12/27 |
Tencent-mvse: A large-scale benchmark dataset for multi-modal video similarity evaluation | | Zhaoyang Zeng Yongsheng Luo Zhenhua Liu Fengyun Rao Dian Li | 2022 |
Seeing out of the box: End-to-end pre-training for vision-language representation learning | | Zhicheng Huang Zhaoyang Zeng Yupan Huang Bei Liu Dongmei Fu | 2021 |
Multi-modal representation learning for video advertisement content structuring | | Daya Guo Zhaoyang Zeng | 2021/10/17 |
Clip4caption++: Multi-clip for video caption | arXiv preprint arXiv:2110.05204 | Mingkang Tang Zhanyu Wang Zhaoyang Zeng Fengyun Rao Dian Li | 2021/10/11 |
Be specific, be clear: Bridging machine and human captions by scene-guided transformer | | Yupan Huang Zhaoyang Zeng Yutong Lu | 2021/8/21 |
Reference-based defect detection network | IEEE Transactions on Image Processing | Zhaoyang Zeng Bei Liu Jianlong Fu Hongyang Chao | 2021/7/19 |
Contrastive learning of global and local video representations | Advances in Neural Information Processing Systems | Zhaoyang Zeng Daniel McDuff Yale Song | 2021/12/6 |
GarbageNet: a unified learning framework for robust garbage classification | IEEE Transactions on Artificial Intelligence | Jianfei Yang Zhaoyang Zeng Kai Wang Han Zou Lihua Xie | 2021/5/18 |
Mind the discriminability: Asymmetric adversarial domain adaptation | | Jianfei Yang Han Zou Yuxun Zhou Zhaoyang Zeng Lihua Xie | 2020/8/23 |