Xiang Bai

About Xiang Bai

Xiang Bai is a distinguished researcher at Huazhong University of Science and Technology (HUST) who specializes in computer vision and OCR. He has an h-index of 97 overall and 83 since 2020.

His recent articles reflect a diverse array of research interests and contributions to the field:

Class-Aware Mask-guided feature refinement for scene text recognition

DSText V2: A comprehensive video text spotting dataset for dense and small text

VimTS: A Unified Video and Image Text Spotter for Enhancing the Cross-domain Generalization

TextSquare: Scaling up Text-Centric Visual Instruction Tuning

Maskstr: Guide Scene Text Recognition Models with Masking

Bridging the Gap Between End-to-End and Two-Step Text Spotting

OmniParser: A Unified Framework for Text Spotting, Key Information Extraction and Table Recognition

PSALM: Pixelwise SegmentAtion with Large Multi-Modal Model

Xiang Bai Information

University

Position

(HUST)

Citations (all): 39788

Citations (since 2020): 31629

Cited by: 18720

h-index (all): 97

h-index (since 2020): 83

i10-index (all): 229

i10-index (since 2020): 198

Xiang Bai Skills & Research Interests

Computer Vision

OCR

Top articles of Xiang Bai

Class-Aware Mask-guided feature refinement for scene text recognition

Pattern Recognition

2024/5/1

DSText V2: A comprehensive video text spotting dataset for dense and small text

Pattern Recognition

2024/5/1

VimTS: A Unified Video and Image Text Spotter for Enhancing the Cross-domain Generalization

arXiv preprint arXiv:2404.19652

2024/4/30

TextSquare: Scaling up Text-Centric Visual Instruction Tuning

arXiv preprint arXiv:2404.12803

2024/4/19

Maskstr: Guide Scene Text Recognition Models with Masking

2024/4/14

Bridging the Gap Between End-to-End and Two-Step Text Spotting

arXiv preprint arXiv:2404.04624

2024/4/6

OmniParser: A Unified Framework for Text Spotting, Key Information Extraction and Table Recognition

arXiv preprint arXiv:2403.19128

2024/3/28

PSALM: Pixelwise SegmentAtion with Large Multi-Modal Model

arXiv preprint arXiv:2403.14598

2024/3/21

Zheng Zhang

H-Index: 1

Xiang Bai

H-Index: 63

Turning a CLIP model into a scene text spotter

IEEE Transactions on Pattern Analysis and Machine Intelligence

2024/3/20

Anomaly Detection by Adapting a pre-trained Vision Language Model

arXiv preprint arXiv:2403.09493

2024/3/14

TextMonkey: An OCR-free large multimodal model for understanding document

arXiv preprint arXiv:2403.04473

2024/3/7

Dynamic Adapter Meets Prompt Tuning: Parameter-Efficient Transfer Learning for Point Cloud Analysis

IEEE/CVF Computer Vision and Pattern Recognition Conference (CVPR)

2024/3/3

PointMamba: A Simple State Space Model for Point Cloud Analysis

arXiv preprint arXiv:2402.10739

2024/2/16

Query-based Temporal Fusion with Explicit Motion for 3D Object Detection

2023/11/2

Xiang Bai

H-Index: 63

Sequential visual and semantic consistency for semi-supervised text recognition

Pattern Recognition Letters

2024/2/1

CauESC: A Causal Aware Model for Emotional Support Conversation

arXiv preprint arXiv:2401.17755

2024/1/31

An open dataset for oracle bone script recognition and decipherment

arXiv preprint arXiv:2401.15365

2024/1/27

An open dataset for the evolution of oracle bone characters: EVOBC

arXiv preprint arXiv:2401.12467

2024/1/23

SwinTextSpotter v2: Towards Better Synergy for Scene Text Spotting

arXiv preprint arXiv:2401.07641

2024/1/15

SingleInsert: Inserting new concepts from a single image into text-to-image models for flexible editing

arXiv preprint arXiv:2310.08094

2023/10/12
