Haoyu Cao
4
Papers
38
Total Citations
Papers (4)
Enhancing Visual Document Understanding with Contrastive Learning in Large Visual-Language Models
CVPR 2024
37
citations
BASIC: Boosting Visual Alignment with Intrinsic Refined Embeddings in Multimodal Large Language Models
ICCV 2025
1
citations
HRVDA: High-Resolution Visual Document Assistant
CVPR 2024
0
citations
Attention Where It Matters: Rethinking Visual Document Understanding with Selective Region Concentration
ICCV 2023
0
citations