Zhibo Yang
5 Papers, 70 Total Citations
Papers (5)
CC-OCR: A Comprehensive and Challenging OCR Benchmark for Evaluating Large Multimodal Models in Literacy
ICCV 2025 (arXiv)
42 citations
Unifying Top-down and Bottom-up Scanpath Prediction Using Transformers
CVPR 2024
22 citations
DocThinker: Explainable Multimodal Large Language Models with Rule-based Reinforcement Learning for Document Understanding
ICCV 2025 (arXiv)
4 citations
Platypus: A Generalized Specialist Model for Reading Text in Various Forms
ECCV 2024 (arXiv)
2 citations
OmniParser: A Unified Framework for Text Spotting, Key Information Extraction and Table Recognition
CVPR 2024
0 citations