Qi Zheng
8
Papers
185
Total Citations
Papers (8)
LayoutLLM: Layout Instruction Tuning with Large Language Models for Document Understanding
CVPR 2024
98
citations
VisualAgentBench: Towards Large Multimodal Models as Visual Foundation Agents
ICLR 2025
67
citations
A Simple yet Effective Layout Token in Large Language Models for Document Understanding
CVPR 2025arXiv
7
citations
ProcTag: Process Tagging for Assessing the Efficacy of Document Instruction Data
AAAI 2025
6
citations
WebRPG: Automatic Web Rendering Parameters Generation for Visual Presentation
ECCV 2024arXiv
5
citations
End-to-End HOI Reconstruction Transformer with Graph-based Encoding
CVPR 2025
1
citations
ST-ReP: Learning Predictive Representations Efficiently for Spatial-Temporal Forecasting
AAAI 2025
1
citations
Frequency-Biased Synergistic Design for Image Compression and Compensation
CVPR 2025
0
citations