Qi Zheng
7
Papers
180
Total Citations
Papers (7)
LayoutLLM: Layout Instruction Tuning with Large Language Models for Document Understanding
CVPR 2024
98
citations
VisualAgentBench: Towards Large Multimodal Models as Visual Foundation Agents
ICLR 2025
67
citations
A Simple yet Effective Layout Token in Large Language Models for Document Understanding
CVPR 2025 (arXiv)
7
citations
ProcTag: Process Tagging for Assessing the Efficacy of Document Instruction Data
AAAI 2025
6
citations
End-to-End HOI Reconstruction Transformer with Graph-based Encoding
CVPR 2025
1
citation
ST-ReP: Learning Predictive Representations Efficiently for Spatial-Temporal Forecasting
AAAI 2025
1
citation
Frequency-Biased Synergistic Design for Image Compression and Compensation
CVPR 2025
0
citations