Ron Litman
4
Papers
42
Total Citations
Papers (4)
Question Aware Vision Transformer for Multimodal Reasoning
CVPR 2024arXiv
36
citations
VisFocus: Prompt-Guided Vision Encoders for OCR-Free Dense Document Understanding
ECCV 2024arXiv
6
citations
DocVLM: Make Your VLM an Efficient Reader
CVPR 2025arXiv
0
citations
GRAM: Global Reasoning for Multi-Page VQA
CVPR 2024arXiv
0
citations