Ron Litman
10
Papers
42
Total Citations
Papers (10)
Question Aware Vision Transformer for Multimodal Reasoning
CVPR 2024
36
citations
VisFocus: Prompt-Guided Vision Encoders for OCR-Free Dense Document Understanding
ECCV 2024
6
citations
SCATTER: Selective Context Attentional Scene Text Recognizer
CVPR 2020arXiv
0
citations
Sequence-to-Sequence Contrastive Learning for Text Recognition
CVPR 2021arXiv
0
citations
LaTr: Layout-Aware Transformer for Scene-Text VQA
CVPR 2022arXiv
0
citations
Towards Models that Can See and Read
ICCV 2023arXiv
0
citations
CLIPTER: Looking at the Bigger Picture in Scene Text Recognition
ICCV 2023arXiv
0
citations
DocVLM: Make Your VLM an Efficient Reader
CVPR 2025
0
citations
TextAdaIN: Paying Attention to Shortcut Learning in Text Recognizers
ECCV 2022
0
citations
GRAM: Global Reasoning for Multi-Page VQA
CVPR 2024
0
citations