"visual feature extraction" Papers
2 papers found
DynImg: Key Frames with Visual Prompts are Good Representation for Multi-Modal Video Understanding
Xiaoyi Bao, Chen-Wei Xie, Hao Tang et al.
ICCV 2025posterarXiv:2507.15569
1
citations
Image Over Text: Transforming Formula Recognition Evaluation with Character Detection Matching
Bin Wang, Fan Wu, Linke Ouyang et al.
CVPR 2025posterarXiv:2409.03643
13
citations