"visual foundation models" Papers
4 papers found
A Token-level Text Image Foundation Model for Document Understanding
Tongkun Guan, Zining Wang, Pei Fu et al.
ICCV 2025posterarXiv:2503.02304
4
citations
InstantSplamp: Fast and Generalizable Stenography Framework for Generative Gaussian Splatting
Chenxin Li, Hengyu Liu, Zhiwen Fan et al.
ICLR 2025poster
12
citations
Knowledge Transfer from Interaction Learning
Yilin Gao, Kangyi Chen, Zhongxing Peng et al.
ICCV 2025posterarXiv:2509.18733
Empowering Embodied Visual Tracking with Visual Foundation Models and Offline RL
Fangwei Zhong, Kui Wu, Hai Ci et al.
ECCV 2024posterarXiv:2404.09857
13
citations