ICLR 2025 "document understanding" Papers
2 papers found
Harnessing Webpage UIs for Text-Rich Visual Understanding
Junpeng Liu, Tianyue Ou, Yifan Song et al.
ICLR 2025 poster, arXiv:2410.13824
21 citations
Mini-Monkey: Alleviating the Semantic Sawtooth Effect for Lightweight MLLMs via Complementary Image Pyramid
Mingxin Huang, Yuliang Liu, Dingkang Liang et al.
ICLR 2025 poster, arXiv:2408.02034
22 citations