Poster "vision token compression" Papers
3 papers found
Compress & Cache: Vision token compression for efficient generation and retrieval
Adrian Bulat, Yassine Ouali, Georgios Tzimiropoulos
NeurIPS 2025poster
HiRes-LLaVA: Restoring Fragmentation Input in High-Resolution Large Vision-Language Models
Runhui Huang, Xinpeng Ding, Chunwei Wang et al.
CVPR 2025posterarXiv:2407.08706
13
citations
LLaVA-Mini: Efficient Image and Video Large Multimodal Models with One Vision Token
Shaolei Zhang, Qingkai Fang, Yang et al.
ICLR 2025posterarXiv:2501.03895
106
citations