Poster "multi-modal tasks" Papers
4 papers found
A Stitch in Time Saves Nine: Small VLM is a Precise Guidance for Accelerating Large VLMs
Wangbo Zhao, Yizeng Han, Jiasheng Tang et al.
CVPR 2025posterarXiv:2412.03324
23
citations
Balanced Token Pruning: Accelerating Vision Language Models Beyond Local Optimization
kaiyuan Li, Xiaoyue Chen, Chen Gao et al.
NeurIPS 2025posterarXiv:2505.22038
4
citations
Dataset Growth
Ziheng Qin, zhaopan xu, YuKun Zhou et al.
ECCV 2024posterarXiv:2405.18347
4
citations
PiTe: Pixel-Temporal Alignment for Large Video-Language Model
Yang Liu, Pengxiang Ding, Siteng Huang et al.
ECCV 2024posterarXiv:2409.07239
9
citations