2024 Poster "image understanding" Papers
2 papers found
An Image is Worth 1/2 Tokens After Layer 2: Plug-and-Play Inference Acceleration for Large Vision-Language Models
Liang Chen, Haozhe Zhao, Tianyu Liu et al.
ECCV 2024 poster, arXiv:2403.06764
343 citations
Scene Graph Generation Strategy with Co-occurrence Knowledge and Learnable Term Frequency
Hyeongjin Kim, Sangwon Kim, Dasom Ahn et al.
ICML 2024 poster, arXiv:2405.12648