"vision-language learning" Papers
2 papers found
Perception Encoder: The best visual embeddings are not at the output of the network
Daniel Bolya, Po-Yao Huang, Peize Sun et al.
NeurIPS 2025oralarXiv:2504.13181
118
citations
Vision-Language Action Knowledge Learning for Semantic-Aware Action Quality Assessment
Huangbiao Xu, Xiao Ke, Yuezhou Li et al.
ECCV 2024poster
14
citations