"video representation learning" Papers
8 papers found
MoSiC: Optimal-Transport Motion Trajectory for Dense Self-Supervised Learning
Mohammadreza Salehi, Shashanka Venkataramanan, Ioana Simion et al.
ICCV 2025 | poster | arXiv:2506.08694 | 1 citation
SEAL: Semantic Attention Learning for Long Video Representation
Lan Wang, Yujia Chen, Wen-Sheng Chu et al.
CVPR 2025 | poster | arXiv:2412.01798 | 7 citations
SMILE: Infusing Spatial and Motion Semantics in Masked Video Learning
Fida Mohammad Thoker, Letian Jiang, Chen Zhao et al.
CVPR 2025 | poster | arXiv:2504.00527 | 3 citations
VQToken: Neural Discrete Token Representation Learning for Extreme Token Reduction in Video Large Language Models
Haichao Zhang, Yun Fu
NeurIPS 2025 | oral | arXiv:2503.16980 | 3 citations
What Do Latent Action Models Actually Learn?
Chuheng Zhang, Tim Pearce, Pushi Zhang et al.
NeurIPS 2025 | poster | arXiv:2506.15691 | 7 citations
EVEREST: Efficient Masked Video Autoencoder by Removing Redundant Spatiotemporal Tokens
Sunil Hwang, Jaehong Yoon, Youngwan Lee et al.
ICML 2024 | oral
Sequential Disentanglement by Extracting Static Information From A Single Sequence Element
Nimrod Berman, Ilan Naiman, Idan Arbiv et al.
ICML 2024 | poster
XKD: Cross-Modal Knowledge Distillation with Domain Alignment for Video Representation Learning
Pritam Sarkar, Ali Etemad
AAAI 2024 | paper | arXiv:2211.13929 | 38 citations