"video representation learning" Papers

10 papers found

MoSiC: Optimal-Transport Motion Trajectory for Dense Self-Supervised Learning

Mohammadreza Salehi, Shashanka Venkataramanan, Ioana Simion et al.

ICCV 2025 poster arXiv:2506.08694
1
citation

SEAL: Semantic Attention Learning for Long Video Representation

Lan Wang, Yujia Chen, Wen-Sheng Chu et al.

CVPR 2025 poster arXiv:2412.01798
7
citations

SMILE: Infusing Spatial and Motion Semantics in Masked Video Learning

Fida Mohammad Thoker, Letian Jiang, Chen Zhao et al.

CVPR 2025 poster arXiv:2504.00527
3
citations

TAMT: Temporal-Aware Model Tuning for Cross-Domain Few-Shot Action Recognition

Yilong Wang, Zilin Gao, Qilong Wang et al.

CVPR 2025 poster arXiv:2411.19041
3
citations

VQToken: Neural Discrete Token Representation Learning for Extreme Token Reduction in Video Large Language Models

Haichao Zhang, Yun Fu

NeurIPS 2025 oral arXiv:2503.16980
3
citations

What Do Latent Action Models Actually Learn?

Chuheng Zhang, Tim Pearce, Pushi Zhang et al.

NeurIPS 2025 poster arXiv:2506.15691
7
citations

EVEREST: Efficient Masked Video Autoencoder by Removing Redundant Spatiotemporal Tokens

Sunil Hwang, Jaehong Yoon, Youngwan Lee et al.

ICML 2024 oral

Sequential Disentanglement by Extracting Static Information From A Single Sequence Element

Nimrod Berman, Ilan Naiman, Idan Arbiv et al.

ICML 2024 poster

STSP: Spatial-Temporal Subspace Projection for Video Class-incremental Learning

Hao Cheng, Siyuan Yang, Chong Wang et al.

ECCV 2024 poster
6
citations

XKD: Cross-Modal Knowledge Distillation with Domain Alignment for Video Representation Learning

Pritam Sarkar, Ali Etemad

AAAI 2024 paper arXiv:2211.13929
38
citations