NeurIPS "video representation learning" Papers
2 papers found
VQToken: Neural Discrete Token Representation Learning for Extreme Token Reduction in Video Large Language Models
Haichao Zhang, Yun Fu
NeurIPS 2025oralarXiv:2503.16980
3
citations
What Do Latent Action Models Actually Learn?
Chuheng Zhang, Tim Pearce, Pushi Zhang et al.
NeurIPS 2025posterarXiv:2506.15691
7
citations