"vision-language representation learning" Papers
2 papers found
FLIP: Flow-Centric Generative Planning as General-Purpose Manipulation World Model
Chongkai Gao, Haozhuo Zhang, Zhixuan Xu et al.
ICLR 2025posterarXiv:2412.08261
24
citations
CoPT: Unsupervised Domain Adaptive Segmentation using Domain-Agnostic Text Embeddings
Cristina Mata, Kanchana N Ranasinghe, Michael S Ryoo
ECCV 2024posterarXiv:2507.07125
5
citations