by Yueming Xu Papers
2 papers found
4D-VLA: Spatiotemporal Vision-Language-Action Pretraining with Cross-Scene Calibration
Jiahui Zhang, Yurui Chen, Yueming Xu et al.
NeurIPS 2025oralarXiv:2506.22242
From Flatland to Space: Teaching Vision-Language Models to Perceive and Reason in 3D
Jiahui Zhang, Yurui Chen, Yueming Xu et al.
NeurIPS 2025posterarXiv:2503.22976