Xin Fei
3
Papers
12
Total Citations
Papers (3)
VLA-OS: Structuring and Dissecting Planning Representations and Paradigms in Vision-Language-Action Models
NeurIPS 2025arXiv
8
citations
GeoAuxNet: Towards Universal 3D Representation Learning for Multi-sensor Point Clouds
CVPR 2024
4
citations
Scene Splatter: Momentum 3D Scene Generation from Single Image with Video Diffusion Model
CVPR 2025
0
citations