Fei Xia
7
Papers
723
Total Citations
Papers (7)
SpatialVLM: Endowing Vision-Language Models with Spatial Reasoning Capabilities
CVPR 2024
550
citations
Video Language Planning
ICLR 2024
144
citations
DriveGPT4-V2: Harnessing Large Language Model Capabilities for Enhanced Closed-Loop Autonomous Driving
CVPR 2025
17
citations
Mastering Symbolic Operations: Augmenting Language Models with Compiled Neural Networks
ICLR 2024
12
citations
MicroDiffusion: Implicit Representation-Guided Diffusion for 3D Reconstruction from Limited 2D Microscopy Projections
CVPR 2024
0
citations
Chain of Code: Reasoning with a Language Model-Augmented Code Emulator
ICML 2024
0
citations
PIVOT: Iterative Visual Prompting Elicits Actionable Knowledge for VLMs
ICML 2024
0
citations