Le Xue
Papers: 5
Total Citations: 309
Papers (5)
ULIP-2: Towards Scalable Multimodal Pre-training for 3D Understanding
CVPR 2024, 192 citations
Retroformer: Retrospective Large Language Agents with Policy Gradient Optimization
ICLR 2024, 104 citations
LLAVIDAL: A Large LAnguage VIsion Model for Daily Activities of Living
CVPR 2025, 7 citations
X-InstructBLIP: A Framework for Aligning Image, 3D, Audio, Video to LLMs and its Emergent Cross-modal Reasoning
ECCV 2024, 6 citations
SegAnyPET: Universal Promptable Segmentation from Positron Emission Tomography Images
ICCV 2025, 0 citations