Ran Xu
10
Papers
462
Total Citations
Papers (10)
ULIP-2: Towards Scalable Multimodal Pre-training for 3D Understanding
CVPR 2024arXiv
192
citations
HIVE: Harnessing Human Feedback for Instructional Visual Editing
CVPR 2024arXiv
164
citations
Retroformer: Retrospective Large Language Agents with Policy Gradient Optimization
ICLR 2024arXiv
104
citations
Trust but Verify: Programmatic VLM Evaluation in the Wild
ICCV 2025
2
citations
Structured Policy Optimization: Enhance Large Vision-Language Model via Self-referenced Dialogue
ICCV 2025
0
citations
Towards Scalable Spatial Intelligence via 2D-to-3D Data Lifting
ICCV 2025
0
citations
Text2Data: Low-Resource Data Generation with Textual Control
AAAI 2025arXiv
0
citations
LayoutDETR: Detection Transformer Is a Good Multimodal Layout Designer
ECCV 2024arXiv
0
citations
Position: TrustLLM: Trustworthiness in Large Language Models
ICML 2024
0
citations
Continual-MAE: Adaptive Distribution Masked Autoencoders for Continual Test-Time Adaptation
CVPR 2024
0
citations