Juan Carlos Niebles
10
Papers
436
Total Citations
2
Affiliations
Affiliations
SalesforceStanford University
Papers (10)
ULIP-2: Towards Scalable Multimodal Pre-training for 3D Understanding
CVPR 2024
192
citations
Retroformer: Retrospective Large Language Agents with Policy Gradient Optimization
ICLR 2024
104
citations
APIGen-MT: Agentic Pipeline for Multi-Turn Data Generation via Simulated Agent-Human Interplay
NeurIPS 2025arXiv
71
citations
Re-thinking Temporal Search for Long-Form Video Understanding
CVPR 2025
36
citations
LayoutDETR: Detection Transformer Is a Good Multimodal Layout Designer
ECCV 2024arXiv
14
citations
X-InstructBLIP: A Framework for Aligning Image, 3D, Audio, Video to LLMs and its Emergent Cross-modal Reasoning
ECCV 2024
6
citations
Exploring Diffusion Transformer Designs via Grafting
NeurIPS 2025arXiv
4
citations
UniEgoMotion: A Unified Model for Egocentric Motion Reconstruction, Forecasting, and Generation
ICCV 2025
4
citations
Taming generative video models for zero-shot optical flow extraction
NeurIPS 2025
3
citations
ViUniT: Visual Unit Tests for More Robust Visual Programming
CVPR 2025
2
citations