Aniruddha Kembhavi
10 Papers
198 Total Citations
Papers (10)
Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Vision-Language Models
CVPR 2025
96 citations
SPOC: Imitating Shortest Paths in Simulation Enables Effective Navigation and Manipulation in the Real World
CVPR 2024
52 citations
One Diffusion to Generate Them All
CVPR 2025
34 citations
Iterated Learning Improves Compositionality in Large Vision-Language Models
CVPR 2024
16 citations
Promptable Behaviors: Personalizing Multi-Objective Rewards from Human Preferences
CVPR 2024
0 citations
Eval3D: Interpretable and Fine-grained Evaluation for 3D Generation
CVPR 2025
0 citations
Unified-IO 2: Scaling Autoregressive Multimodal Models with Vision, Language, Audio, and Action
CVPR 2024
0 citations
Seeing the Unseen: Visual Common Sense for Semantic Placement
CVPR 2024
0 citations
ReSpec: Relevance and Specificity Grounded Online Filtering for Learning on Video-Text Data Streams
CVPR 2025
0 citations
Holodeck: Language Guided Generation of 3D Embodied AI Environments
CVPR 2024
0 citations