Jiajun Wu
31
Papers
740
Total Citations
1
Affiliations
Affiliations
Stanford University
Papers (31)
ULIP-2: Towards Scalable Multimodal Pre-training for 3D Understanding
CVPR 2024
192
citations
Physics-Based Interaction with 3D Objects via Video Generation
ECCV 2024arXiv
137
citations
WonderWorld: Interactive 3D Scene Generation from a Single Image
CVPR 2025
120
citations
ZeroNVS: Zero-Shot 360-Degree View Synthesis from a Single Image
CVPR 2024
85
citations
LayoutVLM: Differentiable Optimization of 3D Layout via Vision-Language Models
CVPR 2025
44
citations
Learning the 3D Fauna of the Web
CVPR 2024
42
citations
Re-thinking Temporal Search for Long-Form Video Understanding
CVPR 2025
36
citations
The Scene Language: Representing Scenes with Programs, Words, and Embeddings
CVPR 2025
15
citations
BEHAVIOR Vision Suite: Customizable Dataset Generation via Simulation
CVPR 2024
14
citations
Language-Informed Visual Concept Learning
ICLR 2024
12
citations
FluidNexus: 3D Fluid Reconstruction and Prediction from a Single Video
CVPR 2025
11
citations
Ponymation: Learning Articulated 3D Animal Motions from Unlabeled Online Videos
ECCV 2024
10
citations
Naturally Supervised 3D Visual Grounding with Language-Regularized Concept Learners
CVPR 2024
9
citations
Birth and Death of a Rose
CVPR 2025
5
citations
PGC: Physics-Based Gaussian Cloth from a Single Pose
CVPR 2025
3
citations
Taming generative video models for zero-shot optical flow extraction
NeurIPS 2025
3
citations
Category-Agnostic Neural Object Rigging
CVPR 2025arXiv
2
citations
Tripod: Three Complementary Inductive Biases for Disentangled Representation Learning
ICML 2024
0
citations
Digital Twin Catalog: A Large-Scale Photorealistic 3D Object Digital Twin Dataset
CVPR 2025
0
citations
Lifting Motion to the 3D World via 2D Diffusion
CVPR 2025
0
citations
Diffusion Self-Distillation for Zero-Shot Customized Image Generation
CVPR 2025
0
citations
X-Capture: An Open-Source Portable Device for Multi-Sensory Learning
ICCV 2025
0
citations
Weakly-Supervised Learning of Dense Functional Correspondences
ICCV 2025
0
citations
WonderPlay: Dynamic 3D Scene Generation from a Single Image and Actions
ICCV 2025
0
citations
Flow to the Mode: Mode-Seeking Diffusion Autoencoders for State-of-the-Art Image Tokenization
ICCV 2025
0
citations
WorldScore: Unified Evaluation Benchmark for World Generation
ICCV 2025
0
citations
HVAdam: A Full-Dimension Adaptive Optimizer
AAAI 2025
0
citations
SkyScript: A Large and Semantically Diverse Vision-Language Dataset for Remote Sensing
AAAI 2024
0
citations
Hearing Anything Anywhere
CVPR 2024
0
citations
Holodeck: Language Guided Generation of 3D Embodied AI Environments
CVPR 2024
0
citations
WonderJourney: Going from Anywhere to Everywhere
CVPR 2024
0
citations