Jiajun Wu

31
Papers
740
Total Citations
1
Affiliations

Affiliations

Stanford University

Papers (31)

ULIP-2: Towards Scalable Multimodal Pre-training for 3D Understanding

CVPR 2024
192
citations

Physics-Based Interaction with 3D Objects via Video Generation

ECCV 2024 (arXiv)
137
citations

WonderWorld: Interactive 3D Scene Generation from a Single Image

CVPR 2025
120
citations

ZeroNVS: Zero-Shot 360-Degree View Synthesis from a Single Image

CVPR 2024
85
citations

LayoutVLM: Differentiable Optimization of 3D Layout via Vision-Language Models

CVPR 2025
44
citations

Learning the 3D Fauna of the Web

CVPR 2024
42
citations

Re-thinking Temporal Search for Long-Form Video Understanding

CVPR 2025
36
citations

The Scene Language: Representing Scenes with Programs, Words, and Embeddings

CVPR 2025
15
citations

BEHAVIOR Vision Suite: Customizable Dataset Generation via Simulation

CVPR 2024
14
citations

Language-Informed Visual Concept Learning

ICLR 2024
12
citations

FluidNexus: 3D Fluid Reconstruction and Prediction from a Single Video

CVPR 2025
11
citations

Ponymation: Learning Articulated 3D Animal Motions from Unlabeled Online Videos

ECCV 2024
10
citations

Naturally Supervised 3D Visual Grounding with Language-Regularized Concept Learners

CVPR 2024
9
citations

Birth and Death of a Rose

CVPR 2025
5
citations

PGC: Physics-Based Gaussian Cloth from a Single Pose

CVPR 2025
3
citations

Taming generative video models for zero-shot optical flow extraction

NeurIPS 2025
3
citations

Category-Agnostic Neural Object Rigging

CVPR 2025 (arXiv)
2
citations

Tripod: Three Complementary Inductive Biases for Disentangled Representation Learning

ICML 2024
0
citations

Digital Twin Catalog: A Large-Scale Photorealistic 3D Object Digital Twin Dataset

CVPR 2025
0
citations

Lifting Motion to the 3D World via 2D Diffusion

CVPR 2025
0
citations

Diffusion Self-Distillation for Zero-Shot Customized Image Generation

CVPR 2025
0
citations

X-Capture: An Open-Source Portable Device for Multi-Sensory Learning

ICCV 2025
0
citations

Weakly-Supervised Learning of Dense Functional Correspondences

ICCV 2025
0
citations

WonderPlay: Dynamic 3D Scene Generation from a Single Image and Actions

ICCV 2025
0
citations

Flow to the Mode: Mode-Seeking Diffusion Autoencoders for State-of-the-Art Image Tokenization

ICCV 2025
0
citations

WorldScore: Unified Evaluation Benchmark for World Generation

ICCV 2025
0
citations

HVAdam: A Full-Dimension Adaptive Optimizer

AAAI 2025
0
citations

SkyScript: A Large and Semantically Diverse Vision-Language Dataset for Remote Sensing

AAAI 2024
0
citations

Hearing Anything Anywhere

CVPR 2024
0
citations

Holodeck: Language Guided Generation of 3D Embodied AI Environments

CVPR 2024
0
citations

WonderJourney: Going from Anywhere to Everywhere

CVPR 2024
0
citations