Yujun Shen

39
Papers
795
Total Citations

Papers (39)

GRM: Large Gaussian Reconstruction Model for Efficient 3D Reconstruction and Generation

ECCV 2024
259
citations

FLARE: Feed-forward Geometry, Appearance and Camera Estimation from Uncalibrated Sparse Views

CVPR 2025
92
citations

Ranni: Taming Text-to-Image Diffusion for Accurate Instruction Following

CVPR 2024
78
citations

Language-Image Pre-training with Long Captions

ECCV 2024
63
citations

A Recipe for Scaling up Text-to-Video Generation with Text-free Videos

CVPR 2024
53
citations

SAM-guided Graph Cut for 3D Instance Segmentation

ECCV 2024
32
citations

MagicQuill: An Intelligent Interactive Image Editing System

CVPR 2025
25
citations

LeviTor: 3D Trajectory Oriented Image-to-Video Synthesis

CVPR 2025arXiv
25
citations

Lipschitz Singularities in Diffusion Models

ICLR 2024
21
citations

Prometheus: 3D-Aware Latent Diffusion Models for Feed-Forward Text-to-3D Scene Generation

CVPR 2025
20
citations

Benchmarking Large Vision-Language Models via Directed Scene Graph for Comprehensive Image Captioning

CVPR 2025
18
citations

EnvGS: Modeling View-Dependent Appearance with Environment Gaussian

CVPR 2025
16
citations

Mimir: Improving Video Diffusion Models for Precise Text Understanding

CVPR 2025
16
citations

Exploring Sparse MoE in GANs for Text-conditioned Image Synthesis

CVPR 2025
15
citations

NEAT: Distilling 3D Wireframes from Neural Attraction Fields

CVPR 2024
11
citations

Rectified Diffusion Guidance for Conditional Generation

CVPR 2025arXiv
11
citations

PlanarSplatting: Accurate Planar Surface Reconstruction in 3 Minutes

CVPR 2025
9
citations

Towards More Accurate Diffusion Model Acceleration with A Timestep Tuner

CVPR 2024
9
citations

Contextual AD Narration with Interleaved Multimodal Sequence

CVPR 2025arXiv
7
citations

BerfScene: Bev-conditioned Equivariant Radiance Fields for Infinite 3D Scene Generation

CVPR 2024
5
citations

Learning Visual Generative Priors without Text

CVPR 2025
4
citations

Neural Shell Texture Splatting: More Details and Fewer Primitives

ICCV 2025
4
citations

BoxDreamer: Dreaming Box Corners for Generalizable Object Pose Estimation

ICCV 2025
1
citations

ScaleLSD: Scalable Deep Line Segment Detection Streamlined

CVPR 2025
1
citations

SMaRt: Improving GANs with Score Matching Regularity

ICML 2024
0
citations

AvatarArtist: Open-Domain 4D Avatarization

CVPR 2025
0
citations

MangaNinja: Line Art Colorization with Precise Reference Following

CVPR 2025
0
citations

AniDoc: Animation Creation Made Easier

CVPR 2025
0
citations

DiffDoctor: Diagnosing Image Diffusion Models Before Treating

ICCV 2025
0
citations

ReTracker: Exploring Image Matching for Robust Online Any Point Tracking

ICCV 2025
0
citations

SpatialTrackerV2: Advancing 3D Point Tracking with Explicit Camera Motion

ICCV 2025
0
citations

Diffuman4D: 4D Consistent Human View Synthesis from Sparse-View Videos with Spatio-Temporal Diffusion Models

ICCV 2025
0
citations

Edicho: Consistent Image Editing in the Wild

ICCV 2025
0
citations

CoDeF: Content Deformation Fields for Temporally Consistent Video Processing

CVPR 2024
0
citations

AnyDoor: Zero-shot Object-level Image Customization

CVPR 2024
0
citations

SpatialTracker: Tracking Any 2D Pixels in 3D Space

CVPR 2024
0
citations

4K4D: Real-Time 4D View Synthesis at 4K Resolution

CVPR 2024
0
citations

CCM: Real-Time Controllable Visual Content Creation Using Text-to-Image Consistency Models

ICML 2024
0
citations

Learning Temporally Consistent Video Depth from Video Diffusion Priors

CVPR 2025
0
citations