Yujun Shen
39
Papers
795
Total Citations
Papers (39)
GRM: Large Gaussian Reconstruction Model for Efficient 3D Reconstruction and Generation
ECCV 2024
259
citations
FLARE: Feed-forward Geometry, Appearance and Camera Estimation from Uncalibrated Sparse Views
CVPR 2025
92
citations
Ranni: Taming Text-to-Image Diffusion for Accurate Instruction Following
CVPR 2024
78
citations
Language-Image Pre-training with Long Captions
ECCV 2024
63
citations
A Recipe for Scaling up Text-to-Video Generation with Text-free Videos
CVPR 2024
53
citations
SAM-guided Graph Cut for 3D Instance Segmentation
ECCV 2024
32
citations
MagicQuill: An Intelligent Interactive Image Editing System
CVPR 2025
25
citations
LeviTor: 3D Trajectory Oriented Image-to-Video Synthesis
CVPR 2025
25
citations
Lipschitz Singularities in Diffusion Models
ICLR 2024
21
citations
Prometheus: 3D-Aware Latent Diffusion Models for Feed-Forward Text-to-3D Scene Generation
CVPR 2025
20
citations
Benchmarking Large Vision-Language Models via Directed Scene Graph for Comprehensive Image Captioning
CVPR 2025
18
citations
EnvGS: Modeling View-Dependent Appearance with Environment Gaussian
CVPR 2025
16
citations
Mimir: Improving Video Diffusion Models for Precise Text Understanding
CVPR 2025
16
citations
Exploring Sparse MoE in GANs for Text-conditioned Image Synthesis
CVPR 2025
15
citations
NEAT: Distilling 3D Wireframes from Neural Attraction Fields
CVPR 2024
11
citations
Rectified Diffusion Guidance for Conditional Generation
CVPR 2025
11
citations
PlanarSplatting: Accurate Planar Surface Reconstruction in 3 Minutes
CVPR 2025
9
citations
Towards More Accurate Diffusion Model Acceleration with A Timestep Tuner
CVPR 2024
9
citations
Contextual AD Narration with Interleaved Multimodal Sequence
CVPR 2025
7
citations
BerfScene: Bev-conditioned Equivariant Radiance Fields for Infinite 3D Scene Generation
CVPR 2024
5
citations
Learning Visual Generative Priors without Text
CVPR 2025
4
citations
Neural Shell Texture Splatting: More Details and Fewer Primitives
ICCV 2025
4
citations
BoxDreamer: Dreaming Box Corners for Generalizable Object Pose Estimation
ICCV 2025
1
citations
ScaleLSD: Scalable Deep Line Segment Detection Streamlined
CVPR 2025
1
citations
SMaRt: Improving GANs with Score Matching Regularity
ICML 2024
0
citations
AvatarArtist: Open-Domain 4D Avatarization
CVPR 2025
0
citations
MangaNinja: Line Art Colorization with Precise Reference Following
CVPR 2025
0
citations
AniDoc: Animation Creation Made Easier
CVPR 2025
0
citations
DiffDoctor: Diagnosing Image Diffusion Models Before Treating
ICCV 2025
0
citations
ReTracker: Exploring Image Matching for Robust Online Any Point Tracking
ICCV 2025
0
citations
SpatialTrackerV2: Advancing 3D Point Tracking with Explicit Camera Motion
ICCV 2025
0
citations
Diffuman4D: 4D Consistent Human View Synthesis from Sparse-View Videos with Spatio-Temporal Diffusion Models
ICCV 2025
0
citations
Edicho: Consistent Image Editing in the Wild
ICCV 2025
0
citations
CoDeF: Content Deformation Fields for Temporally Consistent Video Processing
CVPR 2024
0
citations
AnyDoor: Zero-shot Object-level Image Customization
CVPR 2024
0
citations
SpatialTracker: Tracking Any 2D Pixels in 3D Space
CVPR 2024
0
citations
4K4D: Real-Time 4D View Synthesis at 4K Resolution
CVPR 2024
0
citations
CCM: Real-Time Controllable Visual Content Creation Using Text-to-Image Consistency Models
ICML 2024
0
citations
Learning Temporally Consistent Video Depth from Video Diffusion Priors
CVPR 2025
0
citations