Yujun Shen

78
Papers
795
Total Citations

Papers (78)

GRM: Large Gaussian Reconstruction Model for Efficient 3D Reconstruction and Generation

ECCV 2024
259
citations

FLARE: Feed-forward Geometry, Appearance and Camera Estimation from Uncalibrated Sparse Views

CVPR 2025
92
citations

Ranni: Taming Text-to-Image Diffusion for Accurate Instruction Following

CVPR 2024
78
citations

Language-Image Pre-training with Long Captions

ECCV 2024
63
citations

A Recipe for Scaling up Text-to-Video Generation with Text-free Videos

CVPR 2024
53
citations

SAM-guided Graph Cut for 3D Instance Segmentation

ECCV 2024
32
citations

LeviTor: 3D Trajectory Oriented Image-to-Video Synthesis

CVPR 2025arXiv
25
citations

MagicQuill: An Intelligent Interactive Image Editing System

CVPR 2025
25
citations

Lipschitz Singularities in Diffusion Models

ICLR 2024
21
citations

Prometheus: 3D-Aware Latent Diffusion Models for Feed-Forward Text-to-3D Scene Generation

CVPR 2025
20
citations

Benchmarking Large Vision-Language Models via Directed Scene Graph for Comprehensive Image Captioning

CVPR 2025
18
citations

EnvGS: Modeling View-Dependent Appearance with Environment Gaussian

CVPR 2025
16
citations

Mimir: Improving Video Diffusion Models for Precise Text Understanding

CVPR 2025
16
citations

Exploring Sparse MoE in GANs for Text-conditioned Image Synthesis

CVPR 2025
15
citations

Rectified Diffusion Guidance for Conditional Generation

CVPR 2025arXiv
11
citations

NEAT: Distilling 3D Wireframes from Neural Attraction Fields

CVPR 2024
11
citations

Towards More Accurate Diffusion Model Acceleration with A Timestep Tuner

CVPR 2024
9
citations

PlanarSplatting: Accurate Planar Surface Reconstruction in 3 Minutes

CVPR 2025
9
citations

Contextual AD Narration with Interleaved Multimodal Sequence

CVPR 2025arXiv
7
citations

BerfScene: Bev-conditioned Equivariant Radiance Fields for Infinite 3D Scene Generation

CVPR 2024
5
citations

Learning Visual Generative Priors without Text

CVPR 2025
4
citations

Neural Shell Texture Splatting: More Details and Fewer Primitives

ICCV 2025
4
citations

ScaleLSD: Scalable Deep Line Segment Detection Streamlined

CVPR 2025
1
citations

BoxDreamer: Dreaming Box Corners for Generalizable Object Pose Estimation

ICCV 2025
1
citations

Semi-Supervised Semantic Segmentation Using Unreliable Pseudo-Labels

CVPR 2022arXiv
0
citations

DisCoScene: Spatially Disentangled Generative Radiance Fields for Controllable 3D-Aware Scene Synthesis

CVPR 2023arXiv
0
citations

Neural Dependencies Emerging From Learning Massive Categories

CVPR 2023arXiv
0
citations

GLeaD: Improving GANs With a Generator-Leading Task

CVPR 2023arXiv
0
citations

Balancing Logit Variation for Long-Tailed Semantic Segmentation

CVPR 2023
0
citations

Learning 3D-Aware Image Synthesis With Unknown Pose Distribution

CVPR 2023arXiv
0
citations

Dimensionality-Varying Diffusion Process

CVPR 2023arXiv
0
citations

LipFormer: High-Fidelity and Generalizable Talking Face Generation With a Pre-Learned Facial Codebook

CVPR 2023
0
citations

LinkGAN: Linking GAN Latents to Pixels for Controllable Image Synthesis

ICCV 2023arXiv
0
citations

ViM: Vision Middleware for Unified Downstream Transferring

ICCV 2023arXiv
0
citations

One-Shot Generative Domain Adaptation

ICCV 2023arXiv
0
citations

Regularized Mask Tuning: Uncovering Hidden Knowledge in Pre-Trained Vision-Language Models

ICCV 2023arXiv
0
citations

Scanning Only Once: An End-to-end Framework for Fast Temporal Grounding in Long Videos

ICCV 2023arXiv
0
citations

In-Domain GAN Inversion for Real Image Editing

ECCV 2020
0
citations

High-Fidelity GAN Inversion with Padding Space

ECCV 2022
0
citations

3D-Aware Indoor Scene Synthesis with Depth Priors

ECCV 2022
0
citations

ReTracker: Exploring Image Matching for Robust Online Any Point Tracking

ICCV 2025
0
citations

AvatarArtist: Open-Domain 4D Avatarization

CVPR 2025
0
citations

MangaNinja: Line Art Colorization with Precise Reference Following

CVPR 2025
0
citations

AniDoc: Animation Creation Made Easier

CVPR 2025
0
citations

DiffDoctor: Diagnosing Image Diffusion Models Before Treating

ICCV 2025
0
citations

Learning Temporally Consistent Video Depth from Video Diffusion Priors

CVPR 2025
0
citations

SpatialTrackerV2: Advancing 3D Point Tracking with Explicit Camera Motion

ICCV 2025
0
citations

Diffuman4D: 4D Consistent Human View Synthesis from Sparse-View Videos with Spatio-Temporal Diffusion Models

ICCV 2025
0
citations

Edicho: Consistent Image Editing in the Wild

ICCV 2025
0
citations

CoDeF: Content Deformation Fields for Temporally Consistent Video Processing

CVPR 2024
0
citations

AnyDoor: Zero-shot Object-level Image Customization

CVPR 2024
0
citations

SpatialTracker: Tracking Any 2D Pixels in 3D Space

CVPR 2024
0
citations

4K4D: Real-Time 4D View Synthesis at 4K Resolution

CVPR 2024
0
citations

CCM: Real-Time Controllable Visual Content Creation Using Text-to-Image Consistency Models

ICML 2024
0
citations

SMaRt: Improving GANs with Score Matching Regularity

ICML 2024
0
citations

FaceID-GAN: Learning a Symmetry Three-Player GAN for Identity-Preserving Face Synthesis

CVPR 2018
0
citations

Image Processing Using Multi-Code GAN Prior

CVPR 2020arXiv
0
citations

Interpreting the Latent Space of GANs for Semantic Face Editing

CVPR 2020arXiv
0
citations

Closed-Form Factorization of Latent Semantics in GANs

CVPR 2021arXiv
0
citations

Glancing at the Patch: Anomaly Localization With Global and Local Feature Comparison

CVPR 2021
0
citations

Generative Hierarchical Features From Synthesizing Images

CVPR 2021arXiv
0
citations

3D-Aware Image Synthesis via Learning Structural and Textural Representations

CVPR 2022arXiv
0
citations

Improving GAN Equilibrium by Raising Spatial Awareness

CVPR 2022arXiv
0
citations

Cross-Model Pseudo-Labeling for Semi-Supervised Action Recognition

CVPR 2022arXiv
0
citations

Data-Efficient Instance Generation from Instance Discrimination

NeurIPS 2021
0
citations

Low-Rank Subspaces in GANs

NeurIPS 2021
0
citations

A Unified Model for Multi-class Anomaly Detection

NeurIPS 2022
0
citations

Learning from Future: A Novel Self-Training Framework for Semantic Segmentation

NeurIPS 2022
0
citations

Improving 3D-aware Image Synthesis with A Geometry-aware Discriminator

NeurIPS 2022
0
citations

Improving GANs with A Dynamic Discriminator

NeurIPS 2022
0
citations

Benchmarking and Analyzing 3D-aware Image Synthesis with a Modularized Codebase

NeurIPS 2023
0
citations

Learning Modulated Transformation in GANs

NeurIPS 2023
0
citations

VideoComposer: Compositional Video Synthesis with Motion Controllability

NeurIPS 2023
0
citations

Revisiting the Evaluation of Image Synthesis with GANs

NeurIPS 2023
0
citations

FaceComposer: A Unified Model for Versatile Facial Content Creation

NeurIPS 2023
0
citations

Res-Tuning: A Flexible and Efficient Tuning Paradigm via Unbinding Tuner from Backbone

NeurIPS 2023
0
citations

Customizable Image Synthesis with Multiple Subjects

NeurIPS 2023
0
citations

Compact Neural Volumetric Video Representations with Dynamic Codebooks

NeurIPS 2023
0
citations