Yujun Shen
78
Papers
795
Total Citations
Papers (78)
GRM: Large Gaussian Reconstruction Model for Efficient 3D Reconstruction and Generation
ECCV 2024
259
citations
FLARE: Feed-forward Geometry, Appearance and Camera Estimation from Uncalibrated Sparse Views
CVPR 2025
92
citations
Ranni: Taming Text-to-Image Diffusion for Accurate Instruction Following
CVPR 2024
78
citations
Language-Image Pre-training with Long Captions
ECCV 2024
63
citations
A Recipe for Scaling up Text-to-Video Generation with Text-free Videos
CVPR 2024
53
citations
SAM-guided Graph Cut for 3D Instance Segmentation
ECCV 2024
32
citations
LeviTor: 3D Trajectory Oriented Image-to-Video Synthesis
CVPR 2025arXiv
25
citations
MagicQuill: An Intelligent Interactive Image Editing System
CVPR 2025
25
citations
Lipschitz Singularities in Diffusion Models
ICLR 2024
21
citations
Prometheus: 3D-Aware Latent Diffusion Models for Feed-Forward Text-to-3D Scene Generation
CVPR 2025
20
citations
Benchmarking Large Vision-Language Models via Directed Scene Graph for Comprehensive Image Captioning
CVPR 2025
18
citations
EnvGS: Modeling View-Dependent Appearance with Environment Gaussian
CVPR 2025
16
citations
Mimir: Improving Video Diffusion Models for Precise Text Understanding
CVPR 2025
16
citations
Exploring Sparse MoE in GANs for Text-conditioned Image Synthesis
CVPR 2025
15
citations
Rectified Diffusion Guidance for Conditional Generation
CVPR 2025arXiv
11
citations
NEAT: Distilling 3D Wireframes from Neural Attraction Fields
CVPR 2024
11
citations
Towards More Accurate Diffusion Model Acceleration with A Timestep Tuner
CVPR 2024
9
citations
PlanarSplatting: Accurate Planar Surface Reconstruction in 3 Minutes
CVPR 2025
9
citations
Contextual AD Narration with Interleaved Multimodal Sequence
CVPR 2025arXiv
7
citations
BerfScene: Bev-conditioned Equivariant Radiance Fields for Infinite 3D Scene Generation
CVPR 2024
5
citations
Learning Visual Generative Priors without Text
CVPR 2025
4
citations
Neural Shell Texture Splatting: More Details and Fewer Primitives
ICCV 2025
4
citations
ScaleLSD: Scalable Deep Line Segment Detection Streamlined
CVPR 2025
1
citations
BoxDreamer: Dreaming Box Corners for Generalizable Object Pose Estimation
ICCV 2025
1
citations
Semi-Supervised Semantic Segmentation Using Unreliable Pseudo-Labels
CVPR 2022arXiv
0
citations
DisCoScene: Spatially Disentangled Generative Radiance Fields for Controllable 3D-Aware Scene Synthesis
CVPR 2023arXiv
0
citations
Neural Dependencies Emerging From Learning Massive Categories
CVPR 2023arXiv
0
citations
GLeaD: Improving GANs With a Generator-Leading Task
CVPR 2023arXiv
0
citations
Balancing Logit Variation for Long-Tailed Semantic Segmentation
CVPR 2023
0
citations
Learning 3D-Aware Image Synthesis With Unknown Pose Distribution
CVPR 2023arXiv
0
citations
Dimensionality-Varying Diffusion Process
CVPR 2023arXiv
0
citations
LipFormer: High-Fidelity and Generalizable Talking Face Generation With a Pre-Learned Facial Codebook
CVPR 2023
0
citations
LinkGAN: Linking GAN Latents to Pixels for Controllable Image Synthesis
ICCV 2023arXiv
0
citations
ViM: Vision Middleware for Unified Downstream Transferring
ICCV 2023arXiv
0
citations
One-Shot Generative Domain Adaptation
ICCV 2023arXiv
0
citations
Regularized Mask Tuning: Uncovering Hidden Knowledge in Pre-Trained Vision-Language Models
ICCV 2023arXiv
0
citations
Scanning Only Once: An End-to-end Framework for Fast Temporal Grounding in Long Videos
ICCV 2023arXiv
0
citations
In-Domain GAN Inversion for Real Image Editing
ECCV 2020
0
citations
High-Fidelity GAN Inversion with Padding Space
ECCV 2022
0
citations
3D-Aware Indoor Scene Synthesis with Depth Priors
ECCV 2022
0
citations
ReTracker: Exploring Image Matching for Robust Online Any Point Tracking
ICCV 2025
0
citations
AvatarArtist: Open-Domain 4D Avatarization
CVPR 2025
0
citations
MangaNinja: Line Art Colorization with Precise Reference Following
CVPR 2025
0
citations
AniDoc: Animation Creation Made Easier
CVPR 2025
0
citations
DiffDoctor: Diagnosing Image Diffusion Models Before Treating
ICCV 2025
0
citations
Learning Temporally Consistent Video Depth from Video Diffusion Priors
CVPR 2025
0
citations
SpatialTrackerV2: Advancing 3D Point Tracking with Explicit Camera Motion
ICCV 2025
0
citations
Diffuman4D: 4D Consistent Human View Synthesis from Sparse-View Videos with Spatio-Temporal Diffusion Models
ICCV 2025
0
citations
Edicho: Consistent Image Editing in the Wild
ICCV 2025
0
citations
CoDeF: Content Deformation Fields for Temporally Consistent Video Processing
CVPR 2024
0
citations
AnyDoor: Zero-shot Object-level Image Customization
CVPR 2024
0
citations
SpatialTracker: Tracking Any 2D Pixels in 3D Space
CVPR 2024
0
citations
4K4D: Real-Time 4D View Synthesis at 4K Resolution
CVPR 2024
0
citations
CCM: Real-Time Controllable Visual Content Creation Using Text-to-Image Consistency Models
ICML 2024
0
citations
SMaRt: Improving GANs with Score Matching Regularity
ICML 2024
0
citations
FaceID-GAN: Learning a Symmetry Three-Player GAN for Identity-Preserving Face Synthesis
CVPR 2018
0
citations
Image Processing Using Multi-Code GAN Prior
CVPR 2020arXiv
0
citations
Interpreting the Latent Space of GANs for Semantic Face Editing
CVPR 2020arXiv
0
citations
Closed-Form Factorization of Latent Semantics in GANs
CVPR 2021arXiv
0
citations
Glancing at the Patch: Anomaly Localization With Global and Local Feature Comparison
CVPR 2021
0
citations
Generative Hierarchical Features From Synthesizing Images
CVPR 2021arXiv
0
citations
3D-Aware Image Synthesis via Learning Structural and Textural Representations
CVPR 2022arXiv
0
citations
Improving GAN Equilibrium by Raising Spatial Awareness
CVPR 2022arXiv
0
citations
Cross-Model Pseudo-Labeling for Semi-Supervised Action Recognition
CVPR 2022arXiv
0
citations
Data-Efficient Instance Generation from Instance Discrimination
NeurIPS 2021
0
citations
Low-Rank Subspaces in GANs
NeurIPS 2021
0
citations
A Unified Model for Multi-class Anomaly Detection
NeurIPS 2022
0
citations
Learning from Future: A Novel Self-Training Framework for Semantic Segmentation
NeurIPS 2022
0
citations
Improving 3D-aware Image Synthesis with A Geometry-aware Discriminator
NeurIPS 2022
0
citations
Improving GANs with A Dynamic Discriminator
NeurIPS 2022
0
citations
Benchmarking and Analyzing 3D-aware Image Synthesis with a Modularized Codebase
NeurIPS 2023
0
citations
Learning Modulated Transformation in GANs
NeurIPS 2023
0
citations
VideoComposer: Compositional Video Synthesis with Motion Controllability
NeurIPS 2023
0
citations
Revisiting the Evaluation of Image Synthesis with GANs
NeurIPS 2023
0
citations
FaceComposer: A Unified Model for Versatile Facial Content Creation
NeurIPS 2023
0
citations
Res-Tuning: A Flexible and Efficient Tuning Paradigm via Unbinding Tuner from Backbone
NeurIPS 2023
0
citations
Customizable Image Synthesis with Multiple Subjects
NeurIPS 2023
0
citations
Compact Neural Volumetric Video Representations with Dynamic Codebooks
NeurIPS 2023
0
citations