Qifeng Chen

34
Papers
843
Total Citations

Papers (34)

Follow Your Pose: Pose-Guided Text-to-Video Generation Using Pose-Free Videos

AAAI 2024 · arXiv
276
citations

ScaleCrafter: Tuning-free Higher-Resolution Visual Generation with Diffusion Models

ICLR 2024
110
citations

Seeing and Hearing: Open-domain Visual-Audio Generation with Diffusion Latent Aligners

CVPR 2024
109
citations

DiT4Edit: Diffusion Transformer for Image Editing

AAAI 2025
69
citations

DiffSHEG: A Diffusion-Based Approach for Real-Time Speech-driven Holistic 3D Expression and Gesture Generation

CVPR 2024
62
citations

Make a Cheap Scaling: A Self-Cascade Diffusion Model for Higher-Resolution Adaptation

ECCV 2024
50
citations

VidMuse: A Simple Video-to-Music Generation Framework with Long-Short-Term Modeling

CVPR 2025
31
citations

MagicQuill: An Intelligent Interactive Image Editing System

CVPR 2025
25
citations

LeviTor: 3D Trajectory Oriented Image-to-Video Synthesis

CVPR 2025 · arXiv
25
citations

SPIRE: Semantic Prompt-Driven Image Restoration

ECCV 2024 · arXiv
19
citations

Exploring Sparse MoE in GANs for Text-conditioned Image Synthesis

CVPR 2025
15
citations

SkillMimic: Learning Basketball Interaction Skills from Demonstrations

CVPR 2025 · arXiv
13
citations

Robust Depth Enhancement via Polarization Prompt Fusion Tuning

CVPR 2024
11
citations

MagicColor: Multi-instance Sketch Colorization

ICCV 2025
10
citations

Automatic Controllable Colorization via Imagination

CVPR 2024
8
citations

Learning High-resolution Vector Representation from Multi-Camera Images for 3D Object Detection

ECCV 2024
5
citations

Hints of Prompt: Enhancing Visual Representation for Multimodal LLMs in Autonomous Driving

ICCV 2025 · arXiv
4
citations

RGE-GS: Reward-Guided Expansive Driving Scene Reconstruction via Diffusion Priors

ICCV 2025
1
citations

A Diffusion Model with State Estimation for Degradation-Blind Inverse Imaging

AAAI 2024
0
citations

CoDeF: Content Deformation Fields for Temporally Consistent Video Processing

CVPR 2024
0
citations

MangaNinja: Line Art Colorization with Precise Reference Following

CVPR 2025
0
citations

Gaussian Shell Maps for Efficient 3D Human Generation

CVPR 2024
0
citations

AvatarArtist: Open-Domain 4D Avatarization

CVPR 2025
0
citations

Using Left and Right Brains Together: Towards Vision and Language Planning

ICML 2024
0
citations

Edicho: Consistent Image Editing in the Wild

ICCV 2025
0
citations

SafetyDPO: Scalable Safety Alignment for Text-to-Image Generation

ICCV 2025
0
citations

EEdit: Rethinking the Spatial and Temporal Redundancy for Efficient Image Editing

ICCV 2025
0
citations

Instruction-based Image Editing with Planning, Reasoning, and Generation

ICCV 2025
0
citations

VideoVAE+: Large Motion Video Autoencoding with Cross-modal Video VAE

ICCV 2025
0
citations

Rethinking Layered Graphic Design Generation with a Top-Down Approach

ICCV 2025
0
citations

Infinite-Canvas: Higher-Resolution Video Outpainting with Extensive Content Generation

AAAI 2025
0
citations

VideoDPO: Omni-Preference Alignment for Video Diffusion Generation

CVPR 2025
0
citations

Follow-Your-Click: Open-domain Regional Image Animation via Motion Prompts

AAAI 2025
0
citations

Multitarget Device-Free Localization via Cross-Domain Wi-Fi RSS Training Data and Attentional Prior Fusion

AAAI 2024
0
citations