Qifeng Chen
34
Papers
843
Total Citations
Papers (34)
Follow Your Pose: Pose-Guided Text-to-Video Generation Using Pose-Free Videos
AAAI 2024arXiv
276
citations
ScaleCrafter: Tuning-free Higher-Resolution Visual Generation with Diffusion Models
ICLR 2024
110
citations
Seeing and Hearing: Open-domain Visual-Audio Generation with Diffusion Latent Aligners
CVPR 2024
109
citations
DiT4Edit: Diffusion Transformer for Image Editing
AAAI 2025
69
citations
DiffSHEG: A Diffusion-Based Approach for Real-Time Speech-driven Holistic 3D Expression and Gesture Generation
CVPR 2024
62
citations
Make a Cheap Scaling: A Self-Cascade Diffusion Model for Higher-Resolution Adaptation
ECCV 2024
50
citations
VidMuse: A Simple Video-to-Music Generation Framework with Long-Short-Term Modeling
CVPR 2025
31
citations
MagicQuill: An Intelligent Interactive Image Editing System
CVPR 2025
25
citations
LeviTor: 3D Trajectory Oriented Image-to-Video Synthesis
CVPR 2025arXiv
25
citations
SPIRE: Semantic Prompt-Driven Image Restoration
ECCV 2024arXiv
19
citations
Exploring Sparse MoE in GANs for Text-conditioned Image Synthesis
CVPR 2025
15
citations
SkillMimic: Learning Basketball Interaction Skills from Demonstrations
CVPR 2025arXiv
13
citations
Robust Depth Enhancement via Polarization Prompt Fusion Tuning
CVPR 2024
11
citations
MagicColor: Multi-instance Sketch Colorization
ICCV 2025
10
citations
Automatic Controllable Colorization via Imagination
CVPR 2024
8
citations
Learning High-resolution Vector Representation from Multi-Camera Images for 3D Object Detection
ECCV 2024
5
citations
Hints of Prompt: Enhancing Visual Representation for Multimodal LLMs in Autonomous Driving
ICCV 2025arXiv
4
citations
RGE-GS: Reward-Guided Expansive Driving Scene Reconstruction via Diffusion Priors
ICCV 2025
1
citations
A Diffusion Model with State Estimation for Degradation-Blind Inverse Imaging
AAAI 2024
0
citations
CoDeF: Content Deformation Fields for Temporally Consistent Video Processing
CVPR 2024
0
citations
MangaNinja: Line Art Colorization with Precise Reference Following
CVPR 2025
0
citations
Gaussian Shell Maps for Efficient 3D Human Generation
CVPR 2024
0
citations
AvatarArtist: Open-Domain 4D Avatarization
CVPR 2025
0
citations
Using Left and Right Brains Together: Towards Vision and Language Planning
ICML 2024
0
citations
Edicho: Consistent Image Editing in the Wild
ICCV 2025
0
citations
SafetyDPO: Scalable Safety Alignment for Text-to-Image Generation
ICCV 2025
0
citations
EEdit : Rethinking the Spatial and Temporal Redundancy for Efficient Image Editing
ICCV 2025
0
citations
Instruction-based Image Editing with Planning, Reasoning, and Generation
ICCV 2025
0
citations
VideoVAE+: Large Motion Video Autoencoding with Cross-modal Video VAE
ICCV 2025
0
citations
Rethinking Layered Graphic Design Generation with a Top-Down Approach
ICCV 2025
0
citations
Infinite-Canvas: Higher-Resolution Video Outpainting with Extensive Content Generation
AAAI 2025
0
citations
VideoDPO: Omni-Preference Alignment for Video Diffusion Generation
CVPR 2025
0
citations
Follow-Your-Click: Open-domain Regional Image Animation via Motion Prompts
AAAI 2025
0
citations
Multitarget Device-Free Localization via Cross-Domain Wi-Fi RSS Training Data and Attentional Prior Fusion
AAAI 2024
0
citations