Qifeng Chen

Papers: 102
Total Citations: 828

Papers (102)

Follow Your Pose: Pose-Guided Text-to-Video Generation Using Pose-Free Videos

AAAI 2024 (arXiv)
276
citations

ScaleCrafter: Tuning-free Higher-Resolution Visual Generation with Diffusion Models

ICLR 2024
110
citations

Seeing and Hearing: Open-domain Visual-Audio Generation with Diffusion Latent Aligners

CVPR 2024
109
citations

DiT4Edit: Diffusion Transformer for Image Editing

AAAI 2025
69
citations

DiffSHEG: A Diffusion-Based Approach for Real-Time Speech-driven Holistic 3D Expression and Gesture Generation

CVPR 2024
62
citations

Make a Cheap Scaling: A Self-Cascade Diffusion Model for Higher-Resolution Adaptation

ECCV 2024
50
citations

VidMuse: A Simple Video-to-Music Generation Framework with Long-Short-Term Modeling

CVPR 2025
31
citations

MagicQuill: An Intelligent Interactive Image Editing System

CVPR 2025
25
citations

LeviTor: 3D Trajectory Oriented Image-to-Video Synthesis

CVPR 2025 (arXiv)
25
citations

SPIRE: Semantic Prompt-Driven Image Restoration

ECCV 2024 (arXiv)
19
citations

Exploring Sparse MoE in GANs for Text-conditioned Image Synthesis

CVPR 2025
15
citations

SkillMimic: Learning Basketball Interaction Skills from Demonstrations

CVPR 2025
12
citations

Robust Depth Enhancement via Polarization Prompt Fusion Tuning

CVPR 2024
11
citations

Automatic Controllable Colorization via Imagination

CVPR 2024
8
citations

Learning High-resolution Vector Representation from Multi-Camera Images for 3D Object Detection

ECCV 2024
5
citations

RGE-GS: Reward-Guided Expansive Driving Scene Reconstruction via Diffusion Priors

ICCV 2025
1
citation

CoDeF: Content Deformation Fields for Temporally Consistent Video Processing

CVPR 2024
0
citations

Gaussian Shell Maps for Efficient 3D Human Generation

CVPR 2024
0
citations

Using Left and Right Brains Together: Towards Vision and Language Planning

ICML 2024
0
citations

Dense Monocular Depth Estimation in Complex Dynamic Scenes

CVPR 2016
0
citations

Full Flow: Optical Flow Estimation By Global Optimization Over Regular Grids

CVPR 2016
0
citations

Interactive Image Segmentation With Latent Diversity

CVPR 2018
0
citations

Learning to See in the Dark

CVPR 2018 (arXiv)
0
citations

Single Image Reflection Separation With Perceptual Losses

CVPR 2018 (arXiv)
0
citations

Semi-Parametric Image Synthesis

CVPR 2018 (arXiv)
0
citations

Fully Automatic Video Colorization With Self-Regularization and Diversity

CVPR 2019
0
citations

Zoom to Learn, Learn to Zoom

CVPR 2019
0
citations

3D Motion Decomposition for RGBD Future Dynamic Scene Synthesis

CVPR 2019
0
citations

Polarized Reflection Removal With Perfect Alignment in the Wild

CVPR 2020 (arXiv)
0
citations

Depth Sensing Beyond LiDAR Range

CVPR 2020 (arXiv)
0
citations

Dynamic Hierarchical Mimicking Towards Consistent Optimization Objectives

CVPR 2020 (arXiv)
0
citations

Future Video Synthesis With Object Motion Prediction

CVPR 2020 (arXiv)
0
citations

Image Inpainting With External-Internal Learning and Monochromic Bottleneck

CVPR 2021 (arXiv)
0
citations

Invertible Image Signal Processing

CVPR 2021 (arXiv)
0
citations

Involution: Inverting the Inherence of Convolution for Visual Recognition

CVPR 2021 (arXiv)
0
citations

Robust Reflection Removal With Reflection-Free Flash-Only Cues

CVPR 2021 (arXiv)
0
citations

FFB6D: A Full Flow Bidirectional Fusion Network for 6D Pose Estimation

CVPR 2021 (arXiv)
0
citations

Neural Camera Simulators

CVPR 2021 (arXiv)
0
citations

TPCN: Temporal Point Cloud Networks for Motion Forecasting

CVPR 2021 (arXiv)
0
citations

Shape From Polarization for Complex Scenes in the Wild

CVPR 2022 (arXiv)
0
citations

FS6D: Few-Shot 6D Pose Estimation of Novel Objects

CVPR 2022 (arXiv)
0
citations

Optimizing Video Prediction via Video Frame Interpolation

CVPR 2022
0
citations

High-Fidelity GAN Inversion for Image Attribute Editing

CVPR 2022 (arXiv)
0
citations

RODIN: A Generative Model for Sculpting 3D Digital Avatars Using Diffusion

CVPR 2023 (arXiv)
0
citations

MetaPortrait: Identity-Preserving Talking Head Generation With Fast Personalized Adaptation

CVPR 2023 (arXiv)
0
citations

Delving StyleGAN Inversion for Image Editing: A Foundation Latent Space Viewpoint

CVPR 2023 (arXiv)
0
citations

Enlarging Instance-Specific and Class-Specific Information for Open-Set Action Recognition

CVPR 2023 (arXiv)
0
citations

Learning 3D-Aware Image Synthesis With Unknown Pose Distribution

CVPR 2023 (arXiv)
0
citations

Blind Video Deflickering by Neural Filtering With a Flawed Atlas

CVPR 2023 (arXiv)
0
citations

Real-Time 6K Image Rescaling With Rate-Distortion Optimization

CVPR 2023 (arXiv)
0
citations

DynaFed: Tackling Client Data Heterogeneity With Global Dynamics

CVPR 2023 (arXiv)
0
citations

High-Fidelity 3D GAN Inversion by Pseudo-Multi-View Optimization

CVPR 2023 (arXiv)
0
citations

Robust Nonrigid Registration by Convex Optimization

ICCV 2015
0
citations

Photographic Image Synthesis With Cascaded Refinement Networks

ICCV 2017 (arXiv)
0
citations

Fast Image Processing With Fully-Convolutional Networks

ICCV 2017 (arXiv)
0
citations

Hiding Video in Audio via Reversible Generative Models

ICCV 2019
0
citations

Seeing Motion in the Dark

ICCV 2019
0
citations

Normalized Human Pose Features for Human Action Video Alignment

ICCV 2021
0
citations

IICNet: A Generic Framework for Reversible Image Conversion

ICCV 2021 (arXiv)
0
citations

Embedding Novel Views in a Single JPEG Image

ICCV 2021 (arXiv)
0
citations

DRINet: A Dual-Representation Iterative Learning Network for Point Cloud Segmentation

ICCV 2021 (arXiv)
0
citations

Dual-Camera Super-Resolution With Aligned Attention Modules

ICCV 2021 (arXiv)
0
citations

Internal Video Inpainting by Implicit Long-Range Propagation

ICCV 2021 (arXiv)
0
citations

LinkGAN: Linking GAN Latents to Pixels for Controllable Image Synthesis

ICCV 2023 (arXiv)
0
citations

Randomized Quantization: A Generic Augmentation for Data Agnostic Self-supervised Learning

ICCV 2023 (arXiv)
0
citations

Bootstrap Motion Forecasting With Self-Consistent Constraints

ICCV 2023 (arXiv)
0
citations

FateZero: Fusing Attentions for Zero-shot Text-based Video Editing

ICCV 2023 (arXiv)
0
citations

Deep Reinforced Attention Learning for Quality-Aware Visual Recognition

ECCV 2020
0
citations

PiP: Planning-informed Trajectory Prediction for Autonomous Driving

ECCV 2020
0
citations

PSConv: Squeezing Feature Pyramid into One Compact Poly-Scale Convolutional Layer

ECCV 2020
0
citations

Fully Convolutional Networks for Continuous Sign Language Recognition

ECCV 2020
0
citations

Learning to Learn Parameterized Classification Networks for Scalable Input Images

ECCV 2020
0
citations

3D-Aware Indoor Scene Synthesis with Depth Priors

ECCV 2022
0
citations

Optimizing Image Compression via Joint Learning with Denoising

ECCV 2022
0
citations

Real-Time Neural Character Rendering with Pose-Guided Multiplane Images

ECCV 2022
0
citations

Point Cloud Compression with Sibling Context and Surface Priors

ECCV 2022
0
citations

Efficient Point Cloud Segmentation with Geometry-Aware Sparse Networks

ECCV 2022
0
citations

Safety-Aware Motion Prediction With Unseen Vehicles for Autonomous Driving

ICCV 2021 (arXiv)
0
citations

AvatarArtist: Open-Domain 4D Avatarization

CVPR 2025
0
citations

MangaNinja: Line Art Colorization with Precise Reference Following

CVPR 2025
0
citations

VideoDPO: Omni-Preference Alignment for Video Diffusion Generation

CVPR 2025
0
citations

Hints of Prompt: Enhancing Visual Representation for Multimodal LLMs in Autonomous Driving

ICCV 2025
0
citations

Edicho: Consistent Image Editing in the Wild

ICCV 2025
0
citations

SafetyDPO: Scalable Safety Alignment for Text-to-Image Generation

ICCV 2025
0
citations

EEdit: Rethinking the Spatial and Temporal Redundancy for Efficient Image Editing

ICCV 2025
0
citations

Instruction-based Image Editing with Planning, Reasoning, and Generation

ICCV 2025
0
citations

VideoVAE+: Large Motion Video Autoencoding with Cross-modal Video VAE

ICCV 2025
0
citations

MagicColor: Multi-instance Sketch Colorization

ICCV 2025
0
citations

Rethinking Layered Graphic Design Generation with a Top-Down Approach

ICCV 2025
0
citations

Infinite-Canvas: Higher-Resolution Video Outpainting with Extensive Content Generation

AAAI 2025
0
citations

Follow-Your-Click: Open-domain Regional Image Animation via Motion Prompts

AAAI 2025
0
citations

Multitarget Device-Free Localization via Cross-Domain Wi-Fi RSS Training Data and Attentional Prior Fusion

AAAI 2024
0
citations

A Diffusion Model with State Estimation for Degradation-Blind Inverse Imaging

AAAI 2024
0
citations

Combinatorial Optimization with Graph Convolutional Networks and Guided Tree Search

NeurIPS 2018
0
citations

Blind Video Temporal Consistency via Deep Video Prior

NeurIPS 2020
0
citations

Low-Rank Subspaces in GANs

NeurIPS 2021
0
citations

Planning for Sample Efficient Imitation Learning

NeurIPS 2022
0
citations

Improving 3D-aware Image Synthesis with A Geometry-aware Discriminator

NeurIPS 2022
0
citations

One Model to Edit Them All: Free-Form Text-Driven Image Manipulation with Semantic Modulations

NeurIPS 2022
0
citations

AniFaceGAN: Animatable 3D-Aware Face Image Generation for Video Avatars

NeurIPS 2022
0
citations

TextDiffuser: Diffusion Models as Text Painters

NeurIPS 2023
0
citations

4D Panoptic Scene Graph Generation

NeurIPS 2023
0
citations