Qifeng Chen
102
Papers
828
Total Citations
Papers (102)
Follow Your Pose: Pose-Guided Text-to-Video Generation Using Pose-Free Videos
AAAI 2024 (arXiv)
276
citations
ScaleCrafter: Tuning-free Higher-Resolution Visual Generation with Diffusion Models
ICLR 2024
110
citations
Seeing and Hearing: Open-domain Visual-Audio Generation with Diffusion Latent Aligners
CVPR 2024
109
citations
DiT4Edit: Diffusion Transformer for Image Editing
AAAI 2025
69
citations
DiffSHEG: A Diffusion-Based Approach for Real-Time Speech-driven Holistic 3D Expression and Gesture Generation
CVPR 2024
62
citations
Make a Cheap Scaling: A Self-Cascade Diffusion Model for Higher-Resolution Adaptation
ECCV 2024
50
citations
VidMuse: A Simple Video-to-Music Generation Framework with Long-Short-Term Modeling
CVPR 2025
31
citations
MagicQuill: An Intelligent Interactive Image Editing System
CVPR 2025
25
citations
LeviTor: 3D Trajectory Oriented Image-to-Video Synthesis
CVPR 2025 (arXiv)
25
citations
SPIRE: Semantic Prompt-Driven Image Restoration
ECCV 2024 (arXiv)
19
citations
Exploring Sparse MoE in GANs for Text-conditioned Image Synthesis
CVPR 2025
15
citations
SkillMimic: Learning Basketball Interaction Skills from Demonstrations
CVPR 2025
12
citations
Robust Depth Enhancement via Polarization Prompt Fusion Tuning
CVPR 2024
11
citations
Automatic Controllable Colorization via Imagination
CVPR 2024
8
citations
Learning High-resolution Vector Representation from Multi-Camera Images for 3D Object Detection
ECCV 2024
5
citations
RGE-GS: Reward-Guided Expansive Driving Scene Reconstruction via Diffusion Priors
ICCV 2025
1
citations
CoDeF: Content Deformation Fields for Temporally Consistent Video Processing
CVPR 2024
0
citations
Gaussian Shell Maps for Efficient 3D Human Generation
CVPR 2024
0
citations
Using Left and Right Brains Together: Towards Vision and Language Planning
ICML 2024
0
citations
Dense Monocular Depth Estimation in Complex Dynamic Scenes
CVPR 2016
0
citations
Full Flow: Optical Flow Estimation By Global Optimization Over Regular Grids
CVPR 2016
0
citations
Interactive Image Segmentation With Latent Diversity
CVPR 2018
0
citations
Learning to See in the Dark
CVPR 2018 (arXiv)
0
citations
Single Image Reflection Separation With Perceptual Losses
CVPR 2018 (arXiv)
0
citations
Semi-Parametric Image Synthesis
CVPR 2018 (arXiv)
0
citations
Fully Automatic Video Colorization With Self-Regularization and Diversity
CVPR 2019
0
citations
Zoom to Learn, Learn to Zoom
CVPR 2019
0
citations
3D Motion Decomposition for RGBD Future Dynamic Scene Synthesis
CVPR 2019
0
citations
Polarized Reflection Removal With Perfect Alignment in the Wild
CVPR 2020 (arXiv)
0
citations
Depth Sensing Beyond LiDAR Range
CVPR 2020 (arXiv)
0
citations
Dynamic Hierarchical Mimicking Towards Consistent Optimization Objectives
CVPR 2020 (arXiv)
0
citations
Future Video Synthesis With Object Motion Prediction
CVPR 2020 (arXiv)
0
citations
Image Inpainting With External-Internal Learning and Monochromic Bottleneck
CVPR 2021 (arXiv)
0
citations
Invertible Image Signal Processing
CVPR 2021 (arXiv)
0
citations
Involution: Inverting the Inherence of Convolution for Visual Recognition
CVPR 2021 (arXiv)
0
citations
Robust Reflection Removal With Reflection-Free Flash-Only Cues
CVPR 2021 (arXiv)
0
citations
FFB6D: A Full Flow Bidirectional Fusion Network for 6D Pose Estimation
CVPR 2021 (arXiv)
0
citations
Neural Camera Simulators
CVPR 2021 (arXiv)
0
citations
TPCN: Temporal Point Cloud Networks for Motion Forecasting
CVPR 2021 (arXiv)
0
citations
Shape From Polarization for Complex Scenes in the Wild
CVPR 2022 (arXiv)
0
citations
FS6D: Few-Shot 6D Pose Estimation of Novel Objects
CVPR 2022 (arXiv)
0
citations
Optimizing Video Prediction via Video Frame Interpolation
CVPR 2022
0
citations
High-Fidelity GAN Inversion for Image Attribute Editing
CVPR 2022 (arXiv)
0
citations
RODIN: A Generative Model for Sculpting 3D Digital Avatars Using Diffusion
CVPR 2023 (arXiv)
0
citations
MetaPortrait: Identity-Preserving Talking Head Generation With Fast Personalized Adaptation
CVPR 2023 (arXiv)
0
citations
Delving StyleGAN Inversion for Image Editing: A Foundation Latent Space Viewpoint
CVPR 2023 (arXiv)
0
citations
Enlarging Instance-Specific and Class-Specific Information for Open-Set Action Recognition
CVPR 2023 (arXiv)
0
citations
Learning 3D-Aware Image Synthesis With Unknown Pose Distribution
CVPR 2023 (arXiv)
0
citations
Blind Video Deflickering by Neural Filtering With a Flawed Atlas
CVPR 2023 (arXiv)
0
citations
Real-Time 6K Image Rescaling With Rate-Distortion Optimization
CVPR 2023 (arXiv)
0
citations
DynaFed: Tackling Client Data Heterogeneity With Global Dynamics
CVPR 2023 (arXiv)
0
citations
High-Fidelity 3D GAN Inversion by Pseudo-Multi-View Optimization
CVPR 2023 (arXiv)
0
citations
Robust Nonrigid Registration by Convex Optimization
ICCV 2015
0
citations
Photographic Image Synthesis With Cascaded Refinement Networks
ICCV 2017 (arXiv)
0
citations
Fast Image Processing With Fully-Convolutional Networks
ICCV 2017 (arXiv)
0
citations
Hiding Video in Audio via Reversible Generative Models
ICCV 2019
0
citations
Seeing Motion in the Dark
ICCV 2019
0
citations
Normalized Human Pose Features for Human Action Video Alignment
ICCV 2021
0
citations
IICNet: A Generic Framework for Reversible Image Conversion
ICCV 2021 (arXiv)
0
citations
Embedding Novel Views in a Single JPEG Image
ICCV 2021 (arXiv)
0
citations
DRINet: A Dual-Representation Iterative Learning Network for Point Cloud Segmentation
ICCV 2021 (arXiv)
0
citations
Dual-Camera Super-Resolution With Aligned Attention Modules
ICCV 2021 (arXiv)
0
citations
Internal Video Inpainting by Implicit Long-Range Propagation
ICCV 2021 (arXiv)
0
citations
LinkGAN: Linking GAN Latents to Pixels for Controllable Image Synthesis
ICCV 2023 (arXiv)
0
citations
Randomized Quantization: A Generic Augmentation for Data Agnostic Self-supervised Learning
ICCV 2023 (arXiv)
0
citations
Bootstrap Motion Forecasting With Self-Consistent Constraints
ICCV 2023 (arXiv)
0
citations
FateZero: Fusing Attentions for Zero-shot Text-based Video Editing
ICCV 2023 (arXiv)
0
citations
Deep Reinforced Attention Learning for Quality-Aware Visual Recognition
ECCV 2020
0
citations
PiP: Planning-informed Trajectory Prediction for Autonomous Driving
ECCV 2020
0
citations
PSConv: Squeezing Feature Pyramid into One Compact Poly-Scale Convolutional Layer
ECCV 2020
0
citations
Fully Convolutional Networks for Continuous Sign Language Recognition
ECCV 2020
0
citations
Learning to Learn Parameterized Classification Networks for Scalable Input Images
ECCV 2020
0
citations
3D-Aware Indoor Scene Synthesis with Depth Priors
ECCV 2022
0
citations
Optimizing Image Compression via Joint Learning with Denoising
ECCV 2022
0
citations
Real-Time Neural Character Rendering with Pose-Guided Multiplane Images
ECCV 2022
0
citations
Point Cloud Compression with Sibling Context and Surface Priors
ECCV 2022
0
citations
Efficient Point Cloud Segmentation with Geometry-Aware Sparse Networks
ECCV 2022
0
citations
Safety-Aware Motion Prediction With Unseen Vehicles for Autonomous Driving
ICCV 2021 (arXiv)
0
citations
AvatarArtist: Open-Domain 4D Avatarization
CVPR 2025
0
citations
MangaNinja: Line Art Colorization with Precise Reference Following
CVPR 2025
0
citations
VideoDPO: Omni-Preference Alignment for Video Diffusion Generation
CVPR 2025
0
citations
Hints of Prompt: Enhancing Visual Representation for Multimodal LLMs in Autonomous Driving
ICCV 2025
0
citations
Edicho: Consistent Image Editing in the Wild
ICCV 2025
0
citations
SafetyDPO: Scalable Safety Alignment for Text-to-Image Generation
ICCV 2025
0
citations
EEdit: Rethinking the Spatial and Temporal Redundancy for Efficient Image Editing
ICCV 2025
0
citations
Instruction-based Image Editing with Planning, Reasoning, and Generation
ICCV 2025
0
citations
VideoVAE+: Large Motion Video Autoencoding with Cross-modal Video VAE
ICCV 2025
0
citations
MagicColor: Multi-instance Sketch Colorization
ICCV 2025
0
citations
Rethinking Layered Graphic Design Generation with a Top-Down Approach
ICCV 2025
0
citations
Infinite-Canvas: Higher-Resolution Video Outpainting with Extensive Content Generation
AAAI 2025
0
citations
Follow-Your-Click: Open-domain Regional Image Animation via Motion Prompts
AAAI 2025
0
citations
Multitarget Device-Free Localization via Cross-Domain Wi-Fi RSS Training Data and Attentional Prior Fusion
AAAI 2024
0
citations
A Diffusion Model with State Estimation for Degradation-Blind Inverse Imaging
AAAI 2024
0
citations
Combinatorial Optimization with Graph Convolutional Networks and Guided Tree Search
NeurIPS 2018
0
citations
Blind Video Temporal Consistency via Deep Video Prior
NeurIPS 2020
0
citations
Low-Rank Subspaces in GANs
NeurIPS 2021
0
citations
Planning for Sample Efficient Imitation Learning
NeurIPS 2022
0
citations
Improving 3D-aware Image Synthesis with A Geometry-aware Discriminator
NeurIPS 2022
0
citations
One Model to Edit Them All: Free-Form Text-Driven Image Manipulation with Semantic Modulations
NeurIPS 2022
0
citations
AniFaceGAN: Animatable 3D-Aware Face Image Generation for Video Avatars
NeurIPS 2022
0
citations
TextDiffuser: Diffusion Models as Text Painters
NeurIPS 2023
0
citations
4D Panoptic Scene Graph Generation
NeurIPS 2023
0
citations