Zuxuan Wu

23
Papers
350
Total Citations

Papers (23)

SimDA: Simple Diffusion Adapter for Efficient Video Generation

CVPR 2024
106
citations

StableAnimator: High-Quality Identity-Preserving Human Image Animation

CVPR 2025
59
citations

CreatiLayout: Siamese Multimodal Diffusion Transformer for Creative Layout-to-Image Generation

ICCV 2025arXiv
33
citations

OmniViD: A Generative Framework for Universal Video Understanding

CVPR 2024
29
citations

AID: Adapting Image2Video Diffusion Models for Instruction-guided Video Prediction

ICCV 2025
24
citations

MotionFollower: Editing Video Motion via Score-Guided Diffusion

ICCV 2025
22
citations

PromptFusion: Decoupling Stability and Plasticity for Continual Learning

ECCV 2024
21
citations

AdaDiff: Adaptive Step Selection for Fast Diffusion Models

AAAI 2025
19
citations

MagicMotion: Controllable Video Generation with Dense-to-Sparse Trajectory Guidance

ICCV 2025
17
citations

EDEN: Enhanced Diffusion for High-quality Large-motion Video Frame Interpolation

CVPR 2025
8
citations

Learning to Rank Patches for Unbiased Image Redundancy Reduction

CVPR 2024
6
citations

REDUCIO! Generating 1K Video within 16 Seconds using Extremely Compressed Motion Latents

ICCV 2025
5
citations

Achieving More with Less: Additive Prompt Tuning for Rehearsal-Free Class-Incremental Learning

ICCV 2025
1
citations

MotionEditor: Editing Video Motion via Content-Aware Diffusion

CVPR 2024
0
citations

BlockDance: Reuse Structurally Similar Spatio-Temporal Features to Accelerate Diffusion Transformers

CVPR 2025
0
citations

VLABench: A Large-Scale Benchmark for Language-Conditioned Robotics Manipulation with Long-Horizon Reasoning Tasks

ICCV 2025
0
citations

Rethinking Discrete Tokens: Treating Them as Conditions for Continuous Autoregressive Image Synthesis

ICCV 2025
0
citations

Hydra-NeXt: Robust Closed-Loop Driving with Open-Loop Training

ICCV 2025
0
citations

Comprehensive Multi-Modal Prototypes Are Simple and Effective Classifiers for Vast-Vocabulary Object Detection

AAAI 2025
0
citations

FNIN: A Fourier Neural Operator-based Numerical Integration Network for Surface-from-gradients

AAAI 2025
0
citations

FOCUS: Towards Universal Foreground Segmentation

AAAI 2025
0
citations

Synthesize Diagnose and Optimize: Towards Fine-Grained Vision-Language Understanding

CVPR 2024
0
citations

BEVNeXt: Reviving Dense BEV Frameworks for 3D Object Detection

CVPR 2024
0
citations