Zuxuan Wu
23
Papers
350
Total Citations
Papers (23)
SimDA: Simple Diffusion Adapter for Efficient Video Generation
CVPR 2024
106
citations
StableAnimator: High-Quality Identity-Preserving Human Image Animation
CVPR 2025
59
citations
CreatiLayout: Siamese Multimodal Diffusion Transformer for Creative Layout-to-Image Generation
ICCV 2025arXiv
33
citations
OmniViD: A Generative Framework for Universal Video Understanding
CVPR 2024
29
citations
AID: Adapting Image2Video Diffusion Models for Instruction-guided Video Prediction
ICCV 2025
24
citations
MotionFollower: Editing Video Motion via Score-Guided Diffusion
ICCV 2025
22
citations
PromptFusion: Decoupling Stability and Plasticity for Continual Learning
ECCV 2024
21
citations
AdaDiff: Adaptive Step Selection for Fast Diffusion Models
AAAI 2025
19
citations
MagicMotion: Controllable Video Generation with Dense-to-Sparse Trajectory Guidance
ICCV 2025
17
citations
EDEN: Enhanced Diffusion for High-quality Large-motion Video Frame Interpolation
CVPR 2025
8
citations
Learning to Rank Patches for Unbiased Image Redundancy Reduction
CVPR 2024
6
citations
REDUCIO! Generating 1K Video within 16 Seconds using Extremely Compressed Motion Latents
ICCV 2025
5
citations
Achieving More with Less: Additive Prompt Tuning for Rehearsal-Free Class-Incremental Learning
ICCV 2025
1
citations
MotionEditor: Editing Video Motion via Content-Aware Diffusion
CVPR 2024
0
citations
BlockDance: Reuse Structurally Similar Spatio-Temporal Features to Accelerate Diffusion Transformers
CVPR 2025
0
citations
VLABench: A Large-Scale Benchmark for Language-Conditioned Robotics Manipulation with Long-Horizon Reasoning Tasks
ICCV 2025
0
citations
Rethinking Discrete Tokens: Treating Them as Conditions for Continuous Autoregressive Image Synthesis
ICCV 2025
0
citations
Hydra-NeXt: Robust Closed-Loop Driving with Open-Loop Training
ICCV 2025
0
citations
Comprehensive Multi-Modal Prototypes Are Simple and Effective Classifiers for Vast-Vocabulary Object Detection
AAAI 2025
0
citations
FNIN: A Fourier Neural Operator-based Numerical Integration Network for Surface-from-gradients
AAAI 2025
0
citations
FOCUS: Towards Universal Foreground Segmentation
AAAI 2025
0
citations
Synthesize Diagnose and Optimize: Towards Fine-Grained Vision-Language Understanding
CVPR 2024
0
citations
BEVNeXt: Reviving Dense BEV Frameworks for 3D Object Detection
CVPR 2024
0
citations