Dong Xu
9
Papers
26
Total Citations
Papers (9)
A Video is Worth 256 Bases: Spatial-Temporal Expectation-Maximization Inversion for Zero-Shot Video Editing
CVPR 2024
15
citations
CAD-Coder: Text-to-CAD Generation with Chain-of-Thought and Geometric Reward
NeurIPS 2025
9
citations
TCM-Ladder: A Benchmark for Multimodal Question Answering on Traditional Chinese Medicine
NeurIPS 2025arXiv
2
citations
Data-Free Generalized Zero-Shot Learning
AAAI 2024arXiv
0
citations
UFDA: Universal Federated Domain Adaptation with Practical Assumptions
AAAI 2024
0
citations
SVGDreamer: Text Guided SVG Generation with Diffusion Model
CVPR 2024
0
citations
Multi-Modality Affinity Inference for Weakly Supervised 3D Semantic Segmentation
AAAI 2024
0
citations
On-Device Diffusion Transformer Policy for Efficient Robot Manipulation
ICCV 2025
0
citations
Empowering LLMs to Understand and Generate Complex Vector Graphics
CVPR 2025
0
citations