Dongdong Chen
13
Papers
92
Total Citations
Papers (13)
OmniViD: A Generative Framework for Universal Video Understanding
CVPR 2024
29
citations
Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation
ECCV 2024arXiv
17
citations
VLM4D: Towards Spatiotemporal Awareness in Vision Language Models
ICCV 2025
15
citations
FreeFlux: Understanding and Exploiting Layer-Specific Roles in RoPE-Based MMDiT for Versatile Image Editing
ICCV 2025
12
citations
SmartEraser: Remove Anything from Images using Masked-Region Guidance
CVPR 2025
12
citations
Olympus: A Universal Task Router for Computer Vision Tasks
CVPR 2025
3
citations
UNICL-SAM: Uncertainty-Driven In-Context Segmentation with Part Prototype Discovery
CVPR 2025
3
citations
Exploring Invariance in Images through One-way Wave Equations
ICML 2025
1
citations
I2V3D: Controllable Image-to-video Generation with 3D Guidance
ICCV 2025
0
citations
Equivariant Multi-Modality Image Fusion
CVPR 2024
0
citations
Towards More Unified In-context Visual Understanding
CVPR 2024
0
citations
Show and Segment: Universal Medical Image Segmentation via In-Context Learning
CVPR 2025
0
citations
Image Fusion via Vision-Language Model
ICML 2024
0
citations