Deli Zhao
12
Papers
204
Total Citations
Papers (12)
Space Group Constrained Crystal Generation
ICLR 2024
60
citations
VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLM
CVPR 2025
40
citations
Latent Space Editing in Transformer-Based Flow Matching
AAAI 2024arXiv
38
citations
The Curse of Multi-Modalities: Evaluating Hallucinations of Large Multimodal Models across Language, Visual, and Audio
NeurIPS 2025
26
citations
Lipschitz Singularities in Diffusion Models
ICLR 2024
21
citations
MolSpectra: Pre-training 3D Molecular Representation with Multi-modal Energy Spectra
ICLR 2025
9
citations
Towards More Accurate Diffusion Model Acceleration with A Timestep Tuner
CVPR 2024
9
citations
Universally Invariant Learning in Equivariant GNNs
NeurIPS 2025
1
citations
AnyDoor: Zero-shot Object-level Image Customization
CVPR 2024
0
citations
2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining
ICCV 2025
0
citations
Towards Scalable Spatial Intelligence via 2D-to-3D Data Lifting
ICCV 2025
0
citations
Breaking the Memory Barrier of Contrastive Loss via Tile-Based Strategy
CVPR 2025
0
citations