Yiming Zhang
9
Papers
102
Total Citations
Papers (9)
STI-Bench: Are MLLMs Ready for Precise Spatial-Temporal World Understanding?
ICCV 2025
36
citations
Persistent Pre-training Poisoning of LLMs
ICLR 2025
34
citations
VLIPP: Towards Physically Plausible Video Generation with Vision and Language Informed Physical Prior
ICCV 2025, arXiv
17
citations
RANKCLIP: Ranking-Consistent Language-Image Pretraining
ICCV 2025
10
citations
InfiGFusion: Graph-on-Logits Distillation via Efficient Gromov-Wasserstein for Model Fusion
NeurIPS 2025
2
citations
Towards Large-Scale In-Context Reinforcement Learning by Meta-Training in Randomized Worlds
NeurIPS 2025
2
citations
Table as a Modality for Large Language Models
NeurIPS 2025
1
citation
Beyond Training: Dynamic Token Merging for Zero-Shot Video Understanding
ICCV 2025
0
citations
PIA: Your Personalized Image Animator via Plug-and-Play Modules in Text-to-Image Models
CVPR 2024
0
citations