Yongqin Xian
6
Papers
51
Total Citations
Papers (6)
PALM: Predicting Actions through Language Models
ECCV 2024arXiv
22
citations
Omnia de EgoTempo: Benchmarking Temporal Understanding of Multi-Modal LLMs in Egocentric Videos
CVPR 2025
14
citations
Active Data Curation Effectively Distills Large-Scale Multimodal Models
CVPR 2025
14
citations
UIP2P: Unsupervised Instruction-based Image Editing via Edit Reversibility Constraint
ICCV 2025arXiv
1
citations
LOGICZSL: Exploring Logic-induced Representation for Compositional Zero-shot Learning
CVPR 2025
0
citations
MOBIUS: Big-to-Mobile Universal Instance Segmentation via Multi-modal Bottleneck Fusion and Calibrated Decoder Pruning
ICCV 2025
0
citations