Yun Xing
4
Papers
54
Total Citations
Papers (4)
The Curse of Multi-Modalities: Evaluating Hallucinations of Large Multimodal Models across Language, Visual, and Audio
NeurIPS 2025
26
citations
HENet: Hybrid Encoding for End-to-end Multi-task 3D Perception from Multi-view Cameras
ECCV 2024arXiv
19
citations
SceneTAP: Scene-Coherent Typographic Adversarial Planner against Vision-Language Models in Real-World Environments
CVPR 2025
9
citations
Cross-Domain Few-Shot Segmentation via Iterative Support-Query Correspondence Mining
CVPR 2024
0
citations