Yufei Zhan
4
Papers
32
Total Citations
Papers (4)
Griffon: Spelling out All Object Locations at Any Granularity with Large Language Models
ECCV 2024
30
citations
Seeing is Believing? Mitigating OCR Hallucinations in Multimodal Large Language Models
NeurIPS 2025
2
citations
Griffon v2: Advancing Multimodal Perception with High-Resolution Scaling and Visual-Language Co-Referring
ICCV 2025
0
citations
FOCUS: Unified Vision-Language Modeling for Interactive Editing Driven by Referential Segmentation
NeurIPS 2025arXiv
0
citations