Can Huang
4
Papers
16
Total Citations
Papers (4)
Dynamic-VLM: Simple Dynamic Visual Token Compression for VideoLLM
ICCV 2025
14
citations
GLOMA: Global Video Text Spotting with Morphological Association
ICLR 2025
2
citations
ParGo: Bridging Vision-Language with Partial and Global Views
AAAI 2025
0
citations
Multi-modal In-Context Learning Makes an Ego-evolving Scene Text Recognizer
CVPR 2024
0
citations