Hongfa Wang
5
Papers
360
Total Citations
Papers (5)
LanguageBind: Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment
ICLR 2024
343
citations
InstructSeg: Unifying Instructed Visual Segmentation with Multi-modal Large Language Models
ICCV 2025
12
citations
CoHD: A Counting-Aware Hierarchical Decoding Framework for Generalized Referring Expression Segmentation
ICCV 2025
5
citations
Infinite-Canvas: Higher-Resolution Video Outpainting with Extensive Content Generation
AAAI 2025
0
citations
Follow-Your-Click: Open-domain Regional Image Animation via Motion Prompts
AAAI 2025
0
citations