Heng Wang
5
Papers
74
Total Citations
Papers (5)
V2A-Mapper: A Lightweight Solution for Vision-to-Audio Generation by Connecting Foundation Models
AAAI 2024arXiv
74
citations
ROS-SAM: High-Quality Interactive Segmentation for Remote Sensing Moving Object
CVPR 2025
0
citations
Stitching Segments and Sentences towards Generalization in Video-Text Pre-training
AAAI 2024
0
citations
Video Recognition in Portrait Mode
CVPR 2024
0
citations
VISTA-LLAMA: Reducing Hallucination in Video Language Models via Equal Distance to Visual Tokens
CVPR 2024
0
citations