Ming Dai
4
Papers
16
Total Citations
Papers (4)
Multi-task Visual Grounding with Coarse-to-Fine Consistency Constraints
AAAI 2025
13
citations
PropVG: End-to-End Proposal-Driven Visual Grounding with Multi-Granularity Discrimination
ICCV 2025
3
citations
DeRIS: Decoupling Perception and Cognition for Enhanced Referring Image Segmentation through Loopback Synergy
ICCV 2025
0
citations
ST3: Accelerating Multimodal Large Language Model by Spatial-Temporal Visual Token Trimming
AAAI 2025
0
citations