Jiedong Zhuang
Papers: 4
Total Citations: 23
Papers (4)
Multi-task Visual Grounding with Coarse-to-Fine Consistency Constraints
AAAI 2025
13 citations
FALIP: Visual Prompt as Foveal Attention Boosts CLIP Zero-Shot Performance
ECCV 2024
7 citations
PropVG: End-to-End Proposal-Driven Visual Grounding with Multi-Granularity Discrimination
ICCV 2025
3 citations
ST3: Accelerating Multimodal Large Language Model by Spatial-Temporal Visual Token Trimming
AAAI 2025
0 citations