"multimodal feature interaction" Papers
2 papers found
Knowing Your Target: Target-Aware Transformer Makes Better Spatio-Temporal Video Grounding
Xin Gu, Yaojie Shen, Chenxi Luo et al.
ICLR 2025oralarXiv:2502.11168
8
citations
PC-Net: Weakly Supervised Compositional Moment Retrieval via Proposal-Centric Network
Mingyao Zhou, Hao Sun, Wei Xie et al.
NeurIPS 2025oral