Tingyu Weng
3
Papers
23
Total Citations
Papers (3)
UFO: A Unified Approach to Fine-grained Visual Perception via Open-ended Language Interface
NeurIPS 2025, arXiv
14
citations
Aligned Better, Listen Better for Audio-Visual Large Language Models
ICLR 2025
8
citations
DynImg: Key Frames with Visual Prompts are Good Representation for Multi-Modal Video Understanding
ICCV 2025
1
citation