Jing Bi
5
Papers
47
Total Citations
Papers (5)
Empowering LLMs with Pseudo-Untrimmed Videos for Audio-Visual Temporal Understanding
AAAI 2025
24
citations
VidComposition: Can MLLMs Analyze Compositions in Compiled Videos?
CVPR 2025
16
citations
MMPerspective: Do MLLMs Understand Perspective? A Comprehensive Benchmark for Perspective Perception, Reasoning, and Robustness
NeurIPS 2025
4
citations
ZeroSep: Separate Anything in Audio with Zero Training
NeurIPS 2025
3
citations
Unveiling Visual Perception in Language Models: An Attention Head Analysis Approach
CVPR 2025
0
citations