Xiaofang Wang
4
Papers
7
Total Citations
Papers (4)
Building a Mind Palace: Structuring Environment-Grounded Semantic Graphs for Effective Long Video Analysis with LLMs
CVPR 2025arXiv
7
citations
Accelerating Multimodal Large Language Models by Searching Optimal Vision Token Reduction
CVPR 2025
0
citations
Apollo: An Exploration of Video Understanding in Large Multimodal Models
CVPR 2025
0
citations
ControlRoom3D: Room Generation using Semantic Proxy Rooms
CVPR 2024
0
citations