Yanting Zhang
5 Papers
465 Total Citations
Papers (5)
MovieChat: From Dense Token to Sparse Memory for Long Video Understanding
CVPR 2024
457 citations
ViPOcc: Leveraging Visual Priors from Vision Foundation Models for Single-View 3D Occupancy Prediction
AAAI 2025
5 citations
ArchCAD-400K: A Large-Scale CAD drawings Dataset and New Baseline for Panoptic Symbol Spotting
NeurIPS 2025
2 citations
Point or Line? Using Line-based Representation for Panoptic Symbol Spotting in CAD Drawings
NeurIPS 2025
1 citation
UniAP: Towards Universal Animal Perception in Vision via Few-Shot Learning
AAAI 2024
0 citations