Ziqi Pang
5
Papers
150
Total Citations
Papers (5)
RandAR: Decoder-only Autoregressive Visual Generation in Random Orders
CVPR 2025
61
citations
Frozen Transformers in Language Models Are Effective Visual Encoder Layers
ICLR 2024
48
citations
GLUS: Global-Local Reasoning Unified into A Single Large Language Model for Video Segmentation
CVPR 2025
21
citations
RMem: Restricted Memory Banks Improve Video Object Segmentation
CVPR 2024
18
citations
AgMMU: A Comprehensive Agricultural Multimodal Understanding Benchmark
NeurIPS 2025
2
citations