Xueyan Zou
7
Papers
168
Total Citations
Papers (7)
LLaVA-Grounding: Grounded Visual Chat with Large Multimodal Models
ECCV 2024arXiv
114
citations
Visual In-Context Prompting
CVPR 2024
52
citations
3D-SPATIAL MULTIMODAL MEMORY
ICLR 2025
2
citations
Progressive Temporal Feature Alignment Network for Video Inpainting
CVPR 2021arXiv
0
citations
Generalized Decoding for Pixel, Image, and Language
CVPR 2023arXiv
0
citations
A Simple Framework for Open-Vocabulary Segmentation and Detection
ICCV 2023arXiv
0
citations
Segment Everything Everywhere All at Once
NeurIPS 2023
0
citations