Jiannan Wu
7
Papers
2,210
Total Citations
Papers (7)
InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks
CVPR 2024
2,210
citations
Language As Queries for Referring Video Object Segmentation
CVPR 2022 (arXiv)
0
citations
Universal Instance Perception As Object Discovery and Retrieval
CVPR 2023 (arXiv)
0
citations
Watch Only Once: An End-to-End Video Action Detection Framework
ICCV 2021
0
citations
Segment Every Reference Object in Spatial and Temporal Spaces
ICCV 2023
0
citations
Exploring Transformers for Open-world Instance Segmentation
ICCV 2023 (arXiv)
0
citations
VisionLLM: Large Language Model is also an Open-Ended Decoder for Vision-Centric Tasks
NeurIPS 2023
0
citations