Xinlong Wang

23
Papers
654
Total Citations

Papers (23)

Generative Multimodal Models are In-Context Learners

CVPR 2024
422
citations

Uni3D: Exploring Unified 3D Representation at Scale

ICLR 2024
165
citations

You See it, You Got it: Learning 3D Creation on Pose-Free Videos at Scale

CVPR 2025arXiv
49
citations

EVEv2: Improved Baselines for Encoder-Free Vision-Language Models

ICCV 2025
18
citations

Repulsion Loss: Detecting Pedestrians in a Crowd

CVPR 2018arXiv
0
citations

Associatively Segmenting Instances and Semantics in Point Clouds

CVPR 2019
0
citations

End-to-End Video Instance Segmentation With Transformers

CVPR 2021arXiv
0
citations

Dense Contrastive Learning for Self-Supervised Visual Pre-Training

CVPR 2021arXiv
0
citations

BoxInst: High-Performance Instance Segmentation With Box Annotations

CVPR 2021arXiv
0
citations

FreeSOLO: Learning To Segment Objects Without Annotations

CVPR 2022arXiv
0
citations

EVA: Exploring the Limits of Masked Visual Representation Learning at Scale

CVPR 2023arXiv
0
citations

Images Speak in Images: A Generalist Painter for In-Context Visual Learning

CVPR 2023arXiv
0
citations

SegGPT: Towards Segmenting Everything in Context

ICCV 2023
0
citations

Affective Image Filter: Reflecting Emotions from Text to Images

ICCV 2023
0
citations

SOLO: Segmenting Objects by Locations

ECCV 2020
0
citations

Instance-Aware Embedding for Point Cloud Instance Segmentation

ECCV 2020
0
citations

Poseur: Direct Human Pose Regression with Transformers

ECCV 2022
0
citations

FCPose: Fully Convolutional Multi-Person Pose Estimation With Dynamic Instance-Aware Convolutions

CVPR 2021arXiv
0
citations

Unveiling Parts Beyond Objects: Towards Finer-Granularity Referring Expression Segmentation

CVPR 2024
0
citations

CapsFusion: Rethinking Image-Text Data at Scale

CVPR 2024
0
citations

Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

ICML 2024
0
citations

SOLOv2: Dynamic and Fast Instance Segmentation

NeurIPS 2020
0
citations

Fine-Grained Visual Prompting

NeurIPS 2023
0
citations