Xinggang Wang

53
Papers
1,897
Total Citations

Papers (53)

4D Gaussian Splatting for Real-Time Dynamic Scene Rendering

CVPR 2024
1,061
citations

GaussianDreamer: Fast Generation from Text to 3D Gaussians by Bridging 2D and 3D Diffusion Models

CVPR 2024
241
citations

Boundary-preserving Mask R-CNN

ECCV 2020
237
citations

Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models

CVPR 2025
159
citations

Lane Graph as Path: Continuity-preserving Path-wise Modeling for Online Lane Graph Construction

ECCV 2024
56
citations

RAD: Training an End-to-End Driving Policy via Large-Scale 3DGS-based Reinforcement Learning

NeurIPS 2025
43
citations

DiG: Scalable and Efficient Diffusion Models with Gated Linear Attention

CVPR 2025
38
citations

GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial Understanding

CVPR 2025
30
citations

MobileInst: Video Instance Segmentation on the Mobile

AAAI 2024
10
citations

ViG: Linear-complexity Visual Sequence Learning with Gated Linear Attention

AAAI 2025
8
citations

Mask-Adapter: The Devil is in the Masks for Open-Vocabulary Segmentation

CVPR 2025
7
citations

GroundingSuite: Measuring Complex Multi-Granular Pixel Grounding

ICCV 2025
5
citations

MaTVLM: Hybrid Mamba-Transformer for Efficient Vision-Language Modeling

ICCV 2025
2
citations

Human De-Occlusion: Invisible Perception and Recovery for Humans

CVPR 2021
0
citations

Weakly-Supervised Instance Segmentation via Class-Agnostic Learning With Salient Images

CVPR 2021
0
citations

MSG-Transformer: Exchanging Local Spatial Information by Manipulating Messenger Tokens

CVPR 2022
0
citations

AziNorm: Exploiting the Radial Symmetry of Point Cloud for Azimuth-Normalized 3D Perception

CVPR 2022
0
citations

Sparse Instance Activation for Real-Time Instance Segmentation

CVPR 2022
0
citations

Temporally Efficient Vision Transformer for Video Instance Segmentation

CVPR 2022
0
citations

TopFormer: Token Pyramid Transformer for Mobile Semantic Segmentation

CVPR 2022
0
citations

PD-Quant: Post-Training Quantization Based on Prediction Difference Metric

CVPR 2023
0
citations

EVA: Exploring the Limits of Masked Visual Representation Learning at Scale

CVPR 2023
0
citations

BoxTeacher: Exploring High-Quality Pseudo Labels for Weakly Supervised Instance Segmentation

CVPR 2023
0
citations

RILS: Masked Visual Reconstruction in Language Semantic Space

CVPR 2023
0
citations

Boosting Low-Data Instance Segmentation by Unsupervised Pre-Training With Saliency Prompt

CVPR 2023
0
citations

Relaxed Multiple-Instance SVM With Application to Object Discovery

ICCV 2015
0
citations

Object-Level Proposals

ICCV 2017
0
citations

CCNet: Criss-Cross Attention for Semantic Segmentation

ICCV 2019
0
citations

Instances As Queries

ICCV 2021
0
citations

Crossover Learning for Fast Online Video Instance Segmentation

ICCV 2021
0
citations

Hierarchical Aggregation for 3D Instance Segmentation

ICCV 2021
0
citations

TinyCLIP: CLIP Distillation via Affinity Mimicking and Weight Inheritance

ICCV 2023
0
citations

Query6DoF: Learning Sparse Queries as Implicit Shape Prior for Category-Level 6DoF Pose Estimation

ICCV 2023
0
citations

Unleashing Vanilla Vision Transformer with Masked Image Modeling for Object Detection

ICCV 2023
0
citations

VAD: Vectorized Scene Representation for Efficient Autonomous Driving

ICCV 2023
0
citations

ByteTrack: Multi-Object Tracking by Associating Every Detection Box

ECCV 2022
0
citations

Robust Multi-Object Tracking by Marginal Inference

ECCV 2022
0
citations

AiATrack: Attention in Attention for Transformer Visual Tracking

ECCV 2022
0
citations

Context-Sensitive Temporal Feature Learning for Gait Recognition

ICCV 2021
0
citations

DiffusionDrive: Truncated Diffusion Model for End-to-End Autonomous Driving

CVPR 2025
0
citations

YOLO-World: Real-Time Open-Vocabulary Object Detection

CVPR 2024
0
citations

Symphonize 3D Semantic Scene Completion with Contextual Instance Queries

CVPR 2024
0
citations

Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

ICML 2024
0
citations

DeepContour: A Deep Convolutional Feature Learned by Positive-Sharing Loss for Contour Detection

CVPR 2015
0
citations

Robust Scene Text Recognition With Automatic Rectification

CVPR 2016
0
citations

Multiple Instance Detection Network With Online Instance Classifier Refinement

CVPR 2017
0
citations

Weakly-Supervised Semantic Segmentation Network With Deep Seeded Region Growing

CVPR 2018
0
citations

RENAS: Reinforced Evolutionary Neural Architecture Search

CVPR 2019
0
citations

Mask Scoring R-CNN

CVPR 2019
0
citations

Direct Object Recognition Without Line-Of-Sight Using Optical Coherence

CVPR 2019
0
citations

Densely Connected Search Space for More Flexible Neural Architecture Search

CVPR 2020
0
citations

You Only Look at One Sequence: Rethinking Transformer in Vision through Object Detection

NeurIPS 2021
0
citations

Circuit as Set of Points

NeurIPS 2023
0
citations