Xiaoqin Zhang

Papers: 16
Total citations: 28

Papers (16)

VSFormer: Visual-Spatial Fusion Transformer for Correspondence Pruning

AAAI 2024 (arXiv)
15
citations

Weakly Supervised Monocular 3D Detection with a Single-View Image

CVPR 2024 (arXiv)
12
citations

PCR-GS: COLMAP-Free 3D Gaussian Splatting via Pose Co-Regularizations

ICCV 2025 (arXiv)
1
citation

Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention

CVPR 2025 (arXiv)
0
citations

Spatial Preference Rewarding for MLLMs Spatial Understanding

ICCV 2025 (arXiv)
0
citations

Face Retouching with Diffusion Data Generation and Spectral Restorement

ICCV 2025
0
citations

SMSTracker: Tri-path Score Mask Sigma Fusion for Multi-Modal Tracking

ICCV 2025
0
citations

PacGDC: Label-Efficient Generalizable Depth Completion with Projection Ambiguity and Consistency

ICCV 2025 (arXiv)
0
citations

SGFormer: Semantic-Geometry Fusion Transformer for Multi-modal 3D Panoptic Segmentation

AAAI 2025
0
citations

Masked AutoDecoder is Effective Multi-Task Vision Generalist

CVPR 2024 (arXiv)
0
citations

FAC: 3D Representation Learning via Foreground Aware Feature Contrast

CVPR 2023 (arXiv)
0
citations

DA-DETR: Domain Adaptive Detection Transformer With Information Fusion

CVPR 2023
0
citations

Towards Efficient Use of Multi-Scale Features in Transformer-Based Object Detectors

CVPR 2023 (arXiv)
0
citations

UniDAformer: Unified Domain Adaptive Panoptic Segmentation Transformer via Hierarchical Mask Calibration

CVPR 2023 (arXiv)
0
citations

Pose-Free Neural Radiance Fields via Implicit Pose Regularization

ICCV 2023 (arXiv)
0
citations

WaveNeRF: Wavelet-based Generalizable Neural Radiance Fields

ICCV 2023 (arXiv)
0
citations