Shijian Lu

78
Papers
1,140
Total Citations

Papers (78)

Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive Decoding

CVPR 2024
449
citations

R1-VL: Learning to Reason with Multimodal Large Language Models via Step-wise Group Relative Policy Optimization

ICCV 2025
206
citations

Multiple Expert Brainstorming for Domain Adaptive Person Re-identification

ECCV 2020
188
citations

Efficient Test-Time Adaptation of Vision-Language Models

CVPR 2024
109
citations

FreGS: 3D Gaussian Splatting with Progressive Frequency Regularization

CVPR 2024
106
citations

LEED: Label-Free Expression Editing via Disentanglement

ECCV 2020
27
citations

The Curse of Multi-Modalities: Evaluating Hallucinations of Large Multimodal Models across Language, Visual, and Audio

NeurIPS 2025
26
citations

Weakly Supervised Monocular 3D Detection with a Single-View Image

CVPR 2024
12
citations

Backdoor Attacks Against No-Reference Image Quality Assessment Models via a Scalable Trigger

AAAI 2025
10
citations

DA-BEV: Unsupervised Domain Adaptation for Bird's Eye View Perception

ECCV 2024arXiv
6
citations

PCR-GS: COLMAP-Free 3D Gaussian Splatting via Pose Co-Regularizations

ICCV 2025
1
citations

Purify Unlearnable Examples via Rate-Constrained Variational Autoencoders

ICML 2024
0
citations

Discriminative Multi-Modal Feature Fusion for RGBD Indoor Scene Recognition

CVPR 2016
0
citations

ESIR: End-To-End Scene Text Recognition via Iterative Image Rectification

CVPR 2019
0
citations

Spatial Fusion GAN for Image Synthesis

CVPR 2019
0
citations

Towards Natural and Accurate Future Motion Prediction of Humans and Animals

CVPR 2019
0
citations

Cascade EF-GAN: Progressive Facial Expression Editing With Local Focuses

CVPR 2020
0
citations

Suppressing Uncertainties for Large-Scale Facial Expression Recognition

CVPR 2020arXiv
0
citations

AD-Cluster: Augmented Discriminative Clustering for Domain Adaptive Person Re-Identification

CVPR 2020
0
citations

Cross-View Regularization for Domain Adaptive Panoptic Segmentation

CVPR 2021arXiv
0
citations

Unbalanced Feature Transport for Exemplar-Based Image Translation

CVPR 2021arXiv
0
citations

FSDR: Frequency Space Domain Randomization for Domain Generalization

CVPR 2021arXiv
0
citations

Accelerating DETR Convergence via Semantic-Aligned Matching

CVPR 2022arXiv
0
citations

Category Contrast for Unsupervised Domain Adaptation in Visual Tasks

CVPR 2022arXiv
0
citations

Spectral Unsupervised Domain Adaptation for Visual Recognition

CVPR 2022arXiv
0
citations

Fourier Document Restoration for Robust Document Dewarping and Recognition

CVPR 2022arXiv
0
citations

Unbiased Subclass Regularization for Semi-Supervised Semantic Segmentation

CVPR 2022arXiv
0
citations

PTTR: Relational 3D Point Cloud Object Tracking With Transformer

CVPR 2022arXiv
0
citations

Marginal Contrastive Correspondence for Guided Image Generation

CVPR 2022arXiv
0
citations

Modulated Contrast for Versatile Image Synthesis

CVPR 2022arXiv
0
citations

Regularized Vector Quantization for Tokenized Image Synthesis

CVPR 2023arXiv
0
citations

FAC: 3D Representation Learning via Foreground Aware Feature Contrast

CVPR 2023arXiv
0
citations

DA-DETR: Domain Adaptive Detection Transformer With Information Fusion

CVPR 2023
0
citations

StyleRF: Zero-Shot 3D Style Transfer of Neural Radiance Fields

CVPR 2023arXiv
0
citations

3D Semantic Segmentation in the Wild: Learning Generalized Models for Adverse-Condition Point Clouds

CVPR 2023arXiv
0
citations

KD-DLGAN: Data Limited Image Generation via Knowledge Distillation

CVPR 2023
0
citations

Backdoor Attacks Against Deep Image Compression via Adaptive Frequency Trigger

CVPR 2023arXiv
0
citations

Towards Efficient Use of Multi-Scale Features in Transformer-Based Object Detectors

CVPR 2023arXiv
0
citations

UniDAformer: Unified Domain Adaptive Panoptic Segmentation Transformer via Hierarchical Mask Calibration

CVPR 2023arXiv
0
citations

Text Flow: A Unified Text Detection System in Natural Scene Images

ICCV 2015
0
citations

WeText: Scene Text Detection Under Weak Supervision

ICCV 2017arXiv
0
citations

TORNADO: A Spatio-Temporal Convolutional Regression Network for Video Action Proposal

ICCV 2017
0
citations

GA-DAN: Geometry-Aware Domain Adaptation Network for Scene Text Detection and Recognition

ICCV 2019
0
citations

Skeleton Cloud Colorization for Unsupervised 3D Action Representation Learning

ICCV 2021arXiv
0
citations

Domain Adaptive Video Segmentation via Temporal Consistency Regularization

ICCV 2021arXiv
0
citations

Unsupervised Domain Adaptive 3D Detection With Multi-Level Consistency

ICCV 2021arXiv
0
citations

WaveFill: A Wavelet-Based Generation Network for Image Inpainting

ICCV 2021arXiv
0
citations

Sparse Needlets for Lighting Estimation With Spherical Transport Loss

ICCV 2021arXiv
0
citations

RDA: Robust Domain Adaptation via Fourier Adversarial Attacking

ICCV 2021arXiv
0
citations

Pose-Free Neural Radiance Fields via Implicit Pose Regularization

ICCV 2023arXiv
0
citations

Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention

CVPR 2025
0
citations

WaveNeRF: Wavelet-based Generalizable Neural Radiance Fields

ICCV 2023arXiv
0
citations

Black-Box Unsupervised Domain Adaptation with Bi-Directional Atkinson-Shiffrin Memory

ICCV 2023arXiv
0
citations

Collaborative Learning of Gesture Recognition and 3D Hand Pose Estimation with Multi-Order Feature Analysis

ECCV 2020
0
citations

AMLN: Adversarial-based Mutual Learning Network for Online Knowledge Distillation

ECCV 2020
0
citations

Contextual-Relation Consistent Domain Adaptation for Semantic Segmentation

ECCV 2020
0
citations

Auto-Regressive Image Synthesis with Integrated Quantization

ECCV 2022
0
citations

Bi-Level Feature Alignment for Versatile Image Translation and Manipulation

ECCV 2022
0
citations

Language Matters: A Weakly Supervised Vision-Language Pre-training Approach for Scene Text Detection and Spotting

ECCV 2022
0
citations

Contextual Text Block Detection towards Scene Text Understanding

ECCV 2022
0
citations

Domain Adaptive Video Segmentation via Temporal Pseudo Supervision

ECCV 2022
0
citations

Domain Generalization via Balancing Training Difficulty and Model Capability

ICCV 2023arXiv
0
citations

SOGS: Second-Order Anchor for Advanced 3D Gaussian Splatting

CVPR 2025
0
citations

Spatial Preference Rewarding for MLLMs Spatial Understanding

ICCV 2025
0
citations

Versatile Transition Generation with Image-to-Video Diffusion

ICCV 2025
0
citations

Face Retouching with Diffusion Data Generation and Spectral Restorement

ICCV 2025
0
citations

TimeExpert: An Expert-Guided Video LLM for Video Temporal Grounding

ICCV 2025
0
citations

SMSTracker: Tri-path Score Mask Sigma Fusion for Multi-Modal Tracking

ICCV 2025
0
citations

PacGDC: Label-Efficient Generalizable Depth Completion with Projection Ambiguity and Consistency

ICCV 2025
0
citations

Modeling Continuous Motion for 3D Point Cloud Object Tracking

AAAI 2024arXiv
0
citations

Cross-Domain Few-Shot Segmentation via Iterative Support-Query Correspondence Mining

CVPR 2024
0
citations

Masked AutoDecoder is Effective Multi-Task Vision Generalist

CVPR 2024
0
citations

Model Adaptation: Historical Contrastive Learning for Unsupervised Domain Adaptation without Source Data

NeurIPS 2021
0
citations

Masked Generative Adversarial Networks are Data-Efficient Generation Learners

NeurIPS 2022
0
citations

PolarMix: A General Data Augmentation Technique for LiDAR Point Clouds

NeurIPS 2022
0
citations

Online Map Vectorization for Autonomous Driving: A Rasterization Perspective

NeurIPS 2023
0
citations

Weakly Supervised 3D Open-vocabulary Segmentation

NeurIPS 2023
0
citations

Rewrite Caption Semantics: Bridging Semantic Gaps for Language-Supervised Semantic Segmentation

NeurIPS 2023
0
citations