Shijian Lu
78
Papers
1,140
Total Citations
Papers (78)
Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive Decoding
CVPR 2024
449
citations
R1-VL: Learning to Reason with Multimodal Large Language Models via Step-wise Group Relative Policy Optimization
ICCV 2025
206
citations
Multiple Expert Brainstorming for Domain Adaptive Person Re-identification
ECCV 2020
188
citations
Efficient Test-Time Adaptation of Vision-Language Models
CVPR 2024
109
citations
FreGS: 3D Gaussian Splatting with Progressive Frequency Regularization
CVPR 2024
106
citations
LEED: Label-Free Expression Editing via Disentanglement
ECCV 2020
27
citations
The Curse of Multi-Modalities: Evaluating Hallucinations of Large Multimodal Models across Language, Visual, and Audio
NeurIPS 2025
26
citations
Weakly Supervised Monocular 3D Detection with a Single-View Image
CVPR 2024
12
citations
Backdoor Attacks Against No-Reference Image Quality Assessment Models via a Scalable Trigger
AAAI 2025
10
citations
DA-BEV: Unsupervised Domain Adaptation for Bird's Eye View Perception
ECCV 2024arXiv
6
citations
PCR-GS: COLMAP-Free 3D Gaussian Splatting via Pose Co-Regularizations
ICCV 2025
1
citations
Purify Unlearnable Examples via Rate-Constrained Variational Autoencoders
ICML 2024
0
citations
Discriminative Multi-Modal Feature Fusion for RGBD Indoor Scene Recognition
CVPR 2016
0
citations
ESIR: End-To-End Scene Text Recognition via Iterative Image Rectification
CVPR 2019
0
citations
Spatial Fusion GAN for Image Synthesis
CVPR 2019
0
citations
Towards Natural and Accurate Future Motion Prediction of Humans and Animals
CVPR 2019
0
citations
Cascade EF-GAN: Progressive Facial Expression Editing With Local Focuses
CVPR 2020
0
citations
Suppressing Uncertainties for Large-Scale Facial Expression Recognition
CVPR 2020arXiv
0
citations
AD-Cluster: Augmented Discriminative Clustering for Domain Adaptive Person Re-Identification
CVPR 2020
0
citations
Cross-View Regularization for Domain Adaptive Panoptic Segmentation
CVPR 2021arXiv
0
citations
Unbalanced Feature Transport for Exemplar-Based Image Translation
CVPR 2021arXiv
0
citations
FSDR: Frequency Space Domain Randomization for Domain Generalization
CVPR 2021arXiv
0
citations
Accelerating DETR Convergence via Semantic-Aligned Matching
CVPR 2022arXiv
0
citations
Category Contrast for Unsupervised Domain Adaptation in Visual Tasks
CVPR 2022arXiv
0
citations
Spectral Unsupervised Domain Adaptation for Visual Recognition
CVPR 2022arXiv
0
citations
Fourier Document Restoration for Robust Document Dewarping and Recognition
CVPR 2022arXiv
0
citations
Unbiased Subclass Regularization for Semi-Supervised Semantic Segmentation
CVPR 2022arXiv
0
citations
PTTR: Relational 3D Point Cloud Object Tracking With Transformer
CVPR 2022arXiv
0
citations
Marginal Contrastive Correspondence for Guided Image Generation
CVPR 2022arXiv
0
citations
Modulated Contrast for Versatile Image Synthesis
CVPR 2022arXiv
0
citations
Regularized Vector Quantization for Tokenized Image Synthesis
CVPR 2023arXiv
0
citations
FAC: 3D Representation Learning via Foreground Aware Feature Contrast
CVPR 2023arXiv
0
citations
DA-DETR: Domain Adaptive Detection Transformer With Information Fusion
CVPR 2023
0
citations
StyleRF: Zero-Shot 3D Style Transfer of Neural Radiance Fields
CVPR 2023arXiv
0
citations
3D Semantic Segmentation in the Wild: Learning Generalized Models for Adverse-Condition Point Clouds
CVPR 2023arXiv
0
citations
KD-DLGAN: Data Limited Image Generation via Knowledge Distillation
CVPR 2023
0
citations
Backdoor Attacks Against Deep Image Compression via Adaptive Frequency Trigger
CVPR 2023arXiv
0
citations
Towards Efficient Use of Multi-Scale Features in Transformer-Based Object Detectors
CVPR 2023arXiv
0
citations
UniDAformer: Unified Domain Adaptive Panoptic Segmentation Transformer via Hierarchical Mask Calibration
CVPR 2023arXiv
0
citations
Text Flow: A Unified Text Detection System in Natural Scene Images
ICCV 2015
0
citations
WeText: Scene Text Detection Under Weak Supervision
ICCV 2017arXiv
0
citations
TORNADO: A Spatio-Temporal Convolutional Regression Network for Video Action Proposal
ICCV 2017
0
citations
GA-DAN: Geometry-Aware Domain Adaptation Network for Scene Text Detection and Recognition
ICCV 2019
0
citations
Skeleton Cloud Colorization for Unsupervised 3D Action Representation Learning
ICCV 2021arXiv
0
citations
Domain Adaptive Video Segmentation via Temporal Consistency Regularization
ICCV 2021arXiv
0
citations
Unsupervised Domain Adaptive 3D Detection With Multi-Level Consistency
ICCV 2021arXiv
0
citations
WaveFill: A Wavelet-Based Generation Network for Image Inpainting
ICCV 2021arXiv
0
citations
Sparse Needlets for Lighting Estimation With Spherical Transport Loss
ICCV 2021arXiv
0
citations
RDA: Robust Domain Adaptation via Fourier Adversarial Attacking
ICCV 2021arXiv
0
citations
Pose-Free Neural Radiance Fields via Implicit Pose Regularization
ICCV 2023arXiv
0
citations
Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention
CVPR 2025
0
citations
WaveNeRF: Wavelet-based Generalizable Neural Radiance Fields
ICCV 2023arXiv
0
citations
Black-Box Unsupervised Domain Adaptation with Bi-Directional Atkinson-Shiffrin Memory
ICCV 2023arXiv
0
citations
Collaborative Learning of Gesture Recognition and 3D Hand Pose Estimation with Multi-Order Feature Analysis
ECCV 2020
0
citations
AMLN: Adversarial-based Mutual Learning Network for Online Knowledge Distillation
ECCV 2020
0
citations
Contextual-Relation Consistent Domain Adaptation for Semantic Segmentation
ECCV 2020
0
citations
Auto-Regressive Image Synthesis with Integrated Quantization
ECCV 2022
0
citations
Bi-Level Feature Alignment for Versatile Image Translation and Manipulation
ECCV 2022
0
citations
Language Matters: A Weakly Supervised Vision-Language Pre-training Approach for Scene Text Detection and Spotting
ECCV 2022
0
citations
Contextual Text Block Detection towards Scene Text Understanding
ECCV 2022
0
citations
Domain Adaptive Video Segmentation via Temporal Pseudo Supervision
ECCV 2022
0
citations
Domain Generalization via Balancing Training Difficulty and Model Capability
ICCV 2023arXiv
0
citations
SOGS: Second-Order Anchor for Advanced 3D Gaussian Splatting
CVPR 2025
0
citations
Spatial Preference Rewarding for MLLMs Spatial Understanding
ICCV 2025
0
citations
Versatile Transition Generation with Image-to-Video Diffusion
ICCV 2025
0
citations
Face Retouching with Diffusion Data Generation and Spectral Restorement
ICCV 2025
0
citations
TimeExpert: An Expert-Guided Video LLM for Video Temporal Grounding
ICCV 2025
0
citations
SMSTracker: Tri-path Score Mask Sigma Fusion for Multi-Modal Tracking
ICCV 2025
0
citations
PacGDC: Label-Efficient Generalizable Depth Completion with Projection Ambiguity and Consistency
ICCV 2025
0
citations
Modeling Continuous Motion for 3D Point Cloud Object Tracking
AAAI 2024arXiv
0
citations
Cross-Domain Few-Shot Segmentation via Iterative Support-Query Correspondence Mining
CVPR 2024
0
citations
Masked AutoDecoder is Effective Multi-Task Vision Generalist
CVPR 2024
0
citations
Model Adaptation: Historical Contrastive Learning for Unsupervised Domain Adaptation without Source Data
NeurIPS 2021
0
citations
Masked Generative Adversarial Networks are Data-Efficient Generation Learners
NeurIPS 2022
0
citations
PolarMix: A General Data Augmentation Technique for LiDAR Point Clouds
NeurIPS 2022
0
citations
Online Map Vectorization for Autonomous Driving: A Rasterization Perspective
NeurIPS 2023
0
citations
Weakly Supervised 3D Open-vocabulary Segmentation
NeurIPS 2023
0
citations
Rewrite Caption Semantics: Bridging Semantic Gaps for Language-Supervised Semantic Segmentation
NeurIPS 2023
0
citations