Guanbin Li
87
Papers
131
Total Citations
Papers (87)
AlignSAM: Aligning Segment Anything Model to Open Context via Reinforcement Learning
CVPR 2024
24
citations
NeRF-HuGS: Improved Neural Radiance Fields in Non-static Scenes Using Heuristics-Guided Segmentation
CVPR 2024
20
citations
Decoupled Pseudo-labeling for Semi-Supervised Monocular 3D Object Detection
CVPR 2024
19
citations
OVER-NAV: Elevating Iterative Vision-and-Language Navigation with Open-Vocabulary Detection and StructurEd Representation
CVPR 2024
15
citations
Cell Graph Transformer for Nuclei Classification
AAAI 2024arXiv
12
citations
Rethinking Query-based Transformer for Continual Image Segmentation
CVPR 2025
9
citations
GeoSplatting: Towards Geometry Guided Gaussian Splatting for Physically-based Inverse Rendering
ICCV 2025
8
citations
DreamFuse: Adaptive Image Fusion with Diffusion Transformer
ICCV 2025
5
citations
DreamLayer: Simultaneous Multi-Layer Generation via Diffusion Model
ICCV 2025
5
citations
UniCell: Universal Cell Nucleus Classification via Prompt Learning
AAAI 2024arXiv
4
citations
Empowering Large Language Models with 3D Situation Awareness
CVPR 2025
3
citations
Hierarchically Controlled Deformable 3D Gaussians for Talking Head Synthesis
AAAI 2025
2
citations
Sim-DETR: Unlock DETR for Temporal Sentence Grounding
ICCV 2025
1
citations
Bridging Knowledge Gap Between Image Inpainting and Large-Area Visible Watermark Removal
AAAI 2025
1
citations
DeepShield: Fortifying Deepfake Video Detection with Local and Global Forgery Analysis
ICCV 2025
1
citations
FakeRadar: Probing Forgery Outliers to Detect Unknown Deepfake Videos
ICCV 2025
1
citations
Free-MoRef: Instantly Multiplexing Context Perception Capabilities of Video-MLLMs within Single Inference
ICCV 2025
1
citations
Deep Contrast Learning for Salient Object Detection
CVPR 2016
0
citations
Attention-Aware Face Hallucination via Deep Reinforcement Learning
CVPR 2017arXiv
0
citations
Instance-Level Salient Object Segmentation
CVPR 2017arXiv
0
citations
Flow Guided Recurrent Neural Encoder for Video Salient Object Detection
CVPR 2018
0
citations
Interpretable Video Captioning via Trajectory Structured Localization
CVPR 2018
0
citations
Visual Question Reasoning on General Dependency Tree
CVPR 2018arXiv
0
citations
Cross-Modal Relationship Inference for Grounding Referring Expressions
CVPR 2019
0
citations
ClusterNet: Deep Hierarchical Cluster Network With Rigorously Rotation-Invariant Representation for Point Cloud Analysis
CVPR 2019
0
citations
Referring Image Segmentation via Cross-Modal Progressive Comprehension
CVPR 2020arXiv
0
citations
A Real-Time Cross-Modality Correlation Filtering Method for Referring Expression Comprehension
CVPR 2020arXiv
0
citations
Graph-Structured Referring Expression Reasoning in the Wild
CVPR 2020arXiv
0
citations
Scene-Intuitive Agent for Remote Embodied Visual Grounding
CVPR 2021arXiv
0
citations
Bottom-Up Shift and Reasoning for Referring Image Segmentation
CVPR 2021
0
citations
Cross-Modal Collaborative Representation Learning and a Large-Scale RGBT Benchmark for Crowd Counting
CVPR 2021arXiv
0
citations
Cross-Domain Adaptive Clustering for Semi-Supervised Domain Adaptation
CVPR 2021arXiv
0
citations
Collaborative Spatial-Temporal Modeling for Language-Queried Video Actor Segmentation
CVPR 2021arXiv
0
citations
X-Trans2Cap: Cross-Modal Knowledge Transfer Using Transformer for 3D Dense Captioning
CVPR 2022
0
citations
Dual Adversarial Adaptation for Cross-Device Real-World Image Super-Resolution
CVPR 2022arXiv
0
citations
Identity-Preserving Talking Face Generation With Landmark and Appearance Priors
CVPR 2023arXiv
0
citations
Being Comes From Not-Being: Open-Vocabulary Text-to-Motion Generation With Wordless Training
CVPR 2023
0
citations
Parametric Implicit Face Representation for Audio-Driven Facial Reenactment
CVPR 2023
0
citations
Improved Distribution Matching for Dataset Condensation
CVPR 2023
0
citations
SCoDA: Domain Adaptive Shape Completion for Real Scans
CVPR 2023arXiv
0
citations
Semi-DETR: Semi-Supervised Object Detection With Detection Transformers
CVPR 2023
0
citations
Divide and Adapt: Active Domain Adaptation via Customized Learning
CVPR 2023
0
citations
Advancing Visual Grounding With Scene Knowledge: Benchmark and Method
CVPR 2023
0
citations
Multi-Label Image Recognition by Recurrently Discovering Attentional Regions
ICCV 2017arXiv
0
citations
Larger Norm More Transferable: An Adaptive Feature Norm Approach for Unsupervised Domain Adaptation
ICCV 2019
0
citations
Crowd Counting With Deep Structured Scale Integration Network
ICCV 2019
0
citations
Semi-Supervised Skin Detection by Network With Mutual Guidance
ICCV 2019
0
citations
Fashion Retrieval via Graph Reasoning Networks on a Similarity Pyramid
ICCV 2019
0
citations
Dynamic Graph Attention for Referring Expression Comprehension
ICCV 2019
0
citations
VTON 360: High-Fidelity Virtual Try-On from Any Viewing Direction
CVPR 2025
0
citations
Semi-Supervised Video Salient Object Detection Using Pseudo-Labels
ICCV 2019
0
citations
Towards Interpretable Deep Networks for Monocular Depth Estimation
ICCV 2021arXiv
0
citations
LapsCore: Language-Guided Person Search via Color Reasoning
ICCV 2021
0
citations
Trash To Treasure: Harvesting OOD Data With Cross-Modal Matching for Open-Set Semi-Supervised Learning
ICCV 2021arXiv
0
citations
Enhanced Soft Label for Semi-Supervised Semantic Segmentation
ICCV 2023
0
citations
SkeletonMAE: Graph-based Masked Autoencoder for Skeleton Sequence Pre-training
ICCV 2023arXiv
0
citations
Affine-Consistent Transformer for Multi-Class Cell Nuclei Detection
ICCV 2023
0
citations
Bridging Vision and Language Encoders: Parameter-Efficient Tuning for Referring Image Segmentation
ICCV 2023arXiv
0
citations
Gradient-based Sampling for Class Imbalanced Semi-supervised Object Detection
ICCV 2023
0
citations
RankMatch: Fostering Confidence and Consistency in Learning with Noisy Labels
ICCV 2023
0
citations
Towards Real-World Burst Image Super-Resolution: Benchmark and Method
ICCV 2023
0
citations
Towards Unifying Medical Vision-and-Language Pre-Training via Soft Prompts
ICCV 2023arXiv
0
citations
Linguistic Structure Guided Context Modeling for Referring Image Segmentation
ECCV 2020
0
citations
Collaborative Training between Region Proposal Localization and Classification for Domain Adaptive Object Detection
ECCV 2020
0
citations
Peeking into occluded joints: A novel framework for crowd pose estimation
ECCV 2020
0
citations
Propagating Over Phrase Relations for One-Stage Visual Grounding
ECCV 2020
0
citations
Neighborhood Collective Estimation for Noisy Label Identification and Correction
ECCV 2022
0
citations
Centrality and Consistency: Two-Stage Clean Samples Identification for Learning with Instance-Dependent Noisy Labels
ECCV 2022
0
citations
Motion Guided Attention for Video Salient Object Detection
ICCV 2019
0
citations
LLM-driven Multimodal and Multi-Identity Listening Head Generation
CVPR 2025
0
citations
DAGSM: Disentangled Avatar Generation with GS-enhanced Mesh
CVPR 2025
0
citations
DSPNet: Dual-vision Scene Perception for Robust 3D Question Answering
CVPR 2025
0
citations
Towards Long-Horizon Vision-Language Navigation: Platform, Benchmark and Method
CVPR 2025
0
citations
AdaDrive: Self-Adaptive Slow-Fast System for Language-Grounded Autonomous Driving
ICCV 2025
0
citations
VLDrive: Vision-Augmented Lightweight MLLMs for Efficient Language-grounded Autonomous Driving
ICCV 2025
0
citations
Beyond the Destination: A Novel Benchmark for Exploration-Aware Embodied Question Answering
ICCV 2025
0
citations
GlassWizard: Harvesting Diffusion Priors for Glass Surface Detection
ICCV 2025
0
citations
LaneDiffusion: Improving Centerline Graph Learning via Prior Injected BEV Feature Generation
ICCV 2025
0
citations
FedDiv: Collaborative Noise Filtering for Federated Learning with Noisy Labels
AAAI 2024arXiv
0
citations
Variance-Insensitive and Target-Preserving Mask Refinement for Interactive Image Segmentation
AAAI 2024arXiv
0
citations
Removing Interference and Recovering Content Imaginatively for Visible Watermark Removal
AAAI 2024
0
citations
Customize your NeRF: Adaptive Source Driven 3D Scene Editing via Local-Global Iterative Training
CVPR 2024
0
citations
Open-Vocabulary Segmentation with Semantic-Assisted Calibration
CVPR 2024
0
citations
Learning Background Prompts to Discover Implicit Knowledge for Open Vocabulary Object Detection
CVPR 2024
0
citations
Visual Saliency Based on Multiscale Deep Features
CVPR 2015
0
citations
Divide and Contrast: Source-free Domain Adaptation via Adaptive Contrastive Learning
NeurIPS 2022
0
citations
Multivariate-Information Adversarial Ensemble for Scalable Joint Distribution Matching
ICML 2019
0
citations