Guanbin Li

87
Papers
131
Total Citations

Papers (87)

AlignSAM: Aligning Segment Anything Model to Open Context via Reinforcement Learning

CVPR 2024
24
citations

NeRF-HuGS: Improved Neural Radiance Fields in Non-static Scenes Using Heuristics-Guided Segmentation

CVPR 2024
20
citations

Decoupled Pseudo-labeling for Semi-Supervised Monocular 3D Object Detection

CVPR 2024
19
citations

OVER-NAV: Elevating Iterative Vision-and-Language Navigation with Open-Vocabulary Detection and StructurEd Representation

CVPR 2024
15
citations

Cell Graph Transformer for Nuclei Classification

AAAI 2024arXiv
12
citations

Rethinking Query-based Transformer for Continual Image Segmentation

CVPR 2025
9
citations

GeoSplatting: Towards Geometry Guided Gaussian Splatting for Physically-based Inverse Rendering

ICCV 2025
8
citations

DreamFuse: Adaptive Image Fusion with Diffusion Transformer

ICCV 2025
5
citations

DreamLayer: Simultaneous Multi-Layer Generation via Diffusion Model

ICCV 2025
5
citations

UniCell: Universal Cell Nucleus Classification via Prompt Learning

AAAI 2024arXiv
4
citations

Empowering Large Language Models with 3D Situation Awareness

CVPR 2025
3
citations

Hierarchically Controlled Deformable 3D Gaussians for Talking Head Synthesis

AAAI 2025
2
citations

Sim-DETR: Unlock DETR for Temporal Sentence Grounding

ICCV 2025
1
citations

Bridging Knowledge Gap Between Image Inpainting and Large-Area Visible Watermark Removal

AAAI 2025
1
citations

DeepShield: Fortifying Deepfake Video Detection with Local and Global Forgery Analysis

ICCV 2025
1
citations

FakeRadar: Probing Forgery Outliers to Detect Unknown Deepfake Videos

ICCV 2025
1
citations

Free-MoRef: Instantly Multiplexing Context Perception Capabilities of Video-MLLMs within Single Inference

ICCV 2025
1
citations

Deep Contrast Learning for Salient Object Detection

CVPR 2016
0
citations

Attention-Aware Face Hallucination via Deep Reinforcement Learning

CVPR 2017arXiv
0
citations

Instance-Level Salient Object Segmentation

CVPR 2017arXiv
0
citations

Flow Guided Recurrent Neural Encoder for Video Salient Object Detection

CVPR 2018
0
citations

Interpretable Video Captioning via Trajectory Structured Localization

CVPR 2018
0
citations

Visual Question Reasoning on General Dependency Tree

CVPR 2018arXiv
0
citations

Cross-Modal Relationship Inference for Grounding Referring Expressions

CVPR 2019
0
citations

ClusterNet: Deep Hierarchical Cluster Network With Rigorously Rotation-Invariant Representation for Point Cloud Analysis

CVPR 2019
0
citations

Referring Image Segmentation via Cross-Modal Progressive Comprehension

CVPR 2020arXiv
0
citations

A Real-Time Cross-Modality Correlation Filtering Method for Referring Expression Comprehension

CVPR 2020arXiv
0
citations

Graph-Structured Referring Expression Reasoning in the Wild

CVPR 2020arXiv
0
citations

Scene-Intuitive Agent for Remote Embodied Visual Grounding

CVPR 2021arXiv
0
citations

Bottom-Up Shift and Reasoning for Referring Image Segmentation

CVPR 2021
0
citations

Cross-Modal Collaborative Representation Learning and a Large-Scale RGBT Benchmark for Crowd Counting

CVPR 2021arXiv
0
citations

Cross-Domain Adaptive Clustering for Semi-Supervised Domain Adaptation

CVPR 2021arXiv
0
citations

Collaborative Spatial-Temporal Modeling for Language-Queried Video Actor Segmentation

CVPR 2021arXiv
0
citations

X-Trans2Cap: Cross-Modal Knowledge Transfer Using Transformer for 3D Dense Captioning

CVPR 2022
0
citations

Dual Adversarial Adaptation for Cross-Device Real-World Image Super-Resolution

CVPR 2022arXiv
0
citations

Identity-Preserving Talking Face Generation With Landmark and Appearance Priors

CVPR 2023arXiv
0
citations

Being Comes From Not-Being: Open-Vocabulary Text-to-Motion Generation With Wordless Training

CVPR 2023
0
citations

Parametric Implicit Face Representation for Audio-Driven Facial Reenactment

CVPR 2023
0
citations

Improved Distribution Matching for Dataset Condensation

CVPR 2023
0
citations

SCoDA: Domain Adaptive Shape Completion for Real Scans

CVPR 2023arXiv
0
citations

Semi-DETR: Semi-Supervised Object Detection With Detection Transformers

CVPR 2023
0
citations

Divide and Adapt: Active Domain Adaptation via Customized Learning

CVPR 2023
0
citations

Advancing Visual Grounding With Scene Knowledge: Benchmark and Method

CVPR 2023
0
citations

Multi-Label Image Recognition by Recurrently Discovering Attentional Regions

ICCV 2017arXiv
0
citations

Larger Norm More Transferable: An Adaptive Feature Norm Approach for Unsupervised Domain Adaptation

ICCV 2019
0
citations

Crowd Counting With Deep Structured Scale Integration Network

ICCV 2019
0
citations

Semi-Supervised Skin Detection by Network With Mutual Guidance

ICCV 2019
0
citations

Fashion Retrieval via Graph Reasoning Networks on a Similarity Pyramid

ICCV 2019
0
citations

Dynamic Graph Attention for Referring Expression Comprehension

ICCV 2019
0
citations

VTON 360: High-Fidelity Virtual Try-On from Any Viewing Direction

CVPR 2025
0
citations

Semi-Supervised Video Salient Object Detection Using Pseudo-Labels

ICCV 2019
0
citations

Towards Interpretable Deep Networks for Monocular Depth Estimation

ICCV 2021arXiv
0
citations

LapsCore: Language-Guided Person Search via Color Reasoning

ICCV 2021
0
citations

Trash To Treasure: Harvesting OOD Data With Cross-Modal Matching for Open-Set Semi-Supervised Learning

ICCV 2021arXiv
0
citations

Enhanced Soft Label for Semi-Supervised Semantic Segmentation

ICCV 2023
0
citations

SkeletonMAE: Graph-based Masked Autoencoder for Skeleton Sequence Pre-training

ICCV 2023arXiv
0
citations

Affine-Consistent Transformer for Multi-Class Cell Nuclei Detection

ICCV 2023
0
citations

Bridging Vision and Language Encoders: Parameter-Efficient Tuning for Referring Image Segmentation

ICCV 2023arXiv
0
citations

Gradient-based Sampling for Class Imbalanced Semi-supervised Object Detection

ICCV 2023
0
citations

RankMatch: Fostering Confidence and Consistency in Learning with Noisy Labels

ICCV 2023
0
citations

Towards Real-World Burst Image Super-Resolution: Benchmark and Method

ICCV 2023
0
citations

Towards Unifying Medical Vision-and-Language Pre-Training via Soft Prompts

ICCV 2023arXiv
0
citations

Linguistic Structure Guided Context Modeling for Referring Image Segmentation

ECCV 2020
0
citations

Collaborative Training between Region Proposal Localization and Classification for Domain Adaptive Object Detection

ECCV 2020
0
citations

Peeking into occluded joints: A novel framework for crowd pose estimation

ECCV 2020
0
citations

Propagating Over Phrase Relations for One-Stage Visual Grounding

ECCV 2020
0
citations

Neighborhood Collective Estimation for Noisy Label Identification and Correction

ECCV 2022
0
citations

Centrality and Consistency: Two-Stage Clean Samples Identification for Learning with Instance-Dependent Noisy Labels

ECCV 2022
0
citations

Motion Guided Attention for Video Salient Object Detection

ICCV 2019
0
citations

LLM-driven Multimodal and Multi-Identity Listening Head Generation

CVPR 2025
0
citations

DAGSM: Disentangled Avatar Generation with GS-enhanced Mesh

CVPR 2025
0
citations

DSPNet: Dual-vision Scene Perception for Robust 3D Question Answering

CVPR 2025
0
citations

Towards Long-Horizon Vision-Language Navigation: Platform, Benchmark and Method

CVPR 2025
0
citations

AdaDrive: Self-Adaptive Slow-Fast System for Language-Grounded Autonomous Driving

ICCV 2025
0
citations

VLDrive: Vision-Augmented Lightweight MLLMs for Efficient Language-grounded Autonomous Driving

ICCV 2025
0
citations

Beyond the Destination: A Novel Benchmark for Exploration-Aware Embodied Question Answering

ICCV 2025
0
citations

GlassWizard: Harvesting Diffusion Priors for Glass Surface Detection

ICCV 2025
0
citations

LaneDiffusion: Improving Centerline Graph Learning via Prior Injected BEV Feature Generation

ICCV 2025
0
citations

FedDiv: Collaborative Noise Filtering for Federated Learning with Noisy Labels

AAAI 2024arXiv
0
citations

Variance-Insensitive and Target-Preserving Mask Refinement for Interactive Image Segmentation

AAAI 2024arXiv
0
citations

Removing Interference and Recovering Content Imaginatively for Visible Watermark Removal

AAAI 2024
0
citations

Customize your NeRF: Adaptive Source Driven 3D Scene Editing via Local-Global Iterative Training

CVPR 2024
0
citations

Open-Vocabulary Segmentation with Semantic-Assisted Calibration

CVPR 2024
0
citations

Learning Background Prompts to Discover Implicit Knowledge for Open Vocabulary Object Detection

CVPR 2024
0
citations

Visual Saliency Based on Multiscale Deep Features

CVPR 2015
0
citations

Divide and Contrast: Source-free Domain Adaptation via Adaptive Contrastive Learning

NeurIPS 2022
0
citations

Multivariate-Information Adversarial Ensemble for Scalable Joint Distribution Matching

ICML 2019
0
citations