Bing Li

49
Papers
99
Total Citations

Papers (49)

NARUTO: Neural Active Reconstruction from Uncertain Target Observations

CVPR 2024
27
citations

Basis Sharing: Cross-Layer Parameter Sharing for Large Language Model Compression

ICLR 2025
19
citations

PromptIQA: Boosting the Performance and Generalization for No-Reference Image Quality Assessment via Prompts

ECCV 2024
16
citations

Unleashing the Potential of the Semantic Latent Space in Diffusion Models for Image Dehazing

ECCV 2024
11
citations

Visual-Instructed Degradation Diffusion for All-in-One Image Restoration

CVPR 2025
9
citations

Benchmarking Segmentation Models with Mask-Preserved Attribute Editing

CVPR 2024
7
citations

SynFER: Towards Boosting Facial Expression Recognition with Synthetic Data

ICCV 2025
6
citations

WiFi CSI Based Temporal Activity Detection via Dual Pyramid Network

AAAI 2025
4
citations

Similar Modality Enhancement and Action Consistency Learning for Weakly Supervised Temporal Action Localization

AAAI 2025
0
citations

Union Is Strength! Unite the Power of LLMs and MLLMs for Chart Question Answering

AAAI 2025
0
citations

Federated Recommendation with Explicitly Encoding Item Bias

AAAI 2025
0
citations

Variable Importance in High-Dimensional Settings Requires Grouping

AAAI 2024
0
citations

Tune-An-Ellipse: CLIP Has Potential to Find What You Want

CVPR 2024
0
citations

How to Make Cross Encoder a Good Teacher for Efficient Image-Text Retrieval?

CVPR 2024
0
citations

Spatio-Temporal Self-Organizing Map Deep Network for Dynamic Object Detection From Videos

CVPR 2017
0
citations

Depth-Aware Stereo Video Retargeting

CVPR 2018
0
citations

Knowledge Distillation via Instance Relationship Graph

CVPR 2019
0
citations

Object Relational Graph With Teacher-Recommended Learning for Video Captioning

CVPR 2020arXiv
0
citations

Open-Book Video Captioning With Retrieve-Copy-Generate Network

CVPR 2021arXiv
0
citations

Improving Visual Grounding With Visual-Linguistic Verification and Iterative Reasoning

CVPR 2022arXiv
0
citations

EMScore: Evaluating Video Captioning via Coarse-Grained and Fine-Grained Embedding Matching

CVPR 2022arXiv
0
citations

AUNet: Learning Relations Between Action Units for Face Forgery Detection

CVPR 2023
0
citations

Learning To Exploit the Sequence-Specific Prior Knowledge for Image Processing Pipelines Optimization

CVPR 2023
0
citations

NewsNet: A Novel Dataset for Hierarchical Temporal Segmentation

CVPR 2023
0
citations

AdaptiveMix: Improving GAN Training via Feature Space Shrinkage

CVPR 2023
0
citations

ViLEM: Visual-Language Error Modeling for Image-Text Retrieval

CVPR 2023
0
citations

Channel-Wise Topology Refinement Graph Convolution for Skeleton-Based Action Recognition

ICCV 2021arXiv
0
citations

High Quality Disparity Remapping With Two-Stage Warping

ICCV 2021
0
citations

Reversing Flow for Image Restoration

CVPR 2025
0
citations

Automatic Animation of Hair Blowing in Still Portrait Photos

ICCV 2023arXiv
0
citations

Order-Prompted Tag Sequence Generation for Video Tagging

ICCV 2023
0
citations

Learning to Identify Critical States for Reinforcement Learning from Videos

ICCV 2023arXiv
0
citations

CVRecon: Rethinking 3D Geometric Feature Learning For Neural Reconstruction

ICCV 2023arXiv
0
citations

Learning to Predict Salient Faces: A Novel Visual-Audio Saliency Model

ECCV 2020
0
citations

Attention-Aware Learning for Hyperparameter Prediction in Image Processing Pipelines

ECCV 2022
0
citations

Disentangling Object Motion and Occlusion for Unsupervised Multi-Frame Monocular Depth

ECCV 2022
0
citations

Learn To Match: Automatic Matching Network Design for Visual Tracking

ICCV 2021arXiv
0
citations

Multimodal Large Language Model-Guided ISP Hyperparameter Optimization with Dynamic Preference Learning

ICCV 2025
0
citations

VisionMath: Vision-Form Mathematical Problem-Solving

ICCV 2025
0
citations

4D-Bench: Benchmarking Multi-modal Large Language Models for 4D Object Understanding

ICCV 2025
0
citations

Point Cloud Self-supervised Learning via 3D to Multi-view Masked Learner

ICCV 2025
0
citations

OmniResponse: Online Multimodal Conversational Response Generation in Dyadic Interactions

NeurIPS 2025
0
citations

SynCL: A Synergistic Training Strategy with Instance-Aware Contrastive Learning for End-to-End Multi-Camera 3D Tracking

NeurIPS 2025
0
citations

Towards More Discriminative Feature Learning in SNNs with Temporal-Self-Erasing Supervision

AAAI 2025
0
citations

Dynamically Masked Discriminator for GANs

NeurIPS 2023
0
citations

Compressed Video Prompt Tuning

NeurIPS 2023
0
citations

Exploiting Contextual Objects and Relations for 3D Visual Grounding

NeurIPS 2023
0
citations

ZoomTrack: Target-aware Non-uniform Resizing for Efficient Visual Tracking

NeurIPS 2023
0
citations

Bridging the Domain Gap: Self-Supervised 3D Scene Understanding with Foundation Models

NeurIPS 2023
0
citations