Dacheng Tao

217
Papers
1,232
Total Citations

Papers (217)

MUlti-Store Tracker (MUSTer): A Cognitive Psychology Inspired Approach to Object Tracking

CVPR 2015
637
citations

R1-VL: Learning to Reason with Multimodal Large Language Models via Step-wise Group Relative Policy Optimization

ICCV 2025
206
citations

CNNpack: Packing Convolutional Neural Networks in the Frequency Domain

NeurIPS 2016
197
citations

Divide, Conquer and Combine: A Training-Free Framework for High-Resolution Image Perception in Multimodal Large Language Models

AAAI 2025
73
citations

Revisiting Backdoor Attacks against Large Vision-Language Models from Domain Shift

CVPR 2025
25
citations

Revisiting Plasticity in Visual Reinforcement Learning: Data, Modules and Training Stages

ICLR 2024
25
citations

SimDistill: Simulated Multi-Modal Distillation for BEV 3D Object Detection

AAAI 2024arXiv
24
citations

One More Step: A Versatile Plug-and-Play Module for Rectifying Diffusion Schedule Flaws and Enhancing Low-Frequency Controls

CVPR 2024
9
citations

Synergy of Sight and Semantics: Visual Intention Understanding with CLIP

ECCV 2024
7
citations

MMReason: An Open-Ended Multi-Modal Multi-Step Reasoning Benchmark for MLLMs Toward AGI

ICCV 2025
7
citations

Network Sparsity Unlocks the Scaling Potential of Deep Reinforcement Learning

ICML 2025
6
citations

Free-Form Motion Control: Controlling the 6D Poses of Camera and Objects in Video Generation

ICCV 2025
4
citations

Mastering Massive Multi-Task Reinforcement Learning via Mixture-of-Expert Decision Transformer

ICML 2025
4
citations

ICLShield: Exploring and Mitigating In-Context Learning Backdoor Attacks

ICML 2025
2
citations

Learning system dynamics without forgetting

ICLR 2025
2
citations

Adaptive Defense against Harmful Fine-Tuning for Large Language Models via Bayesian Data Scheduler

NeurIPS 2025
2
citations

AiDE-Q: Synthetic Labeled Datasets Can Enhance Learning Models for Quantum Property Estimation

NeurIPS 2025
1
citations

LLM Data Selection and Utilization via Dynamic Bi-level Optimization

ICML 2025
1
citations

Sparse Model Inversion: Efficient Inversion of Vision Transformers for Data-Free Applications

ICML 2024
0
citations

Merging Multi-Task Models via Weight-Ensembling Mixture of Experts

ICML 2024
0
citations

Generalization Analysis of Stochastic Weight Averaging with General Sampling

ICML 2024
0
citations

Task Groupings Regularization: Data-Free Meta-Learning with Heterogeneous Pre-trained Models

ICML 2024
0
citations

Representation Surgery for Multi-Task Model Merging

ICML 2024
0
citations

Confronting Reward Overoptimization for Diffusion Models: A Perspective of Inductive and Primacy Biases

ICML 2024
0
citations

Saliency Propagation From Simple to Difficult

CVPR 2015
0
citations

FaLRR: A Fast Low Rank Representation Solver

CVPR 2015
0
citations

A Maximum Entropy Feature Descriptor for Age Invariant Face Recognition

CVPR 2015
0
citations

Occlusion Boundary Detection via Deep Exploration of Context

CVPR 2016
0
citations

Part-Stacked CNN for Fine-Grained Visual Categorization

CVPR 2016
0
citations

Conditional Graphical Lasso for Multi-Label Image Classification

CVPR 2016
0
citations

Multilinear Hyperplane Hashing

CVPR 2016
0
citations

Improving Training of Deep Neural Networks via Singular Value Bounding

CVPR 2017arXiv
0
citations

On Compressing Deep Models by Low Rank and Sparse Decomposition

CVPR 2017
0
citations

Geometry-Aware Scene Text Detection With Instance Transformation Network

CVPR 2018
0
citations

Deep Ordinal Regression Network for Monocular Depth Estimation

CVPR 2018arXiv
0
citations

Self-Supervised Adversarial Hashing Networks for Cross-Modal Retrieval

CVPR 2018arXiv
0
citations

An Efficient and Provable Approach for Mixture Proportion Estimation Using Linear Independence Assumption

CVPR 2018
0
citations

LoRA Recycle: Unlocking Tuning-Free Few-Shot Adaptability in Visual Foundation Models by Recycling Pre-Tuned LoRAs

CVPR 2025
0
citations

Geometry-Consistent Generative Adversarial Networks for One-Sided Unsupervised Domain Mapping

CVPR 2019
0
citations

DistillHash: Unsupervised Deep Hashing by Distilling Data Pairs

CVPR 2019
0
citations

On Exploring Undetermined Relationships for Visual Relationship Detection

CVPR 2019
0
citations

Deep Modular Co-Attention Networks for Visual Question Answering

CVPR 2019
0
citations

World From Blur

CVPR 2019
0
citations

Geometry-Aware Symmetric Domain Adaptation for Monocular Depth Estimation

CVPR 2019
0
citations

Self-Supervised Representation Learning by Rotation Feature Decoupling

CVPR 2019
0
citations

Image-Question-Answer Synergistic Network for Visual Dialog

CVPR 2019
0
citations

Fast Spatio-Temporal Residual Network for Video Super-Resolution

CVPR 2019
0
citations

GPS-Net: Graph Property Sensing Network for Scene Graph Generation

CVPR 2020
0
citations

Recurrent Feature Reasoning for Image Inpainting

CVPR 2020arXiv
0
citations

On Positive-Unlabeled Classification in GAN

CVPR 2020arXiv
0
citations

Distilling Knowledge From Graph Convolutional Networks

CVPR 2020arXiv
0
citations

Learning Oracle Attention for High-Fidelity Face Completion

CVPR 2020arXiv
0
citations

Syntax-Aware Action Targeting for Video Captioning

CVPR 2020
0
citations

Context Aware Graph Convolution for Skeleton-Based Action Recognition

CVPR 2020
0
citations

FeatureFlow: Robust Video Interpolation via Structure-to-Texture Generation

CVPR 2020
0
citations

Learning Unseen Concepts via Hierarchical Decomposition and Composition

CVPR 2020
0
citations

PuppeteerGAN: Arbitrary Portrait Animation With Semantic-Aware Appearance Transformation

CVPR 2020
0
citations

AdderSR: Towards Energy Efficient Image Super-Resolution

CVPR 2021arXiv
0
citations

Online Multiple Object Tracking With Cross-Task Synergy

CVPR 2021arXiv
0
citations

Scene Essence

CVPR 2021
0
citations

HourNAS: Extremely Fast Neural Architecture Search Through an Hourglass Lens

CVPR 2021arXiv
0
citations

Tree-Like Decision Distillation

CVPR 2021
0
citations

Learning Progressive Point Embeddings for 3D Point Cloud Generation

CVPR 2021
0
citations

Turning Frequency to Resolution: Video Super-Resolution via Event Cameras

CVPR 2021
0
citations

Glance and Gaze: Inferring Action-Aware Points for One-Stage Human-Object Interaction Detection

CVPR 2021arXiv
0
citations

Where and What? Examining Interpretable Disentangled Representations

CVPR 2021arXiv
0
citations

Detecting Human-Object Interaction via Fabricated Compositional Learning

CVPR 2021arXiv
0
citations

Affordance Transfer Learning for Human-Object Interaction Detection

CVPR 2021arXiv
0
citations

Manifold Regularized Dynamic Network Pruning

CVPR 2021arXiv
0
citations

Amalgamating Knowledge From Heterogeneous Graph Neural Networks

CVPR 2021
0
citations

Contrastive Boundary Learning for Point Cloud Segmentation

CVPR 2022arXiv
0
citations

Alleviating Semantics Distortion in Unsupervised Low-Level Image-to-Image Translation via Structure Consistency Constraint

CVPR 2022
0
citations

BatchFormer: Learning To Explore Sample Relationships for Robust Representation Learning

CVPR 2022arXiv
0
citations

DearKD: Data-Efficient Early Knowledge Distillation for Vision Transformers

CVPR 2022arXiv
0
citations

GMFlow: Learning Optical Flow via Global Matching

CVPR 2022arXiv
0
citations

Recurrent Glimpse-Based Decoder for Detection With Transformer

CVPR 2022arXiv
0
citations

Learning To Collaborate in Decentralized Learning of Personalized Models

CVPR 2022
0
citations

ART-Point: Improving Rotation Robustness of Point Cloud Classifiers via Adversarial Rotation

CVPR 2022
0
citations

Source-Free Domain Adaptation via Distribution Estimation

CVPR 2022arXiv
0
citations

Distillation Using Oracle Queries for Transformer-Based Human-Object Interaction Detection

CVPR 2022
0
citations

Defensive Patches for Robust Recognition in the Physical World

CVPR 2022arXiv
0
citations

HL-Net: Heterophily Learning Network for Scene Graph Generation

CVPR 2022
0
citations

Modeling Image Composition for Complex Scene Generation

CVPR 2022arXiv
0
citations

Learning Affordance Grounding From Exocentric Images

CVPR 2022arXiv
0
citations

Few-Shot Backdoor Defense Using Shapley Estimation

CVPR 2022arXiv
0
citations

Patch Slimming for Efficient Vision Transformers

CVPR 2022arXiv
0
citations

RU-Net: Regularized Unrolling Network for Scene Graph Generation

CVPR 2022
0
citations

Continual Learning With Lifelong Vision Transformer

CVPR 2022
0
citations

Self-Augmented Unpaired Image Dehazing via Density and Depth Decomposition

CVPR 2022
0
citations

FIBA: Frequency-Injection Based Backdoor Attack in Medical Image Analysis

CVPR 2022arXiv
0
citations

Bridged Transformer for Vision and Point Cloud 3D Object Detection

CVPR 2022
0
citations

Fine-Tuning Global Model via Data-Free Knowledge Distillation for Non-IID Federated Learning

CVPR 2022arXiv
0
citations

Dynamic Focus-Aware Positional Queries for Semantic Segmentation

CVPR 2023arXiv
0
citations

Leverage Interactive Affinity for Affordance Learning

CVPR 2023
0
citations

Upcycling Models Under Domain and Category Shift

CVPR 2023
0
citations

Learnable Skeleton-Aware 3D Point Cloud Sampling

CVPR 2023
0
citations

CLAMP: Prompt-Based Contrastive Learning for Connecting Language and Animal Pose

CVPR 2023arXiv
0
citations

Exploring the Relationship Between Architectural Design and Adversarially Robust Generalization

CVPR 2023
0
citations

Generating Holistic 3D Human Motion From Speech

CVPR 2023arXiv
0
citations

Architecture, Dataset and Model-Scale Agnostic Data-Free Meta-Learning

CVPR 2023arXiv
0
citations

DeepSolo: Let Transformer Decoder With Explicit Points Solo for Text Spotting

CVPR 2023arXiv
0
citations

Make Landscape Flatter in Differentially Private Federated Learning

CVPR 2023arXiv
0
citations

From Images to Textual Prompts: Zero-Shot Visual Question Answering With Frozen Large Language Models

CVPR 2023
0
citations

Deep Graph Reprogramming

CVPR 2023arXiv
0
citations

TriDet: Temporal Action Detection With Relative Boundary Modeling

CVPR 2023arXiv
0
citations

Referring Image Matting

CVPR 2023arXiv
0
citations

Augmenting Strong Supervision Using Web Data for Fine-Grained Categorization

ICCV 2015
0
citations

Multi-Modal Factorized Bilinear Pooling With Co-Attention Learning for Visual Question Answering

ICCV 2017arXiv
0
citations

Centered Weight Normalization in Accelerating Training of Deep Neural Networks

ICCV 2017
0
citations

A Coarse-Fine Network for Keypoint Localization

ICCV 2017
0
citations

A Joint Intrinsic-Extrinsic Prior Model for Retinex

ICCV 2017
0
citations

Self-Supervised Representation Learning From Multi-Domain Data

ICCV 2019
0
citations

Approximated Bilinear Modules for Temporal Modeling

ICCV 2019
0
citations

Asymmetric Cross-Guided Attention Network for Actor and Action Video Segmentation From Natural Language Query

ICCV 2019
0
citations

Progressive Reconstruction of Visual Structure for Image Inpainting

ICCV 2019
0
citations

Deep Reinforcement Active Learning for Human-in-the-Loop Person Re-Identification

ICCV 2019
0
citations

Deep Metric Learning With Tuplet Margin Loss

ICCV 2019
0
citations

Not All Parts Are Created Equal: 3D Pose Estimation by Modeling Bi-Directional Dependencies of Body Parts

ICCV 2019
0
citations

Learning a Mixture of Granularity-Specific Experts for Fine-Grained Categorization

ICCV 2019
0
citations

Collect and Select: Semantic Alignment Metric Learning for Few-Shot Learning

ICCV 2019
0
citations

Out-of-Boundary View Synthesis Towards Full-Frame Video Stabilization

ICCV 2021arXiv
0
citations

Meta-Aggregator: Learning To Aggregate for 1-Bit Graph Neural Networks

ICCV 2021
0
citations

Adaptive Curriculum Learning

ICCV 2021
0
citations

SynFace: Face Recognition With Synthetic Data

ICCV 2021arXiv
0
citations

Stochastic Partial Swap: Enhanced Model Generalization and Interpretability for Fine-Grained Recognition

ICCV 2021
0
citations

TGRNet: A Table Graph Reconstruction Network for Table Structure Recognition

ICCV 2021arXiv
0
citations

DDS2M: Self-Supervised Denoising Diffusion Spatio-Spectral Model for Hyperspectral Image Restoration

ICCV 2023arXiv
0
citations

Exploring Temporal Concurrency for Video-Language Representation Learning

ICCV 2023
0
citations

Domain Specified Optimization for Deployment Authorization

ICCV 2023
0
citations

Sensitivity-Aware Visual Parameter-Efficient Fine-Tuning

ICCV 2023arXiv
0
citations

Knowledge-Aware Federated Active Learning with Non-IID Data

ICCV 2023arXiv
0
citations

Class-Aware Patch Embedding Adaptation for Few-Shot Image Classification

ICCV 2023
0
citations

Short-Term and Long-Term Context Aggregation Network for Video Inpainting

ECCV 2020
0
citations

Hallucinating Visual Instances in Total Absentia

ECCV 2020
0
citations

Learning Disentangled Representations with Latent Variation Predictability

ECCV 2020
0
citations

Symbiotic Adversarial Learning for Attribute-based Person Search

ECCV 2020
0
citations

Visual Compositional Learning for Human-Object Interaction Detection

ECCV 2020
0
citations

Spatiotemporal Attacks for Embodied Agents

ECCV 2020
0
citations

Polysemy Deciphering Network for Human-Object Interaction Detection

ECCV 2020
0
citations

Learning Propagation Rules for Attribution Map Generation

ECCV 2020
0
citations

On Dropping Clusters to Regularize Graph Convolutional Neural Networks

ECCV 2020
0
citations

Learning Graph Neural Networks for Image Style Transfer

ECCV 2022
0
citations

Towards Data-Efficient Detection Transformers

ECCV 2022
0
citations

ReAct: Temporal Action Detection with Relational Queries

ECCV 2022
0
citations

Online Continual Learning with Contrastive Vision Transformer

ECCV 2022
0
citations

VSA: Learning Varied-Size Window Attention in Vision Transformers

ECCV 2022
0
citations

Balancing Stability and Plasticity through Advanced Null Space in Continual Learning

ECCV 2022
0
citations

Discovering Human-Object Interaction Concepts via Self-Compositional Learning

ECCV 2022
0
citations

PolyphonicFormer: Unified Query Learning for Depth-Aware Video Panoptic Segmentation

ECCV 2022
0
citations

Panoptic-PartFormer: Learning a Unified Model for Panoptic Part Segmentation

ECCV 2022
0
citations

RegionCL: Exploring Contrastive Region Pairs for Self-Supervised Representation Learning

ECCV 2022
0
citations

BMD: A General Class-Balanced Multicentric Dynamic Prototype Strategy for Source-Free Domain Adaptation

ECCV 2022
0
citations

"Fashionformer: A Simple, Effective and Unified Baseline for Human Fashion Segmentation and Recognition"

ECCV 2022
0
citations

"Towards Scale-Aware, Robust, and Generalizable Unsupervised Monocular Depth Estimation by Integrating IMU Motion Dynamics"

ECCV 2022
0
citations

ST-P3: End-to-End Vision-Based Autonomous Driving via Spatial-Temporal Feature Learning

ECCV 2022
0
citations

"JPerceiver: Joint Perception Network for Depth, Pose and Layout Estimation in Driving Scenes"

ECCV 2022
0
citations

MirrorGAN: Learning Text-To-Image Generation by Redescription

CVPR 2019
0
citations

Neuron: Learning Context-Aware Evolving Representations for Zero-Shot Skeleton Action Recognition

CVPR 2025
0
citations

Harnessing Text-to-Image Diffusion Models for Point Cloud Self-Supervised Learning

ICCV 2025
0
citations

CopyrightShield: Enhancing Diffusion Model Security Against Copyright Infringement Attacks

ICCV 2025
0
citations

Rethink Sparse Signals for Pose-guided Text-to-image Generation

ICCV 2025
0
citations

Analytic Energy-Guided Policy Optimization for Offline Reinforcement Learning

NeurIPS 2025
0
citations

Improving Complex Reasoning over Knowledge Graph with Logic-Aware Curriculum Tuning

AAAI 2025
0
citations

Modeling All Response Surfaces in One for Conditional Search Spaces

AAAI 2025
0
citations

TD²-Net: Toward Denoising and Debiasing for Video Scene Graph Generation

AAAI 2024
0
citations

Multi-Step Denoising Scheduled Sampling: Towards Alleviating Exposure Bias for Diffusion Models

AAAI 2024
0
citations

Sheared Backpropagation for Fine-tuning Foundation Models

CVPR 2024
0
citations

UniMix: Towards Domain Adaptive and Generalizable LiDAR Semantic Segmentation in Adverse Weather

CVPR 2024
0
citations

FREE: Faster and Better Data-Free Meta-Learning

CVPR 2024
0
citations

Local-consistent Transformation Learning for Rotation-invariant Point Cloud Analysis

CVPR 2024
0
citations

Learn from Downstream and Be Yourself in Multimodal Large Language Models Fine-Tuning

ICML 2025
0
citations

HarmoDT: Harmony Multi-Task Decision Transformer for Offline Reinforcement Learning

ICML 2024
0
citations

Q-value Regularized Transformer for Offline Reinforcement Learning

ICML 2024
0
citations

Towards Theoretical Understandings of Self-Consuming Generative Models

ICML 2024
0
citations

Learning Versatile Filters for Efficient Convolutional Neural Networks

NeurIPS 2018
0
citations

Dual Swap Disentangling

NeurIPS 2018
0
citations

LIIR: Learning Individual Intrinsic Reward in Multi-Agent Reinforcement Learning

NeurIPS 2019
0
citations

Theoretical Analysis of Adversarial Learning: A Minimax Approach

NeurIPS 2019
0
citations

Likelihood-Free Overcomplete ICA and Applications In Causal Discovery

NeurIPS 2019arXiv
0
citations

Category Anchor-Guided Unsupervised Domain Adaptation for Semantic Segmentation

NeurIPS 2019
0
citations

Learning from Bad Data via Generation

NeurIPS 2019
0
citations

Positive-Unlabeled Compression on the Cloud

NeurIPS 2019
0
citations

Learn, Imagine and Create: Text-to-Image Generation from Prior Knowledge

NeurIPS 2019
0
citations

Control Batch Size and Learning Rate to Generalize Well: Theoretical and Empirical Evidence

NeurIPS 2019
0
citations

Auto Learning Attention

NeurIPS 2020
0
citations

Searching for Low-Bit Weights in Quantized Neural Networks

NeurIPS 2020
0
citations

Part-dependent Label Noise: Towards Instance-dependent Label Noise

NeurIPS 2020
0
citations

SCOP: Scientific Control for Reliable Neural Network Pruning

NeurIPS 2020
0
citations

Video Frame Interpolation without Temporal Priors

NeurIPS 2020
0
citations

Hard Example Generation by Texture Synthesis for Cross-domain Shape Similarity Learning

NeurIPS 2020
0
citations

Domain Generalization via Entropy Regularization

NeurIPS 2020
0
citations

Class-Disentanglement and Applications in Adversarial Detection and Defense

NeurIPS 2021
0
citations

Gauge Equivariant Transformer

NeurIPS 2021
0
citations

ViTAE: Vision Transformer Advanced by Exploring Intrinsic Inductive Bias

NeurIPS 2021
0
citations

CGLB: Benchmark Tasks for Continual Graph Learning

NeurIPS 2022
0
citations

APT-36K: A Large-scale Benchmark for Animal Pose Estimation and Tracking

NeurIPS 2022
0
citations

Benefits of Permutation-Equivariance in Auction Mechanisms

NeurIPS 2022
0
citations

Escaping from the Barren Plateau via Gaussian Initializations in Deep Variational Quantum Circuits

NeurIPS 2022
0
citations

Adversarial Auto-Augment with Label Preservation: A Representation Learning Principle Guided Approach

NeurIPS 2022
0
citations

Make Sharpness-Aware Minimization Stronger: A Sparsified Perturbation Approach

NeurIPS 2022
0
citations

Inducing Neural Collapse in Imbalanced Learning: Do We Really Need a Learnable Classifier at the End of Deep Neural Network?

NeurIPS 2022
0
citations

ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation

NeurIPS 2022
0
citations

VanillaNet: the Power of Minimalism in Deep Learning

NeurIPS 2023
0
citations

SAMRS: Scaling-up Remote Sensing Segmentation Dataset with Segment Anything Model

NeurIPS 2023
0
citations

Extending the Design Space of Graph Neural Networks by Rethinking Folklore Weisfeiler-Lehman

NeurIPS 2023
0
citations

MAG-GNN: Reinforcement Learning Boosted Graph Neural Network

NeurIPS 2023
0
citations

ConDaFormer: Disassembled Transformer with Local Structure Enhancement for 3D Point Cloud Understanding

NeurIPS 2023
0
citations

Stability and Generalization of the Decentralized Stochastic Gradient Descent Ascent Algorithm

NeurIPS 2023
0
citations

Cocktail: Mixing Multi-Modality Control for Text-Conditional Image Generation

NeurIPS 2023
0
citations

Domain Re-Modulation for Few-Shot Generative Domain Adaptation

NeurIPS 2023
0
citations

Learning Better with Less: Effective Augmentation for Sample-Efficient Visual Reinforcement Learning

NeurIPS 2023
0
citations

All Points Matter: Entropy-Regularized Distribution Alignment for Weakly-supervised 3D Segmentation

NeurIPS 2023
0
citations

Understanding How Consistency Works in Federated Learning via Stage-wise Relaxed Initialization

NeurIPS 2023
0
citations

Discovering Temporal Causal Relations from Subsampled Data

ICML 2015
0
citations

Domain Adaptation with Conditional Transferable Components

ICML 2016
0
citations

Algorithmic Stability and Hypothesis Complexity

ICML 2017
0
citations

Beyond Filters: Compact Feature Map for Portable Deep Model

ICML 2017
0
citations