Dacheng Tao
217
Papers
1,232
Total Citations
Papers (217)
MUlti-Store Tracker (MUSTer): A Cognitive Psychology Inspired Approach to Object Tracking
CVPR 2015
637
citations
R1-VL: Learning to Reason with Multimodal Large Language Models via Step-wise Group Relative Policy Optimization
ICCV 2025
206
citations
CNNpack: Packing Convolutional Neural Networks in the Frequency Domain
NeurIPS 2016
197
citations
Divide, Conquer and Combine: A Training-Free Framework for High-Resolution Image Perception in Multimodal Large Language Models
AAAI 2025
73
citations
Revisiting Backdoor Attacks against Large Vision-Language Models from Domain Shift
CVPR 2025
25
citations
Revisiting Plasticity in Visual Reinforcement Learning: Data, Modules and Training Stages
ICLR 2024
25
citations
SimDistill: Simulated Multi-Modal Distillation for BEV 3D Object Detection
AAAI 2024
24
citations
One More Step: A Versatile Plug-and-Play Module for Rectifying Diffusion Schedule Flaws and Enhancing Low-Frequency Controls
CVPR 2024
9
citations
Synergy of Sight and Semantics: Visual Intention Understanding with CLIP
ECCV 2024
7
citations
MMReason: An Open-Ended Multi-Modal Multi-Step Reasoning Benchmark for MLLMs Toward AGI
ICCV 2025
7
citations
Network Sparsity Unlocks the Scaling Potential of Deep Reinforcement Learning
ICML 2025
6
citations
Free-Form Motion Control: Controlling the 6D Poses of Camera and Objects in Video Generation
ICCV 2025
4
citations
Mastering Massive Multi-Task Reinforcement Learning via Mixture-of-Expert Decision Transformer
ICML 2025
4
citations
ICLShield: Exploring and Mitigating In-Context Learning Backdoor Attacks
ICML 2025
2
citations
Learning system dynamics without forgetting
ICLR 2025
2
citations
Adaptive Defense against Harmful Fine-Tuning for Large Language Models via Bayesian Data Scheduler
NeurIPS 2025
2
citations
AiDE-Q: Synthetic Labeled Datasets Can Enhance Learning Models for Quantum Property Estimation
NeurIPS 2025
1
citations
LLM Data Selection and Utilization via Dynamic Bi-level Optimization
ICML 2025
1
citations
Sparse Model Inversion: Efficient Inversion of Vision Transformers for Data-Free Applications
ICML 2024
0
citations
Merging Multi-Task Models via Weight-Ensembling Mixture of Experts
ICML 2024
0
citations
Generalization Analysis of Stochastic Weight Averaging with General Sampling
ICML 2024
0
citations
Task Groupings Regularization: Data-Free Meta-Learning with Heterogeneous Pre-trained Models
ICML 2024
0
citations
Representation Surgery for Multi-Task Model Merging
ICML 2024
0
citations
Confronting Reward Overoptimization for Diffusion Models: A Perspective of Inductive and Primacy Biases
ICML 2024
0
citations
Saliency Propagation From Simple to Difficult
CVPR 2015
0
citations
FaLRR: A Fast Low Rank Representation Solver
CVPR 2015
0
citations
A Maximum Entropy Feature Descriptor for Age Invariant Face Recognition
CVPR 2015
0
citations
Occlusion Boundary Detection via Deep Exploration of Context
CVPR 2016
0
citations
Part-Stacked CNN for Fine-Grained Visual Categorization
CVPR 2016
0
citations
Conditional Graphical Lasso for Multi-Label Image Classification
CVPR 2016
0
citations
Multilinear Hyperplane Hashing
CVPR 2016
0
citations
Improving Training of Deep Neural Networks via Singular Value Bounding
CVPR 2017
0
citations
On Compressing Deep Models by Low Rank and Sparse Decomposition
CVPR 2017
0
citations
Geometry-Aware Scene Text Detection With Instance Transformation Network
CVPR 2018
0
citations
Deep Ordinal Regression Network for Monocular Depth Estimation
CVPR 2018
0
citations
Self-Supervised Adversarial Hashing Networks for Cross-Modal Retrieval
CVPR 2018
0
citations
An Efficient and Provable Approach for Mixture Proportion Estimation Using Linear Independence Assumption
CVPR 2018
0
citations
LoRA Recycle: Unlocking Tuning-Free Few-Shot Adaptability in Visual Foundation Models by Recycling Pre-Tuned LoRAs
CVPR 2025
0
citations
Geometry-Consistent Generative Adversarial Networks for One-Sided Unsupervised Domain Mapping
CVPR 2019
0
citations
DistillHash: Unsupervised Deep Hashing by Distilling Data Pairs
CVPR 2019
0
citations
On Exploring Undetermined Relationships for Visual Relationship Detection
CVPR 2019
0
citations
Deep Modular Co-Attention Networks for Visual Question Answering
CVPR 2019
0
citations
World From Blur
CVPR 2019
0
citations
Geometry-Aware Symmetric Domain Adaptation for Monocular Depth Estimation
CVPR 2019
0
citations
Self-Supervised Representation Learning by Rotation Feature Decoupling
CVPR 2019
0
citations
Image-Question-Answer Synergistic Network for Visual Dialog
CVPR 2019
0
citations
Fast Spatio-Temporal Residual Network for Video Super-Resolution
CVPR 2019
0
citations
GPS-Net: Graph Property Sensing Network for Scene Graph Generation
CVPR 2020
0
citations
Recurrent Feature Reasoning for Image Inpainting
CVPR 2020
0
citations
On Positive-Unlabeled Classification in GAN
CVPR 2020
0
citations
Distilling Knowledge From Graph Convolutional Networks
CVPR 2020
0
citations
Learning Oracle Attention for High-Fidelity Face Completion
CVPR 2020
0
citations
Syntax-Aware Action Targeting for Video Captioning
CVPR 2020
0
citations
Context Aware Graph Convolution for Skeleton-Based Action Recognition
CVPR 2020
0
citations
FeatureFlow: Robust Video Interpolation via Structure-to-Texture Generation
CVPR 2020
0
citations
Learning Unseen Concepts via Hierarchical Decomposition and Composition
CVPR 2020
0
citations
PuppeteerGAN: Arbitrary Portrait Animation With Semantic-Aware Appearance Transformation
CVPR 2020
0
citations
AdderSR: Towards Energy Efficient Image Super-Resolution
CVPR 2021
0
citations
Online Multiple Object Tracking With Cross-Task Synergy
CVPR 2021
0
citations
Scene Essence
CVPR 2021
0
citations
HourNAS: Extremely Fast Neural Architecture Search Through an Hourglass Lens
CVPR 2021
0
citations
Tree-Like Decision Distillation
CVPR 2021
0
citations
Learning Progressive Point Embeddings for 3D Point Cloud Generation
CVPR 2021
0
citations
Turning Frequency to Resolution: Video Super-Resolution via Event Cameras
CVPR 2021
0
citations
Glance and Gaze: Inferring Action-Aware Points for One-Stage Human-Object Interaction Detection
CVPR 2021
0
citations
Where and What? Examining Interpretable Disentangled Representations
CVPR 2021
0
citations
Detecting Human-Object Interaction via Fabricated Compositional Learning
CVPR 2021
0
citations
Affordance Transfer Learning for Human-Object Interaction Detection
CVPR 2021
0
citations
Manifold Regularized Dynamic Network Pruning
CVPR 2021
0
citations
Amalgamating Knowledge From Heterogeneous Graph Neural Networks
CVPR 2021
0
citations
Contrastive Boundary Learning for Point Cloud Segmentation
CVPR 2022
0
citations
Alleviating Semantics Distortion in Unsupervised Low-Level Image-to-Image Translation via Structure Consistency Constraint
CVPR 2022
0
citations
BatchFormer: Learning To Explore Sample Relationships for Robust Representation Learning
CVPR 2022
0
citations
DearKD: Data-Efficient Early Knowledge Distillation for Vision Transformers
CVPR 2022
0
citations
GMFlow: Learning Optical Flow via Global Matching
CVPR 2022
0
citations
Recurrent Glimpse-Based Decoder for Detection With Transformer
CVPR 2022
0
citations
Learning To Collaborate in Decentralized Learning of Personalized Models
CVPR 2022
0
citations
ART-Point: Improving Rotation Robustness of Point Cloud Classifiers via Adversarial Rotation
CVPR 2022
0
citations
Source-Free Domain Adaptation via Distribution Estimation
CVPR 2022
0
citations
Distillation Using Oracle Queries for Transformer-Based Human-Object Interaction Detection
CVPR 2022
0
citations
Defensive Patches for Robust Recognition in the Physical World
CVPR 2022
0
citations
HL-Net: Heterophily Learning Network for Scene Graph Generation
CVPR 2022
0
citations
Modeling Image Composition for Complex Scene Generation
CVPR 2022
0
citations
Learning Affordance Grounding From Exocentric Images
CVPR 2022
0
citations
Few-Shot Backdoor Defense Using Shapley Estimation
CVPR 2022
0
citations
Patch Slimming for Efficient Vision Transformers
CVPR 2022
0
citations
RU-Net: Regularized Unrolling Network for Scene Graph Generation
CVPR 2022
0
citations
Continual Learning With Lifelong Vision Transformer
CVPR 2022
0
citations
Self-Augmented Unpaired Image Dehazing via Density and Depth Decomposition
CVPR 2022
0
citations
FIBA: Frequency-Injection Based Backdoor Attack in Medical Image Analysis
CVPR 2022
0
citations
Bridged Transformer for Vision and Point Cloud 3D Object Detection
CVPR 2022
0
citations
Fine-Tuning Global Model via Data-Free Knowledge Distillation for Non-IID Federated Learning
CVPR 2022
0
citations
Dynamic Focus-Aware Positional Queries for Semantic Segmentation
CVPR 2023
0
citations
Leverage Interactive Affinity for Affordance Learning
CVPR 2023
0
citations
Upcycling Models Under Domain and Category Shift
CVPR 2023
0
citations
Learnable Skeleton-Aware 3D Point Cloud Sampling
CVPR 2023
0
citations
CLAMP: Prompt-Based Contrastive Learning for Connecting Language and Animal Pose
CVPR 2023
0
citations
Exploring the Relationship Between Architectural Design and Adversarially Robust Generalization
CVPR 2023
0
citations
Generating Holistic 3D Human Motion From Speech
CVPR 2023
0
citations
Architecture, Dataset and Model-Scale Agnostic Data-Free Meta-Learning
CVPR 2023
0
citations
DeepSolo: Let Transformer Decoder With Explicit Points Solo for Text Spotting
CVPR 2023
0
citations
Make Landscape Flatter in Differentially Private Federated Learning
CVPR 2023
0
citations
From Images to Textual Prompts: Zero-Shot Visual Question Answering With Frozen Large Language Models
CVPR 2023
0
citations
Deep Graph Reprogramming
CVPR 2023
0
citations
TriDet: Temporal Action Detection With Relative Boundary Modeling
CVPR 2023
0
citations
Referring Image Matting
CVPR 2023
0
citations
Augmenting Strong Supervision Using Web Data for Fine-Grained Categorization
ICCV 2015
0
citations
Multi-Modal Factorized Bilinear Pooling With Co-Attention Learning for Visual Question Answering
ICCV 2017
0
citations
Centered Weight Normalization in Accelerating Training of Deep Neural Networks
ICCV 2017
0
citations
A Coarse-Fine Network for Keypoint Localization
ICCV 2017
0
citations
A Joint Intrinsic-Extrinsic Prior Model for Retinex
ICCV 2017
0
citations
Self-Supervised Representation Learning From Multi-Domain Data
ICCV 2019
0
citations
Approximated Bilinear Modules for Temporal Modeling
ICCV 2019
0
citations
Asymmetric Cross-Guided Attention Network for Actor and Action Video Segmentation From Natural Language Query
ICCV 2019
0
citations
Progressive Reconstruction of Visual Structure for Image Inpainting
ICCV 2019
0
citations
Deep Reinforcement Active Learning for Human-in-the-Loop Person Re-Identification
ICCV 2019
0
citations
Deep Metric Learning With Tuplet Margin Loss
ICCV 2019
0
citations
Not All Parts Are Created Equal: 3D Pose Estimation by Modeling Bi-Directional Dependencies of Body Parts
ICCV 2019
0
citations
Learning a Mixture of Granularity-Specific Experts for Fine-Grained Categorization
ICCV 2019
0
citations
Collect and Select: Semantic Alignment Metric Learning for Few-Shot Learning
ICCV 2019
0
citations
Out-of-Boundary View Synthesis Towards Full-Frame Video Stabilization
ICCV 2021
0
citations
Meta-Aggregator: Learning To Aggregate for 1-Bit Graph Neural Networks
ICCV 2021
0
citations
Adaptive Curriculum Learning
ICCV 2021
0
citations
SynFace: Face Recognition With Synthetic Data
ICCV 2021
0
citations
Stochastic Partial Swap: Enhanced Model Generalization and Interpretability for Fine-Grained Recognition
ICCV 2021
0
citations
TGRNet: A Table Graph Reconstruction Network for Table Structure Recognition
ICCV 2021
0
citations
DDS2M: Self-Supervised Denoising Diffusion Spatio-Spectral Model for Hyperspectral Image Restoration
ICCV 2023
0
citations
Exploring Temporal Concurrency for Video-Language Representation Learning
ICCV 2023
0
citations
Domain Specified Optimization for Deployment Authorization
ICCV 2023
0
citations
Sensitivity-Aware Visual Parameter-Efficient Fine-Tuning
ICCV 2023
0
citations
Knowledge-Aware Federated Active Learning with Non-IID Data
ICCV 2023
0
citations
Class-Aware Patch Embedding Adaptation for Few-Shot Image Classification
ICCV 2023
0
citations
Short-Term and Long-Term Context Aggregation Network for Video Inpainting
ECCV 2020
0
citations
Hallucinating Visual Instances in Total Absentia
ECCV 2020
0
citations
Learning Disentangled Representations with Latent Variation Predictability
ECCV 2020
0
citations
Symbiotic Adversarial Learning for Attribute-based Person Search
ECCV 2020
0
citations
Visual Compositional Learning for Human-Object Interaction Detection
ECCV 2020
0
citations
Spatiotemporal Attacks for Embodied Agents
ECCV 2020
0
citations
Polysemy Deciphering Network for Human-Object Interaction Detection
ECCV 2020
0
citations
Learning Propagation Rules for Attribution Map Generation
ECCV 2020
0
citations
On Dropping Clusters to Regularize Graph Convolutional Neural Networks
ECCV 2020
0
citations
Learning Graph Neural Networks for Image Style Transfer
ECCV 2022
0
citations
Towards Data-Efficient Detection Transformers
ECCV 2022
0
citations
ReAct: Temporal Action Detection with Relational Queries
ECCV 2022
0
citations
Online Continual Learning with Contrastive Vision Transformer
ECCV 2022
0
citations
VSA: Learning Varied-Size Window Attention in Vision Transformers
ECCV 2022
0
citations
Balancing Stability and Plasticity through Advanced Null Space in Continual Learning
ECCV 2022
0
citations
Discovering Human-Object Interaction Concepts via Self-Compositional Learning
ECCV 2022
0
citations
PolyphonicFormer: Unified Query Learning for Depth-Aware Video Panoptic Segmentation
ECCV 2022
0
citations
Panoptic-PartFormer: Learning a Unified Model for Panoptic Part Segmentation
ECCV 2022
0
citations
RegionCL: Exploring Contrastive Region Pairs for Self-Supervised Representation Learning
ECCV 2022
0
citations
BMD: A General Class-Balanced Multicentric Dynamic Prototype Strategy for Source-Free Domain Adaptation
ECCV 2022
0
citations
Fashionformer: A Simple, Effective and Unified Baseline for Human Fashion Segmentation and Recognition
ECCV 2022
0
citations
Towards Scale-Aware, Robust, and Generalizable Unsupervised Monocular Depth Estimation by Integrating IMU Motion Dynamics
ECCV 2022
0
citations
ST-P3: End-to-End Vision-Based Autonomous Driving via Spatial-Temporal Feature Learning
ECCV 2022
0
citations
JPerceiver: Joint Perception Network for Depth, Pose and Layout Estimation in Driving Scenes
ECCV 2022
0
citations
MirrorGAN: Learning Text-To-Image Generation by Redescription
CVPR 2019
0
citations
Neuron: Learning Context-Aware Evolving Representations for Zero-Shot Skeleton Action Recognition
CVPR 2025
0
citations
Harnessing Text-to-Image Diffusion Models for Point Cloud Self-Supervised Learning
ICCV 2025
0
citations
CopyrightShield: Enhancing Diffusion Model Security Against Copyright Infringement Attacks
ICCV 2025
0
citations
Rethink Sparse Signals for Pose-guided Text-to-image Generation
ICCV 2025
0
citations
Analytic Energy-Guided Policy Optimization for Offline Reinforcement Learning
NeurIPS 2025
0
citations
Improving Complex Reasoning over Knowledge Graph with Logic-Aware Curriculum Tuning
AAAI 2025
0
citations
Modeling All Response Surfaces in One for Conditional Search Spaces
AAAI 2025
0
citations
TD²-Net: Toward Denoising and Debiasing for Video Scene Graph Generation
AAAI 2024
0
citations
Multi-Step Denoising Scheduled Sampling: Towards Alleviating Exposure Bias for Diffusion Models
AAAI 2024
0
citations
Sheared Backpropagation for Fine-tuning Foundation Models
CVPR 2024
0
citations
UniMix: Towards Domain Adaptive and Generalizable LiDAR Semantic Segmentation in Adverse Weather
CVPR 2024
0
citations
FREE: Faster and Better Data-Free Meta-Learning
CVPR 2024
0
citations
Local-consistent Transformation Learning for Rotation-invariant Point Cloud Analysis
CVPR 2024
0
citations
Learn from Downstream and Be Yourself in Multimodal Large Language Models Fine-Tuning
ICML 2025
0
citations
HarmoDT: Harmony Multi-Task Decision Transformer for Offline Reinforcement Learning
ICML 2024
0
citations
Q-value Regularized Transformer for Offline Reinforcement Learning
ICML 2024
0
citations
Towards Theoretical Understandings of Self-Consuming Generative Models
ICML 2024
0
citations
Learning Versatile Filters for Efficient Convolutional Neural Networks
NeurIPS 2018
0
citations
Dual Swap Disentangling
NeurIPS 2018
0
citations
LIIR: Learning Individual Intrinsic Reward in Multi-Agent Reinforcement Learning
NeurIPS 2019
0
citations
Theoretical Analysis of Adversarial Learning: A Minimax Approach
NeurIPS 2019
0
citations
Likelihood-Free Overcomplete ICA and Applications In Causal Discovery
NeurIPS 2019
0
citations
Category Anchor-Guided Unsupervised Domain Adaptation for Semantic Segmentation
NeurIPS 2019
0
citations
Learning from Bad Data via Generation
NeurIPS 2019
0
citations
Positive-Unlabeled Compression on the Cloud
NeurIPS 2019
0
citations
Learn, Imagine and Create: Text-to-Image Generation from Prior Knowledge
NeurIPS 2019
0
citations
Control Batch Size and Learning Rate to Generalize Well: Theoretical and Empirical Evidence
NeurIPS 2019
0
citations
Auto Learning Attention
NeurIPS 2020
0
citations
Searching for Low-Bit Weights in Quantized Neural Networks
NeurIPS 2020
0
citations
Part-dependent Label Noise: Towards Instance-dependent Label Noise
NeurIPS 2020
0
citations
SCOP: Scientific Control for Reliable Neural Network Pruning
NeurIPS 2020
0
citations
Video Frame Interpolation without Temporal Priors
NeurIPS 2020
0
citations
Hard Example Generation by Texture Synthesis for Cross-domain Shape Similarity Learning
NeurIPS 2020
0
citations
Domain Generalization via Entropy Regularization
NeurIPS 2020
0
citations
Class-Disentanglement and Applications in Adversarial Detection and Defense
NeurIPS 2021
0
citations
Gauge Equivariant Transformer
NeurIPS 2021
0
citations
ViTAE: Vision Transformer Advanced by Exploring Intrinsic Inductive Bias
NeurIPS 2021
0
citations
CGLB: Benchmark Tasks for Continual Graph Learning
NeurIPS 2022
0
citations
APT-36K: A Large-scale Benchmark for Animal Pose Estimation and Tracking
NeurIPS 2022
0
citations
Benefits of Permutation-Equivariance in Auction Mechanisms
NeurIPS 2022
0
citations
Escaping from the Barren Plateau via Gaussian Initializations in Deep Variational Quantum Circuits
NeurIPS 2022
0
citations
Adversarial Auto-Augment with Label Preservation: A Representation Learning Principle Guided Approach
NeurIPS 2022
0
citations
Make Sharpness-Aware Minimization Stronger: A Sparsified Perturbation Approach
NeurIPS 2022
0
citations
Inducing Neural Collapse in Imbalanced Learning: Do We Really Need a Learnable Classifier at the End of Deep Neural Network?
NeurIPS 2022
0
citations
ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation
NeurIPS 2022
0
citations
VanillaNet: the Power of Minimalism in Deep Learning
NeurIPS 2023
0
citations
SAMRS: Scaling-up Remote Sensing Segmentation Dataset with Segment Anything Model
NeurIPS 2023
0
citations
Extending the Design Space of Graph Neural Networks by Rethinking Folklore Weisfeiler-Lehman
NeurIPS 2023
0
citations
MAG-GNN: Reinforcement Learning Boosted Graph Neural Network
NeurIPS 2023
0
citations
ConDaFormer: Disassembled Transformer with Local Structure Enhancement for 3D Point Cloud Understanding
NeurIPS 2023
0
citations
Stability and Generalization of the Decentralized Stochastic Gradient Descent Ascent Algorithm
NeurIPS 2023
0
citations
Cocktail: Mixing Multi-Modality Control for Text-Conditional Image Generation
NeurIPS 2023
0
citations
Domain Re-Modulation for Few-Shot Generative Domain Adaptation
NeurIPS 2023
0
citations
Learning Better with Less: Effective Augmentation for Sample-Efficient Visual Reinforcement Learning
NeurIPS 2023
0
citations
All Points Matter: Entropy-Regularized Distribution Alignment for Weakly-supervised 3D Segmentation
NeurIPS 2023
0
citations
Understanding How Consistency Works in Federated Learning via Stage-wise Relaxed Initialization
NeurIPS 2023
0
citations
Discovering Temporal Causal Relations from Subsampled Data
ICML 2015
0
citations
Domain Adaptation with Conditional Transferable Components
ICML 2016
0
citations
Algorithmic Stability and Hypothesis Complexity
ICML 2017
0
citations
Beyond Filters: Compact Feature Map for Portable Deep Model
ICML 2017
0
citations