Dacheng Tao
40
Papers
398
Total Citations
Papers (40)
R1-VL: Learning to Reason with Multimodal Large Language Models via Step-wise Group Relative Policy Optimization
ICCV 2025
206
citations
Divide, Conquer and Combine: A Training-Free Framework for High-Resolution Image Perception in Multimodal Large Language Models
AAAI 2025
73
citations
Revisiting Plasticity in Visual Reinforcement Learning: Data, Modules and Training Stages
ICLR 2024
25
citations
Revisiting Backdoor Attacks against Large Vision-Language Models from Domain Shift
CVPR 2025
25
citations
SimDistill: Simulated Multi-Modal Distillation for BEV 3D Object Detection
AAAI 2024arXiv
24
citations
One More Step: A Versatile Plug-and-Play Module for Rectifying Diffusion Schedule Flaws and Enhancing Low-Frequency Controls
CVPR 2024
9
citations
Synergy of Sight and Semantics: Visual Intention Understanding with CLIP
ECCV 2024
7
citations
MMReason: An Open-Ended Multi-Modal Multi-Step Reasoning Benchmark for MLLMs Toward AGI
ICCV 2025
7
citations
Network Sparsity Unlocks the Scaling Potential of Deep Reinforcement Learning
ICML 2025
6
citations
Free-Form Motion Control: Controlling the 6D Poses of Camera and Objects in Video Generation
ICCV 2025
4
citations
Mastering Massive Multi-Task Reinforcement Learning via Mixture-of-Expert Decision Transformer
ICML 2025
4
citations
Adaptive Defense against Harmful Fine-Tuning for Large Language Models via Bayesian Data Scheduler
NeurIPS 2025
2
citations
ICLShield: Exploring and Mitigating In-Context Learning Backdoor Attacks
ICML 2025
2
citations
Learning system dynamics without forgetting
ICLR 2025
2
citations
LLM Data Selection and Utilization via Dynamic Bi-level Optimization
ICML 2025
1
citations
AiDE-Q: Synthetic Labeled Datasets Can Enhance Learning Models for Quantum Property Estimation
NeurIPS 2025
1
citations
Q-value Regularized Transformer for Offline Reinforcement Learning
ICML 2024
0
citations
Towards Theoretical Understandings of Self-Consuming Generative Models
ICML 2024
0
citations
Sparse Model Inversion: Efficient Inversion of Vision Transformers for Data-Free Applications
ICML 2024
0
citations
Merging Multi-Task Models via Weight-Ensembling Mixture of Experts
ICML 2024
0
citations
Generalization Analysis of Stochastic Weight Averaging with General Sampling
ICML 2024
0
citations
Task Groupings Regularization: Data-Free Meta-Learning with Heterogeneous Pre-trained Models
ICML 2024
0
citations
Representation Surgery for Multi-Task Model Merging
ICML 2024
0
citations
LoRA Recycle: Unlocking Tuning-Free Few-Shot Adaptability in Visual Foundation Models by Recycling Pre-Tuned LoRAs
CVPR 2025
0
citations
Confronting Reward Overoptimization for Diffusion Models: A Perspective of Inductive and Primacy Biases
ICML 2024
0
citations
Neuron: Learning Context-Aware Evolving Representations for Zero-Shot Skeleton Action Recognition
CVPR 2025
0
citations
Harnessing Text-to-Image Diffusion Models for Point Cloud Self-Supervised Learning
ICCV 2025
0
citations
CopyrightShield: Enhancing Diffusion Model Security Against Copyright Infringement Attacks
ICCV 2025
0
citations
Rethink Sparse Signals for Pose-guided Text-to-image Generation
ICCV 2025
0
citations
Analytic Energy-Guided Policy Optimization for Offline Reinforcement Learning
NeurIPS 2025
0
citations
Improving Complex Reasoning over Knowledge Graph with Logic-Aware Curriculum Tuning
AAAI 2025
0
citations
Modeling All Response Surfaces in One for Conditional Search Spaces
AAAI 2025
0
citations
TD²-Net: Toward Denoising and Debiasing for Video Scene Graph Generation
AAAI 2024
0
citations
Multi-Step Denoising Scheduled Sampling: Towards Alleviating Exposure Bias for Diffusion Models
AAAI 2024
0
citations
Sheared Backpropagation for Fine-tuning Foundation Models
CVPR 2024
0
citations
UniMix: Towards Domain Adaptive and Generalizable LiDAR Semantic Segmentation in Adverse Weather
CVPR 2024
0
citations
FREE: Faster and Better Data-Free Meta-Learning
CVPR 2024
0
citations
Local-consistent Transformation Learning for Rotation-invariant Point Cloud Analysis
CVPR 2024
0
citations
Learn from Downstream and Be Yourself in Multimodal Large Language Models Fine-Tuning
ICML 2025
0
citations
HarmoDT: Harmony Multi-Task Decision Transformer for Offline Reinforcement Learning
ICML 2024
0
citations