Yang Zhang

57
Papers
433
Total Citations

Papers (57)

Dilated Recurrent Neural Networks

NeurIPS 2017arXiv
338
citations

HMD-Poser: On-Device Real-time Human Motion Tracking from Scalable Sparse Observations

CVPR 2024
21
citations

Enhancing Semantic Fidelity in Text-to-Image Synthesis: Attention Regulation in Diffusion Models

ECCV 2024
19
citations

Online Preference Alignment for Language Models via Count-based Exploration

ICLR 2025
19
citations

Correcting Diffusion Generation through Resampling

CVPR 2024
12
citations

CodeSteer: Symbolic-Augmented Language Models via Code/Text Guidance

ICML 2025
11
citations

Task-Agnostic Pre-training and Task-Guided Fine-tuning for Versatile Diffusion Planner

ICML 2025
4
citations

Fictitious Synthetic Data Can Improve LLM Factuality via Prerequisite Learning

ICLR 2025
4
citations

IGD: Token Decisiveness Modeling via Information Gain in LLMs for Personalized Recommendation

NeurIPS 2025
3
citations

Plug-in Feedback Self-adaptive Attention in CLIP for Training-free Open-Vocabulary Segmentation

ICCV 2025
1
citations

Can Large Multimodal Models Understand Agricultural Scenes? Benchmarking with AgroMind

NeurIPS 2025
1
citations

Collaborative Weakly Supervised Video Correlation Learning for Procedure-Aware Instructional Video Analysis

AAAI 2024
0
citations

Polyper: Boundary Sensitive Polyp Segmentation

AAAI 2024arXiv
0
citations

APSeg: Auto-Prompt Network for Cross-Domain Few-Shot Semantic Segmentation

CVPR 2024
0
citations

Accurate LoRA-Finetuning Quantization of LLMs via Information Retention

ICML 2024
0
citations

Contrastive Representation for Data Filtering in Cross-Domain Offline Reinforcement Learning

ICML 2024
0
citations

The Stronger the Diffusion Model, the Easier the Backdoor: Data Poisoning to Induce Copyright BreachesWithout Adjusting Finetuning Pipeline

ICML 2024
0
citations

Speech Self-Supervised Learning Using Diffusion Model Synthetic Data

ICML 2024
0
citations

Decomposing Uncertainty for Large Language Models through Input Clarification Ensembling

ICML 2024
0
citations

Fast Zero-Shot Image Tagging

CVPR 2016
0
citations

PolarNet: An Improved Grid Representation for Online LiDAR Point Clouds Semantic Segmentation

CVPR 2020arXiv
0
citations

Copy and Paste GAN: Face Hallucination From Shaded Thumbnails

CVPR 2020arXiv
0
citations

Panoptic-PolarNet: Proposal-Free LiDAR Point Cloud Panoptic Segmentation

CVPR 2021
0
citations

The Lottery Tickets Hypothesis for Supervised and Self-Supervised Pre-Training in Computer Vision Models

CVPR 2021arXiv
0
citations

SAP-DETR: Bridging the Gap Between Salient Points and Queries-Based Transformer Detector for Fast Model Convergency

CVPR 2023
0
citations

Uncovering the Disentanglement Capability in Text-to-Image Diffusion Models

CVPR 2023arXiv
0
citations

Can't Steal? Cont-Steal! Contrastive Stealing Attacks Against Image Encoders

CVPR 2023
0
citations

Curriculum Domain Adaptation for Semantic Segmentation of Urban Scenes

ICCV 2017arXiv
0
citations

Domain Randomization and Pyramid Consistency: Simulation-to-Real Generalization Without Accessing Target Domain Data

ICCV 2019
0
citations

SACoD: Sensor Algorithm Co-Design Towards Efficient CNN-Powered Intelligent PhlatCam

ICCV 2021
0
citations

A General Recurrent Tracking Framework Without Real Data

ICCV 2021
0
citations

Harnessing the Spatial-Temporal Attention of Diffusion Models for High-Fidelity Text-to-Image Synthesis

ICCV 2023arXiv
0
citations

TempFormer: Temporally Consistent Transformer for Video Denoising

ECCV 2022
0
citations

Semi-Leak: Membership Inference Attacks against Semi-Supervised Learning

ECCV 2022
0
citations

DoGA: Enhancing Grounded Object Detection via Grouped Pre-Training with Attributes

AAAI 2025
0
citations

VSP: Diagnosing the Dual Challenges of Perception and Reasoning in Spatial Planning Tasks for MLLMs

ICCV 2025
0
citations

LDIP: Long Distance Information Propagation for Video Super-Resolution

ICCV 2025
0
citations

Event-guided HDR Reconstruction with Diffusion Priors

ICCV 2025
0
citations

Anti-Tamper Protection for Unauthorized Individual Image Generation

ICCV 2025
0
citations

LOTA: Bit-Planes Guided AI-Generated Image Detection

ICCV 2025
0
citations

Hate in Plain Sight: On the Risks of Moderating AI-Generated Hateful Illusions

ICCV 2025
0
citations

EMHI: A Multimodal Egocentric Human Motion Dataset with HMD and Body-Worn IMUs

AAAI 2025
0
citations

Bridging the Gap between Gaussian Diffusion Models and Universal Quantization for Image Compression

CVPR 2025
0
citations

VIoTGPT: Learning to Schedule Vision Tools Towards Intelligent Video Internet of Things

AAAI 2025
0
citations

Behavior Importance-Aware Graph Neural Architecture Search for Cross-Domain Recommendation

AAAI 2025
0
citations

A Game Theoretic Approach to Class-wise Selective Rationalization

NeurIPS 2019
0
citations

The Lottery Ticket Hypothesis for Pre-trained BERT Networks

NeurIPS 2020
0
citations

Understanding Interlocking Dynamics of Cooperative Rationalization

NeurIPS 2021
0
citations

Drawing Robust Scratch Tickets: Subnetworks with Inborn Robustness Are Found within Randomly Initialized Networks

NeurIPS 2021
0
citations

Fine-Grained Neural Network Explanation by Identifying Input Features with Predictive Information

NeurIPS 2021
0
citations

BCORLE($\lambda$): An Offline Reinforcement Learning and Evaluation Framework for Coupons Allocation in E-commerce Market

NeurIPS 2021
0
citations

PARP: Prune, Adjust and Re-Prune for Self-Supervised Speech Recognition

NeurIPS 2021
0
citations

Losses Can Be Blessings: Routing Self-Supervised Speech Representations Towards Efficient Multilingual and Multitask Speech Processing

NeurIPS 2022
0
citations

Amplifying Membership Exposure via Data Poisoning

NeurIPS 2022
0
citations

Fairness Reprogramming

NeurIPS 2022
0
citations

A Theoretical Explanation for Perplexing Behaviors of Backpropagation-based Visualizations

ICML 2018
0
citations

AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss

ICML 2019
0
citations