Yao Zhao

28
Papers
126
Total Citations

Papers (28)

Towards the Uncharted: Density-Descending Feature Perturbation for Semi-supervised Semantic Segmentation

CVPR 2024
30
citations

Frozen CLIP: A Strong Backbone for Weakly Supervised Semantic Segmentation

CVPR 2024
29
citations

VideoWorld: Exploring Knowledge Learning from Unlabeled Videos

CVPR 2025
28
citations

Jasmine: Harnessing Diffusion Prior for Self-supervised Depth Estimation

NeurIPS 2025
12
citations

Lyapunov-Stable Deep Equilibrium Models

AAAI 2024arXiv
7
citations

EvEnhancer: Empowering Effectiveness, Efficiency and Generalizability for Continuous Space-Time Video Super-Resolution with Events

CVPR 2025
6
citations

Dual-view X-ray Detection: Can AI Detect Prohibited Items from Dual-view X-ray Images like Humans?

CVPR 2025
4
citations

ODDN: Addressing Unpaired Data Challenges in Open-World Deepfake Detection on Online Social Networks

AAAI 2025
4
citations

Collapsed Language Models Promote Fairness

ICLR 2025
1
citations

Transferable and Principled Efficiency for Open-Vocabulary Segmentation

CVPR 2024
1
citations

NTClick: Achieving Precise Interactive Segmentation With Noise-tolerant Clicks

CVPR 2025
1
citations

Attend and Enrich: Enhanced Visual Prompt for Zero-Shot Learning

AAAI 2025
1
citations

Unsupervised Region-Based Image Editing of Denoising Diffusion Models

AAAI 2025
1
citations

Visual Relation Diffusion for Human-Object Interaction Detection

ICCV 2025
1
citations

Fixing the Loose Brake: Exponential-Tailed Stopping Time in Best Arm Identification

ICML 2025
0
citations

CLIP-GS: Unifying Vision-Language Representation with 3D Gaussian Splatting

ICCV 2025
0
citations

ReCoT: Reflective Self-Correction Training for Mitigating Confirmation Bias in Large Vision-Language Models

ICCV 2025
0
citations

CharaConsist: Fine-Grained Consistent Character Generation

ICCV 2025
0
citations

PixelStitch: Structure-Preserving Pixel-Wise Bidirectional Warps for Unsupervised Image Stitching

ICCV 2025
0
citations

Memory Efficient Matting with Adaptive Token Routing

AAAI 2025
0
citations

C2P-CLIP: Injecting Category Common Prompt in CLIP to Enhance Generalization in Deepfake Detection

AAAI 2025
0
citations

CSR:Achieving 1 Bit Key-Value Cache via Sparse Representation

AAAI 2025
0
citations

On the Unstable Convergence Regime of Gradient Descent

AAAI 2024
0
citations

Frequency-Aware Deepfake Detection: Improving Generalizability through Frequency Domain Learning

AAAI 2024
0
citations

Endow SAM with Keen Eyes: Temporal-spatial Prompt Learning for Video Camouflaged Object Detection

CVPR 2024
0
citations

Forgery-aware Adaptive Transformer for Generalizable Synthetic Image Detection

CVPR 2024
0
citations

PixelLM: Pixel Reasoning with Large Multimodal Model

CVPR 2024
0
citations

Rethinking the Up-Sampling Operations in CNN-based Generative Network for Generalizable Deepfake Detection

CVPR 2024
0
citations