Hao Yang

40 Papers
153 Total Citations

Papers (40)

Goku: Flow Based Video Generative Foundation Models

CVPR 2025, arXiv
53
citations

Language-driven All-in-one Adverse Weather Removal

CVPR 2024
48
citations

Translate Meanings, Not Just Words: IdiomKB’s Role in Optimizing Idiomatic Translation with Language Models

AAAI 2024, arXiv
35
citations

UniCombine: Unified Multi-Conditional Combination with Diffusion Transformer

ICCV 2025
15
citations

PointDGMamba: Domain Generalization of Point Cloud Classification via Generalized State Space Model

AAAI 2025
1
citations

Enhancing Numerical Prediction of MLLMs with Soft Labeling

ICCV 2025
1
citations

THRONE: An Object-based Hallucination Benchmark for the Free-form Generations of Large Vision-Language Models

CVPR 2024
0
citations

Exploit Bounding Box Annotations for Multi-Label Object Recognition

CVPR 2016
0
citations

Efficient 3D Room Shape Recovery From a Single Panorama

CVPR 2016
0
citations

MIML-FCN+: Multi-Instance Multi-Label Learning via Fully Convolutional Networks With Privileged Information

CVPR 2017
0
citations

Mask-Guided Portrait Editing With Conditional GANs

CVPR 2019
0
citations

Face Parsing With RoI Tanh-Warping

CVPR 2019
0
citations

Face X-Ray for More General Face Forgery Detection

CVPR 2020
0
citations

Advancing High Fidelity Identity Swapping for Forgery Detection

CVPR 2020
0
citations

Unsupervised Pre-Training for Person Re-Identification

CVPR 2021, arXiv
0
citations

Style-Based Point Generator With Adversarial Rendering for Point Cloud Completion

CVPR 2021, arXiv
0
citations

Omni-DETR: Omni-Supervised Object Detection With Transformers

CVPR 2022
0
citations

Large-Scale Pre-Training for Person Re-Identification With Noisy Labels

CVPR 2022, arXiv
0
citations

General Facial Representation Learning in a Visual-Linguistic Manner

CVPR 2022, arXiv
0
citations

ContraNeRF: Generalizable Neural Radiance Fields for Synthetic-to-Real Novel View Synthesis via Contrastive Learning

CVPR 2023, arXiv
0
citations

MaskCLIP: Masked Self-Distillation Advances Contrastive Language-Image Pretraining

CVPR 2023, arXiv
0
citations

A Meta-Learning Approach to Predicting Performance and Data Requirements

CVPR 2023, arXiv
0
citations

Guided Recommendation for Model Fine-Tuning

CVPR 2023
0
citations

Detecting 11K Classes: Large Scale Object Detection Without Fine-Grained Bounding Boxes

ICCV 2019
0
citations

Adversarial Example Detection Using Latent Neighborhood Graph

ICCV 2021
0
citations

ADNet: Leveraging Error-Bias Towards Normal Direction in Face Alignment

ICCV 2021, arXiv
0
citations

InterFormer: Real-time Interactive Image Segmentation

ICCV 2023, arXiv
0
citations

Local and Global Logit Adjustments for Long-Tailed Learning

ICCV 2023
0
citations

Rethinking Few-Shot Object Detection on a Multi-Domain Benchmark

ECCV 2022
0
citations

Real-Time Neural Character Rendering with Pose-Guided Multiplane Images

ECCV 2022
0
citations

Scaling up Image Segmentation across Data and Tasks

CVPR 2025
0
citations

Met2Net: A Decoupled Two-Stage Spatio-Temporal Forecasting Model for Complex Meteorological Systems

ICCV 2025
0
citations

ZeroStereo: Zero-shot Stereo Matching from Single Images

ICCV 2025
0
citations

Test-Time Adaptation on Noisy Data via Model-Pruning-Based Filtering and Flatness-Aware Entropy Minimization

AAAI 2025
0
citations

AvatarVerse: High-Quality & Stable 3D Avatar Creation from Text and Pose

AAAI 2024
0
citations

LDP: Language-driven Dual-Pixel Image Defocus Deblurring Network

CVPR 2024
0
citations

Delayed Propagation Transformer: A Universal Computation Engine towards Practical Control in Cyber-Physical Systems

NeurIPS 2021
0
citations

Your representations are in the network: composable and parallel adaptation for large scale models

NeurIPS 2023
0
citations

PRED: Pre-training via Semantic Rendering on LiDAR Point Clouds

NeurIPS 2023
0
citations

From Trainable Negative Depth to Edge Heterophily in Graphs

NeurIPS 2023
0
citations