Xiaokang Yang

35
Papers
208
Total Citations

Papers (35)

VidToMe: Video Token Merging for Zero-Shot Video Editing

CVPR 2024
89
citations

Domain-Controlled Prompt Learning

AAAI 2024arXiv
30
citations

Domain Prompt Learning with Quaternion Networks

CVPR 2024
22
citations

Baking Gaussian Splatting into Diffusion Denoiser for Fast and Scalable Single-stage Image-to-3D Generation and Reconstruction

ICCV 2025arXiv
16
citations

Multimodal Latent Diffusion Model for Complex Sewing Pattern Generation

ICCV 2025arXiv
9
citations

Monocular Identity-Conditioned Facial Reflectance Reconstruction

CVPR 2024
7
citations

PostEdit: Posterior Sampling for Efficient Zero-Shot Image Editing

ICLR 2025
7
citations

Partial Label Learning with a Partner

AAAI 2024
6
citations

Disentangled World Models: Learning to Transfer Semantic Knowledge from Distracting Videos for Reinforcement Learning

ICCV 2025
4
citations

Disentangled Clothed Avatar Generation with Layered Representation

ICCV 2025
3
citations

Tendency-driven Mutual Exclusivity for Weakly Supervised Incremental Semantic Segmentation

ECCV 2024
3
citations

AniSDF: Fused-Granularity Neural Surfaces with Anisotropic Encoding for High-Fidelity 3D Reconstruction

ICLR 2025
3
citations

Degradation-Modeled Multipath Diffusion for Tunable Metalens Photography

ICCV 2025
2
citations

Rethinking Classifier Re-Training in Long-Tailed Recognition: Label Over-Smooth Can Balance

ICLR 2025
2
citations

Latent Intuitive Physics: Learning to Transfer Hidden Physics from A 3D Video

ICLR 2024
2
citations

POMP: Physics-constrainable Motion Generative Model through Phase Manifolds

CVPR 2025
1
citations

Perceiving and Acting in First-Person: A Dataset and Benchmark for Egocentric Human-Object-Human Interactions

ICCV 2025arXiv
1
citations

HyperET: Efficient Training in Hyperbolic Space for Multi-modal Large Language Models

NeurIPS 2025
1
citations

CasCast: Skillful High-resolution Precipitation Nowcasting via Cascaded Modelling

ICML 2024
0
citations

OSDFace: One-Step Diffusion Model for Face Restoration

CVPR 2025
0
citations

Marten: Visual Question Answering with Mask Generation for Multi-modal Document Understanding

CVPR 2025
0
citations

Star with Bilinear Mapping

CVPR 2025
0
citations

Domain Generalization in CLIP via Learning with Diverse Text Prompts

CVPR 2025
0
citations

PassionSR: Post-Training Quantization with Adaptive Scale in One-Step Diffusion based Image Super-Resolution

CVPR 2025
0
citations

Generalized Tensor-based Parameter-Efficient Fine-Tuning via Lie Group Transformations

ICCV 2025
0
citations

QuantCache: Adaptive Importance-Guided Quantization with Hierarchical Latent and Layer Caching for Video Generation

ICCV 2025
0
citations

A Token-level Text Image Foundation Model for Document Understanding

ICCV 2025
0
citations

HAODiff: Human-Aware One-Step Diffusion via Dual-Prompt Guidance

NeurIPS 2025
0
citations

DAWP: A framework for global observation forecasting via Data Assimilation and Weather Prediction in satellite observation space

NeurIPS 2025
0
citations

FATE: Feature-Adapted Parameter Tuning for Vision-Language Models

AAAI 2025
0
citations

SAM-PARSER: Fine-Tuning SAM Efficiently by Parameter Space Reconstruction

AAAI 2024arXiv
0
citations

LERE: Learning-Based Low-Rank Matrix Recovery with Rank Estimation

AAAI 2024
0
citations

Inter-X: Towards Versatile Human-Human Interaction Analysis

CVPR 2024
0
citations

ReGenNet: Towards Human Action-Reaction Synthesis

CVPR 2024
0
citations

S^3-Face: SSS-Compliant Facial Reflectance Estimation via Diffusion Priors

CVPR 2025
0
citations