Xiaokang Yang
35 Papers
208 Total Citations
Papers (35)
VidToMe: Video Token Merging for Zero-Shot Video Editing
CVPR 2024
89
citations
Domain-Controlled Prompt Learning
AAAI 2024, arXiv
30
citations
Domain Prompt Learning with Quaternion Networks
CVPR 2024
22
citations
Baking Gaussian Splatting into Diffusion Denoiser for Fast and Scalable Single-stage Image-to-3D Generation and Reconstruction
ICCV 2025, arXiv
16
citations
Multimodal Latent Diffusion Model for Complex Sewing Pattern Generation
ICCV 2025, arXiv
9
citations
Monocular Identity-Conditioned Facial Reflectance Reconstruction
CVPR 2024
7
citations
PostEdit: Posterior Sampling for Efficient Zero-Shot Image Editing
ICLR 2025
7
citations
Partial Label Learning with a Partner
AAAI 2024
6
citations
Disentangled World Models: Learning to Transfer Semantic Knowledge from Distracting Videos for Reinforcement Learning
ICCV 2025
4
citations
Disentangled Clothed Avatar Generation with Layered Representation
ICCV 2025
3
citations
Tendency-driven Mutual Exclusivity for Weakly Supervised Incremental Semantic Segmentation
ECCV 2024
3
citations
AniSDF: Fused-Granularity Neural Surfaces with Anisotropic Encoding for High-Fidelity 3D Reconstruction
ICLR 2025
3
citations
Degradation-Modeled Multipath Diffusion for Tunable Metalens Photography
ICCV 2025
2
citations
Rethinking Classifier Re-Training in Long-Tailed Recognition: Label Over-Smooth Can Balance
ICLR 2025
2
citations
Latent Intuitive Physics: Learning to Transfer Hidden Physics from A 3D Video
ICLR 2024
2
citations
POMP: Physics-constrainable Motion Generative Model through Phase Manifolds
CVPR 2025
1
citation
Perceiving and Acting in First-Person: A Dataset and Benchmark for Egocentric Human-Object-Human Interactions
ICCV 2025, arXiv
1
citation
HyperET: Efficient Training in Hyperbolic Space for Multi-modal Large Language Models
NeurIPS 2025
1
citation
CasCast: Skillful High-resolution Precipitation Nowcasting via Cascaded Modelling
ICML 2024
0
citations
OSDFace: One-Step Diffusion Model for Face Restoration
CVPR 2025
0
citations
Marten: Visual Question Answering with Mask Generation for Multi-modal Document Understanding
CVPR 2025
0
citations
Star with Bilinear Mapping
CVPR 2025
0
citations
Domain Generalization in CLIP via Learning with Diverse Text Prompts
CVPR 2025
0
citations
PassionSR: Post-Training Quantization with Adaptive Scale in One-Step Diffusion based Image Super-Resolution
CVPR 2025
0
citations
Generalized Tensor-based Parameter-Efficient Fine-Tuning via Lie Group Transformations
ICCV 2025
0
citations
QuantCache: Adaptive Importance-Guided Quantization with Hierarchical Latent and Layer Caching for Video Generation
ICCV 2025
0
citations
A Token-level Text Image Foundation Model for Document Understanding
ICCV 2025
0
citations
HAODiff: Human-Aware One-Step Diffusion via Dual-Prompt Guidance
NeurIPS 2025
0
citations
DAWP: A framework for global observation forecasting via Data Assimilation and Weather Prediction in satellite observation space
NeurIPS 2025
0
citations
FATE: Feature-Adapted Parameter Tuning for Vision-Language Models
AAAI 2025
0
citations
SAM-PARSER: Fine-Tuning SAM Efficiently by Parameter Space Reconstruction
AAAI 2024, arXiv
0
citations
LERE: Learning-Based Low-Rank Matrix Recovery with Rank Estimation
AAAI 2024
0
citations
Inter-X: Towards Versatile Human-Human Interaction Analysis
CVPR 2024
0
citations
ReGenNet: Towards Human Action-Reaction Synthesis
CVPR 2024
0
citations
S^3-Face: SSS-Compliant Facial Reflectance Estimation via Diffusion Priors
CVPR 2025
0
citations