Yang Zhou

29
Papers
264
Total Citations

Papers (29)

FedASMU: Efficient Asynchronous Federated Learning with Dynamic Staleness-Aware Model Update

AAAI 2024, arXiv
71
citations

Aether: Geometric-Aware Unified World Modeling

ICCV 2025
47
citations

SmartRefine: A Scenario-Adaptive Refinement Framework for Efficient Motion Prediction

CVPR 2024
38
citations

HiSplat: Hierarchical 3D Gaussian Splatting for Generalizable Sparse-View Reconstruction

ICLR 2025
34
citations

The Curse of Multi-Modalities: Evaluating Hallucinations of Large Multimodal Models across Language, Visual, and Audio

NeurIPS 2025
26
citations

Attention Distillation: A Unified Approach to Visual Characteristics Transfer

CVPR 2025
21
citations

In-Hand 3D Object Reconstruction from a Monocular RGB Video

AAAI 2024, arXiv
7
citations

Visual Persona: Foundation Model for Full-Body Human Customization

CVPR 2025
6
citations

Self-Evolutionary Large Language Models Through Uncertainty-Enhanced Preference Optimization

AAAI 2025
5
citations

DMesh++: An Efficient Differentiable Mesh for Complex Shapes

ICCV 2025, arXiv
3
citations

ShotVL: Human-Centric Highlight Frame Retrieval via Language Queries

AAAI 2025
2
citations

Bridging Symmetry and Robustness: On the Role of Equivariance in Enhancing Adversarial Robustness

NeurIPS 2025, arXiv
2
citations

Describe, Adapt and Combine: Empowering CLIP Encoders for Open-set 3D Object Retrieval

ICCV 2025
1
citation

PAC-Bayes Bounds for Multivariate Linear Regression and Linear Autoencoders

NeurIPS 2025
1
citation

Free-viewpoint Human Animation with Pose-correlated Reference Selection

CVPR 2025
0
citations

HexGen: Generative Inference of Large Language Model over Heterogeneous Environment

ICML 2024
0
citations

Move-in-2D: 2D-Conditioned Human Motion Generation

CVPR 2025
0
citations

VideoGigaGAN: Towards Detail-rich Video Super-Resolution

CVPR 2025
0
citations

VEGGIE: Instructional Editing and Reasoning Video Concepts with Grounded Generation

ICCV 2025
0
citations

Video Motion Graphs

ICCV 2025
0
citations

Visual Textualization for Image Prompted Object Detection

ICCV 2025
0
citations

Instance-Level Video Depth in Groups Beyond Occlusions

ICCV 2025
0
citations

Privacy-Preserving V2X Collaborative Perception Integrating Unknown Collaborators

AAAI 2025
0
citations

Treasures in Discarded Weights for LLM Quantization

AAAI 2025
0
citations

Tuning Stable Rank Shrinkage: Aiming at the Overlooked Structural Risk in Fine-tuning

CVPR 2024
0
citations

Deformable One-shot Face Stylization via DINO Semantic Guidance

CVPR 2024
0
citations

Leveraging Frame Affinity for sRGB-to-RAW Video De-rendering

CVPR 2024
0
citations

Generating Non-Stationary Textures using Self-Rectification

CVPR 2024
0
citations

Effective Federated Graph Matching

ICML 2024
0
citations