Yuxin Guo
7
Papers
28
Total Citations
Papers (7)
UniMLVG: Unified Framework for Multi-view Long Video Generation with Comprehensive Control Capabilities for Autonomous Driving
ICCV 2025
11
citations
GenHancer: Imperfect Generative Models are Secretly Strong Vision-Centric Enhancers
ICCV 2025
9
citations
Aligned Better, Listen Better for Audio-Visual Large Language Models
ICLR 2025
8
citations
MaskGWM: A Generalizable Driving World Model with Video Mask Reconstruction
CVPR 2025
0
citations
CrossMAE: Cross-Modality Masked Autoencoders for Region-Aware Audio-Visual Pre-Training
CVPR 2024
0
citations
On the Nonlinearity of Layer Normalization
ICML 2024
0
citations
Dual Mean-Teacher: An Unbiased Semi-Supervised Framework for Audio-Visual Source Localization
NeurIPS 2023
0
citations