Ruohan Gao
22
Papers
45
Total Citations
Papers (22)
The Audio-Visual Conversational Graph: From an Egocentric-Exocentric Perspective
CVPR 2024
15
citations
AURELIA: Test-time Reasoning Distillation in Audio-Visual LLMs
ICCV 2025
6
citations
Spherical World-Locking for Audio-Visual Localization in Egocentric Videos
ECCV 2024
6
citations
Hearing Anywhere in Any Environment
CVPR 2025
6
citations
Learning to Highlight Audio by Watching Movies
CVPR 2025arXiv
4
citations
GenFlowRL: Shaping Rewards with Generative Object-Centric Flow in Visual Reinforcement Learning
ICCV 2025arXiv
4
citations
EgoAdapt: Adaptive Multisensory Distillation and Policy Learning for Efficient Egocentric Perception
ICCV 2025
2
citations
Differentiable Room Acoustic Rendering with Multi-View Vision Priors
ICCV 2025
2
citations
RealImpact: A Dataset of Impact Sound Fields for Real Objects
CVPR 2023
0
citations
The ObjectFolder Benchmark: Multisensory Learning With Neural and Real Objects
CVPR 2023
0
citations
On-Demand Learning for Deep Image Restoration
ICCV 2017arXiv
0
citations
Co-Separating Sounds of Visual Objects
ICCV 2019
0
citations
VisualEchoes: Spatial Image Representation Learning through Echolocation
ECCV 2020
0
citations
2.5D Visual Sound
CVPR 2019
0
citations
AVTrustBench: Assessing and Enhancing Reliability and Robustness in Audio-Visual LLMs
ICCV 2025
0
citations
Hearing Anything Anywhere
CVPR 2024
0
citations
Im2Flow: Motion Hallucination From Static Images for Action Recognition
CVPR 2018arXiv
0
citations
Listen to Look: Action Recognition by Previewing Audio
CVPR 2020arXiv
0
citations
VisualVoice: Audio-Visual Speech Separation With Cross-Modal Consistency
CVPR 2021arXiv
0
citations
ObjectFolder 2.0: A Multisensory Object Dataset for Sim2Real Transfer
CVPR 2022
0
citations
Visual Acoustic Matching
CVPR 2022arXiv
0
citations
SoundCam: A Dataset for Finding Humans Using Room Acoustics
NeurIPS 2023
0
citations