Ruohan Gao

22 Papers
45 Total Citations

Papers (22)

The Audio-Visual Conversational Graph: From an Egocentric-Exocentric Perspective

CVPR 2024
15
citations

AURELIA: Test-time Reasoning Distillation in Audio-Visual LLMs

ICCV 2025
6
citations

Spherical World-Locking for Audio-Visual Localization in Egocentric Videos

ECCV 2024
6
citations

Hearing Anywhere in Any Environment

CVPR 2025
6
citations

Learning to Highlight Audio by Watching Movies

CVPR 2025, arXiv
4
citations

GenFlowRL: Shaping Rewards with Generative Object-Centric Flow in Visual Reinforcement Learning

ICCV 2025, arXiv
4
citations

EgoAdapt: Adaptive Multisensory Distillation and Policy Learning for Efficient Egocentric Perception

ICCV 2025
2
citations

Differentiable Room Acoustic Rendering with Multi-View Vision Priors

ICCV 2025
2
citations

RealImpact: A Dataset of Impact Sound Fields for Real Objects

CVPR 2023
0
citations

The ObjectFolder Benchmark: Multisensory Learning With Neural and Real Objects

CVPR 2023
0
citations

On-Demand Learning for Deep Image Restoration

ICCV 2017, arXiv
0
citations

Co-Separating Sounds of Visual Objects

ICCV 2019
0
citations

VisualEchoes: Spatial Image Representation Learning through Echolocation

ECCV 2020
0
citations

2.5D Visual Sound

CVPR 2019
0
citations

AVTrustBench: Assessing and Enhancing Reliability and Robustness in Audio-Visual LLMs

ICCV 2025
0
citations

Hearing Anything Anywhere

CVPR 2024
0
citations

Im2Flow: Motion Hallucination From Static Images for Action Recognition

CVPR 2018, arXiv
0
citations

Listen to Look: Action Recognition by Previewing Audio

CVPR 2020, arXiv
0
citations

VisualVoice: Audio-Visual Speech Separation With Cross-Modal Consistency

CVPR 2021, arXiv
0
citations

ObjectFolder 2.0: A Multisensory Object Dataset for Sim2Real Transfer

CVPR 2022
0
citations

Visual Acoustic Matching

CVPR 2022, arXiv
0
citations

SoundCam: A Dataset for Finding Humans Using Room Acoustics

NeurIPS 2023
0
citations