David Harwath
5
Papers
58
Total Citations
Papers (5)
SyllableLM: Learning Coarse Semantic Units for Speech Language Models
ICLR 2025arXiv
22
citations
Action2Sound: Ambient-Aware Generation of Action Sounds from Egocentric Videos
ECCV 2024
19
citations
SoundingActions: Learning How Actions Sound from Narrated Egocentric Videos
CVPR 2024
11
citations
VoiceCraft-Dub: Automated Video Dubbing with Neural Codec Language Models
ICCV 2025
6
citations
BAT: Learning to Reason about Spatial Sounds with Large Language Models
ICML 2024
0
citations