Puyuan Peng
4
Papers
47
Total Citations
Papers (4)
SyllableLM: Learning Coarse Semantic Units for Speech Language Models
ICLR 2025arXiv
22
citations
Action2Sound: Ambient-Aware Generation of Action Sounds from Egocentric Videos
ECCV 2024
19
citations
VoiceCraft-Dub: Automated Video Dubbing with Neural Codec Language Models
ICCV 2025
6
citations
BAT: Learning to Reason about Spatial Sounds with Large Language Models
ICML 2024
0
citations