"flow-based transformer" Papers
2 papers found
Model-Guided Dual-Role Alignment for High-Fidelity Open-Domain Video-to-Audio Generation
Kang Zhang, Trung X. Pham, Suyeon Lee et al.
NeurIPS 2025posterarXiv:2510.24103
VinTAGe: Joint Video and Text Conditioning for Holistic Audio Generation
Saksham Singh Kushwaha, Yapeng Tian
CVPR 2025posterarXiv:2412.10768
12
citations