ICLR Poster "multimodal alignment" Papers
3 papers found
Can Video LLMs Refuse to Answer? Alignment for Answerability in Video Large Language Models
Eunseop Yoon, Hee Suk Yoon, Mark Hasegawa-Johnson et al.
ICLR 2025, poster, arXiv:2507.04976
4 citations
FLOPS: Forward Learning with OPtimal Sampling
Tao Ren, Zishi Zhang, Jinyang Jiang et al.
ICLR 2025, poster, arXiv:2410.05966
2 citations
MMDisCo: Multi-Modal Discriminator-Guided Cooperative Diffusion for Joint Audio and Video Generation
Akio Hayakawa, Masato Ishii, Takashi Shibuya et al.
ICLR 2025, poster, arXiv:2405.17842
16 citations