ECCV 2024 "proximal policy optimization" Papers
2 papers found
Exploiting Semantic Reconstruction to Mitigate Hallucinations in Vision-Language Models
Minchan Kim, Minyeong Kim, Junik Bae et al.
ECCV 2024posterarXiv:2403.16167
10
citations
Multimodal Label Relevance Ranking via Reinforcement Learning
Taian Guo, Taolin Zhang, Haoqian Wu et al.
ECCV 2024posterarXiv:2407.13221
1
citations