Ethan Perez
7
Papers
335
Total Citations
Papers (7)
Inverse Scaling: When Bigger Isn't Better
ICLR 2025
180
citations
Vision-Language Models are Zero-Shot Reward Models for Reinforcement Learning
ICLR 2024
133
citations
Failures to Find Transferable Image Jailbreaks Between Vision-Language Models
ICLR 2025
22
citations
Debating with More Persuasive LLMs Leads to More Truthful Answers
ICML 2024
0
citations
Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks
NeurIPS 2020
0
citations
True Few-Shot Learning with Language Models
NeurIPS 2021
0
citations
Language Models Don't Always Say What They Think: Unfaithful Explanations in Chain-of-Thought Prompting
NeurIPS 2023
0
citations