John Hughes

4

Papers

62

Total Citations

Papers (4)

Looking Inward: Language Models Can Learn About Themselves by Introspection

Failures to Find Transferable Image Jailbreaks Between Vision-Language Models

Debating with More Persuasive LLMs Leads to More Truthful Answers

Hierarchical Quantized Autoencoders