α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Himabindu Lakkaraju
Himabindu Lakkaraju
3
Papers
10
Total Citations
Papers (3)
More RLHF, More Trust? On The Impact of Preference Alignment On Trustworthiness
ICLR 2025
10
citations
Understanding the Effects of Iterative Prompting on Truthfulness
ICML 2024
0
citations
In-Context Unlearning: Language Models as Few-Shot Unlearners
ICML 2024
0
citations