Rishub Tamirisa
4
Papers
175
Total Citations
Papers (4)
Tamper-Resistant Safeguards for Open-Weight LLMs
ICLR 2025arXiv
108
citations
FedSelect: Personalized Federated Learning with Customized Selection of Parameters for Fine-Tuning
CVPR 2024
36
citations
Utility Engineering: Analyzing and Controlling Emergent Value Systems in AIs
NeurIPS 2025
31
citations
The WMDP Benchmark: Measuring and Reducing Malicious Use with Unlearning
ICML 2024
0
citations