Amrit Singh Bedi
Papers: 7
Total Citations: 0
Papers (7)
Immune: Improving Safety Against Jailbreaks in Multi-modal LLMs via Inference-Time Alignment
CVPR 2025, 0 citations
Closing the Gap: Achieving Global Convergence (Last Iterate) of Actor-Critic under Markovian Sampling with Neural Network Parametrization
ICML 2024, 0 citations
On the Sample Complexity and Metastability of Heavy-tailed Policy Search in Continuous Control
ICML 2024, 0 citations
Position: On the Possibilities of AI-Generated Text Detection
ICML 2024, 0 citations
PIPER: Primitive-Informed Preference-based Hierarchical Reinforcement Learning via Hindsight Relabeling
ICML 2024, 0 citations
MaxMin-RLHF: Alignment with Diverse Human Preferences
ICML 2024, 0 citations
Towards Global Optimality for Practical Average Reward Reinforcement Learning without Mixing Time Oracles
ICML 2024, 0 citations