Yangchen Pan
3
Papers
12
Total Citations
Papers (3)
Understanding the robustness difference between stochastic gradient descent and adaptive gradient methods
ICLR 2024
8
citations
PANDAS: Improving Many-shot Jailbreaking via Positive Affirmation, Negative Demonstration, and Adaptive Sampling
ICML 2025
4
citations
Position: Reinforcement Learning in Dynamic Treatment Regimes Needs Critical Reexamination
ICML 2024
0
citations