Yangchen Pan

3 Papers

12 Total Citations

Papers (3)

Understanding the robustness difference between stochastic gradient descent and adaptive gradient methods

PANDAS: Improving Many-shot Jailbreaking via Positive Affirmation, Negative Demonstration, and Adaptive Sampling

Position: Reinforcement Learning in Dynamic Treatment Regimes Needs Critical Reexamination