Yudong Chen
8
Papers
21
Total Citations
Papers (8)
LoRA-One: One-Step Full Gradient Could Suffice for Fine-Tuning Large Language Models, Provably and Efficiently
ICML 2025
8
citations
Soft Reasoning: Navigating Solution Spaces in Large Language Models through Controlled Embedding Exploration
ICML 2025
8
citations
Effectiveness of Constant Stepsize in Markovian LSA and Statistical Inference
AAAI 2024arXiv
4
citations
Stable Offline Value Function Learning with Bisimulation-based Representations
ICML 2025
1
citations
Minimally Modifying a Markov Game to Achieve Any Nash Equilibrium and Value
ICML 2024
0
citations
Medium-Difficulty Samples Constitute Smoothed Decision Boundary for Knowledge Distillation on Pruned Datasets
ICLR 2025
0
citations
Learning to Stabilize Online Reinforcement Learning in Unbounded State Spaces
ICML 2024
0
citations
The $\varphi$ Curve: The Shape of Generalization through the Lens of Norm-based Capacity Control
NeurIPS 2025
0
citations