Yudong Chen
Papers: 22
Total citations: 296
Papers (22)
Fast Algorithms for Robust PCA via Gradient Descent
NeurIPS 2016
275
citations
Soft Reasoning: Navigating Solution Spaces in Large Language Models through Controlled Embedding Exploration
ICML 2025
8
citations
LoRA-One: One-Step Full Gradient Could Suffice for Fine-Tuning Large Language Models, Provably and Efficiently
ICML 2025
8
citations
Effectiveness of Constant Stepsize in Markovian LSA and Statistical Inference
AAAI 2024
4
citations
Stable Offline Value Function Learning with Bisimulation-based Representations
ICML 2025
1
citations
Medium-Difficulty Samples Constitute Smoothed Decision Boundary for Knowledge Distillation on Pruned Datasets
ICLR 2025
0
citations
The $\varphi$ Curve: The Shape of Generalization through the Lens of Norm-based Capacity Control
NeurIPS 2025
0
citations
Learning to Stabilize Online Reinforcement Learning in Unbounded State Spaces
ICML 2024
0
citations
Minimally Modifying a Markov Game to Achieve Any Nash Equilibrium and Value
ICML 2024
0
citations
Tensor Robust Principal Component Analysis: Exact Recovery of Corrupted Low-Rank Tensors via Convex Optimization
CVPR 2016
0
citations
Deep Supervised Hashing With Anchor Graph
ICCV 2019
0
citations
Defending Against Saddle Point Attack in Byzantine-Robust Distributed Learning
ICML 2019
0
citations
Factor Group-Sparse Regularization for Efficient Low-Rank Matrix Recovery
NeurIPS 2019
0
citations
Global Convergence of Least Squares EM for Demixing Two Log-Concave Densities
NeurIPS 2019
0
citations
Risk-Sensitive Reinforcement Learning: Near-Optimal Risk-Sample Tradeoff in Regret
NeurIPS 2020
0
citations
Exponential Bellman Equation and Improved Regret Bounds for Risk-Sensitive Reinforcement Learning
NeurIPS 2021
0
citations
Rank Overspecified Robust Matrix Recovery: Subgradient Method and Exact Recovery
NeurIPS 2021
0
citations
Curriculum Disentangled Recommendation with Noisy Multi-feedback
NeurIPS 2021
0
citations
Improved Feature Distillation via Projector Ensemble
NeurIPS 2022
0
citations
Restless Bandits with Average Reward: Breaking the Uniform Global Attractor Assumption
NeurIPS 2023
0
citations
A Convex Optimization Framework for Bi-Clustering
ICML 2015
0
citations
Byzantine-Robust Distributed Learning: Towards Optimal Statistical Rates
ICML 2018
0
citations