Yudong Chen

22
Papers
296
Total Citations

Papers (22)

Fast Algorithms for Robust PCA via Gradient Descent

NeurIPS 2016 (arXiv)
275
citations

Soft Reasoning: Navigating Solution Spaces in Large Language Models through Controlled Embedding Exploration

ICML 2025
8
citations

LoRA-One: One-Step Full Gradient Could Suffice for Fine-Tuning Large Language Models, Provably and Efficiently

ICML 2025
8
citations

Effectiveness of Constant Stepsize in Markovian LSA and Statistical Inference

AAAI 2024 (arXiv)
4
citations

Stable Offline Value Function Learning with Bisimulation-based Representations

ICML 2025
1
citation

Medium-Difficulty Samples Constitute Smoothed Decision Boundary for Knowledge Distillation on Pruned Datasets

ICLR 2025
0
citations

The $\varphi$ Curve: The Shape of Generalization through the Lens of Norm-based Capacity Control

NeurIPS 2025
0
citations

Learning to Stabilize Online Reinforcement Learning in Unbounded State Spaces

ICML 2024
0
citations

Minimally Modifying a Markov Game to Achieve Any Nash Equilibrium and Value

ICML 2024
0
citations

Tensor Robust Principal Component Analysis: Exact Recovery of Corrupted Low-Rank Tensors via Convex Optimization

CVPR 2016
0
citations

Deep Supervised Hashing With Anchor Graph

ICCV 2019
0
citations

Defending Against Saddle Point Attack in Byzantine-Robust Distributed Learning

ICML 2019
0
citations

Factor Group-Sparse Regularization for Efficient Low-Rank Matrix Recovery

NeurIPS 2019
0
citations

Global Convergence of Least Squares EM for Demixing Two Log-Concave Densities

NeurIPS 2019
0
citations

Risk-Sensitive Reinforcement Learning: Near-Optimal Risk-Sample Tradeoff in Regret

NeurIPS 2020
0
citations

Exponential Bellman Equation and Improved Regret Bounds for Risk-Sensitive Reinforcement Learning

NeurIPS 2021
0
citations

Rank Overspecified Robust Matrix Recovery: Subgradient Method and Exact Recovery

NeurIPS 2021
0
citations

Curriculum Disentangled Recommendation with Noisy Multi-feedback

NeurIPS 2021
0
citations

Improved Feature Distillation via Projector Ensemble

NeurIPS 2022
0
citations

Restless Bandits with Average Reward: Breaking the Uniform Global Attractor Assumption

NeurIPS 2023
0
citations

A Convex Optimization Framework for Bi-Clustering

ICML 2015
0
citations

Byzantine-Robust Distributed Learning: Towards Optimal Statistical Rates

ICML 2018
0
citations