Kaifeng Lyu
6
Papers
344
Total Citations
1
Affiliation
Affiliations
Tsinghua University
Papers (6)
Safety Alignment Should be Made More Than Just a Few Tokens Deep
ICLR 2025
277
citations
RNNs are not Transformers (Yet): The Key Bottleneck on In-Context Retrieval
ICLR 2025
48
citations
A Multi-Power Law for Loss Curve Prediction Across Learning Rate Schedules
ICLR 2025
13
citations
A Quadratic Synchronization Rule for Distributed Deep Learning
ICLR 2024
4
citations
Data Mixing Can Induce Phase Transitions in Knowledge Acquisition
NeurIPS 2025 (arXiv)
2
citations
Adam Reduces a Unique Form of Sharpness: Theoretical Insights Near the Minimizer Manifold
NeurIPS 2025
0
citations