Honglu Fan
4 Papers
420 Total Citations
Papers (4)
YaRN: Efficient Context Window Extension of Large Language Models
ICLR 2024
410
citations
The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text
NeurIPS 2025, arXiv
10
citations
Grokking Group Multiplication with Cosets
ICML 2024
0
citations
Stay on Topic with Classifier-Free Guidance
ICML 2024
0
citations