"large-scale pretraining" Papers
2 papers found
HELM: Hyperbolic Large Language Models via Mixture-of-Curvature Experts
Neil He, Rishabh Anand, Hiren Madhu et al.
NeurIPS 2025posterarXiv:2505.24722
8
citations
Position: Considerations for Differentially Private Learning with Large-Scale Public Pretraining
Florian Tramer, Gautam Kamath, Nicholas Carlini
ICML 2024poster