Hanlin Zhang
Papers: 4
Total citations: 37
Papers (4)
How Does Critical Batch Size Scale in Pre-training?
ICLR 2025, arXiv
37 citations
Follow My Instruction and Spill the Beans: Scalable Data Extraction from Retrieval-Augmented Generation Systems
ICLR 2025, arXiv
0 citations
Eliminating Position Bias of Language Models: A Mechanistic Approach
ICLR 2025, arXiv
0 citations
EvoLM: In Search of Lost Language Model Training Dynamics
NeurIPS 2025, arXiv
0 citations