2025 Spotlight "synthetic data generation" Papers
3 papers found
Prismatic Synthesis: Gradient-based Data Diversification Boosts Generalization in LLM Reasoning
Jaehun Jung, Seungju Han, Ximing Lu et al.
NeurIPS 2025spotlightarXiv:2505.20161
15
citations
Spend Wisely: Maximizing Post-Training Gains in Iterative Synthetic Data Bootstrapping
Pu Yang, Yunzhen Feng, Ziyuan Chen et al.
NeurIPS 2025spotlightarXiv:2501.18962
1
citations
Virus Infection Attack on LLMs: Your Poisoning Can Spread "VIA" Synthetic Data
Zi Liang, Qingqing Ye, Xuan Liu et al.
NeurIPS 2025spotlight