NeurIPS 2025 "synthetic data generation" Papers
12 papers found
ChemOrch: Empowering LLMs with Chemical Intelligence via Groundbreaking Synthetic Instructions
Yue Huang, Zhengzhe Jiang, Xiaonan Luo et al.
NeurIPS 2025poster
GRIP: A Graph-Based Reasoning Instruction Producer
Jiankang Wang, Jianjun Xu, Xiaorui Wang et al.
NeurIPS 2025posterarXiv:2412.08864
2
citations
LongVPO: From Anchored Cues to Self-Reasoning for Long-Form Video Preference Optimization
Zhenpeng Huang, Jiaqi Li, zihan jia et al.
NeurIPS 2025poster
PerceptionLM: Open-Access Data and Models for Detailed Visual Understanding
Jang Hyun Cho, Andrea Madotto, Effrosyni Mavroudi et al.
NeurIPS 2025oralarXiv:2504.13180
40
citations
Prismatic Synthesis: Gradient-based Data Diversification Boosts Generalization in LLM Reasoning
Jaehun Jung, Seungju Han, Ximing Lu et al.
NeurIPS 2025spotlightarXiv:2505.20161
15
citations
Rethinking the Role of Verbatim Memorization in LLM Privacy
Tom Sander, Bargav Jayaraman, Mark Ibrahim et al.
NeurIPS 2025poster
RLGF: Reinforcement Learning with Geometric Feedback for Autonomous Driving Video Generation
Tianyi Yan, Wencheng Han, xia zhou et al.
NeurIPS 2025posterarXiv:2509.16500
Synthetic Series-Symbol Data Generation for Time Series Foundation Models
Wenxuan Wang, Kai Wu, yujian li et al.
NeurIPS 2025posterarXiv:2510.08445
Triplets Better Than Pairs: Towards Stable and Effective Self-Play Fine-Tuning for LLMs
Yibo Wang, Hai-Long Sun, Guangda Huzhang et al.
NeurIPS 2025posterarXiv:2601.08198
4
citations
V2V: Scaling Event-Based Vision through Efficient Video-to-Voxel Simulation
Hanyue Lou, Jinxiu Liang, Minggui Teng et al.
NeurIPS 2025oralarXiv:2505.16797
2
citations
Valid Inference with Imperfect Synthetic Data
Yewon Byun, Shantanu Gupta, Zachary Lipton et al.
NeurIPS 2025posterarXiv:2508.06635
Virus Infection Attack on LLMs: Your Poisoning Can Spread "VIA" Synthetic Data
Zi Liang, Qingqing Ye, Xuan Liu et al.
NeurIPS 2025spotlight