"synthetic data generation" Papers

104 papers found • Page 2 of 3

Filters:synthetic data generation Clear all

Conference

AAAI 2025 (3,028)COLM 2025 (418)CVPR 2025 (2,873)ICCV 2025 (2,701)ICLR 2025 (3,827)ICML 2025 (3,340)ISMAR 2025 (229)NEURIPS 2025 (5,858)AAAI 2024 (2,289)CVPR 2024 (2,716)ECCV 2024 (2,387)ICLR 2024 (2,297)ICML 2024 (2,635)

Paper Type

poster (24,624)paper (8,558)oral (1,594)spotlight (1,421)highlight (975)

Rethinking the Role of Verbatim Memorization in LLM Privacy

Tom Sander, Bargav Jayaraman, Mark Ibrahim et al.

NEURIPS 2025

RLGF: Reinforcement Learning with Geometric Feedback for Autonomous Driving Video Generation

Tianyi Yan, Wencheng Han, xia zhou et al.

NEURIPS 2025arXiv:2509.16500

citations

RoboTron-Sim: Improving Real-World Driving via Simulated Hard-Case

Baihui Xiao, Chengjian Feng, Zhijian Huang et al.

ICCV 2025arXiv:2508.04642

citations

ROSE: Remove Objects with Side Effects in Videos

Chenxuan Miao, Yutong Feng, Jianshu Zeng et al.

NEURIPS 2025arXiv:2508.18633

citations

RUAGO: Effective and Practical Retain-Free Unlearning via Adversarial Attack and OOD Generator

SangYong Lee, Sangjun Chung, Simon Woo

NEURIPS 2025

Scaling Instruction-tuned LLMs to Million-token Contexts via Hierarchical Synthetic Data Generation

Linda He, Jue Wang, Maurice Weber et al.

ICLR 2025arXiv:2504.12637

citations

SimpleStrat: Diversifying Language Model Generation with Stratification

Justin Wong, Yury Orlovskiy, Alexander Shypula et al.

NEURIPS 2025arXiv:2410.09038

citations

SK-VQA: Synthetic Knowledge Generation at Scale for Training Context-Augmented Multimodal LLMs

Xin Su, Man Luo, Kris Pan et al.

ICML 2025oralarXiv:2406.19593

citations

SonicSim: A customizable simulation platform for speech processing in moving sound source scenarios

Kai Li, Wendi Sang, Chang Zeng et al.

ICLR 2025arXiv:2410.01481

citations

Spend Wisely: Maximizing Post-Training Gains in Iterative Synthetic Data Bootstrapping

Pu Yang, Yunzhen Feng, Ziyuan Chen et al.

NEURIPS 2025spotlightarXiv:2501.18962

citations

SQL-R1: Training Natural Language to SQL Reasoning Model By Reinforcement Learning

Peixian Ma, Xialie Zhuang, Chengjin Xu et al.

NEURIPS 2025arXiv:2504.08600

citations

SynFER: Towards Boosting Facial Expression Recognition with Synthetic Data

Xilin He, Cheng Luo, Xiaole Xian et al.

ICCV 2025arXiv:2410.09865

citations

Synthetic Data is an Elegant GIFT for Continual Vision-Language Models

Bin Wu, Wuxuan Shi, Jinqiao Wang et al.

CVPR 2025arXiv:2503.04229

citations

Synthetic Series-Symbol Data Generation for Time Series Foundation Models

Wenxuan Wang, Kai Wu, yujian li et al.

NEURIPS 2025arXiv:2510.08445

Synthetic Visual Genome

Jae Sung Park, Zixian Ma, Linjie Li et al.

CVPR 2025arXiv:2506.07643

citations

Task-Specific Zero-shot Quantization-Aware Training for Object Detection

Changhao Li, Xinrui Chen, Ji Wang et al.

ICCV 2025arXiv:2507.16782

citations

ToolACE: Winning the Points of LLM Function Calling

Weiwen Liu, Xu Huang, Xingshan Zeng et al.

ICLR 2025arXiv:2409.00920

124

citations

Towards a Theoretical Understanding of Synthetic Data in LLM Post-Training: A Reverse-Bottleneck Perspective

Zeyu Gan, Yong Liu

ICLR 2025arXiv:2410.01720

citations

Training Language Models on Synthetic Edit Sequences Improves Code Synthesis

Ulyana Piterbarg, Lerrel Pinto, Rob Fergus

ICLR 2025arXiv:2410.02749

citations

Triplets Better Than Pairs: Towards Stable and Effective Self-Play Fine-Tuning for LLMs

Yibo Wang, Hai-Long Sun, Guangda Huzhang et al.

NEURIPS 2025arXiv:2601.08198

citations

V2V: Scaling Event-Based Vision through Efficient Video-to-Voxel Simulation

Hanyue Lou, Jinxiu Liang, Minggui Teng et al.

NEURIPS 2025oralarXiv:2505.16797

citations

Valid Inference with Imperfect Synthetic Data

Yewon Byun, Shantanu Gupta, Zachary Lipton et al.

NEURIPS 2025arXiv:2508.06635

citations

Virus Infection Attack on LLMs: Your Poisoning Can Spread "VIA" Synthetic Data

Zi Liang, Qingqing Ye, Xuan Liu et al.

NEURIPS 2025spotlight

VT-FSL: Bridging Vision and Text with LLMs for Few-Shot Learning

Wenhao Li, Qiangchang Wang, Xianjing Meng et al.

NEURIPS 2025arXiv:2509.25033

citations

Zero-Shot Monocular Scene Flow Estimation in the Wild

Yiqing Liang, Abhishek Badki, Hang Su et al.

CVPR 2025arXiv:2501.10357

citations

3DGazeNet: Generalizing Gaze Estimation with Weak Supervision from Synthetic Views

Evangelos Ververas, Polydefkis Gkagkos, Jiankang Deng et al.

ECCV 2024arXiv:2212.02997

citations

AlignDiff: Aligning Diffusion Models for General Few-Shot Segmentation

Ri-Zhao Qiu, Yu-Xiong Wang, Kris Hauser

ECCV 2024

citations

Are Synthetic Data Useful for Egocentric Hand-Object Interaction Detection?

Rosario Leonardi, Antonino Furnari, Francesco Ragusa et al.

ECCV 2024arXiv:2312.02672

citations

CaPS: Collaborative and Private Synthetic Data Generation from Distributed Sources

Sikha Pentyala, Mayana Pereira, Martine De Cock

ICML 2024arXiv:2402.08614

citations

ConSequence: Synthesizing Logically Constrained Sequences for Electronic Health Record Generation

Brandon Theodorou, Shrusti Jain, Cao Xiao et al.

AAAI 2024paperarXiv:2312.05964

citations

Curated LLM: Synergy of LLMs and Data Curation for tabular augmentation in low-data regimes

Nabeel Seedat, Nicolas Huynh, Boris van Breugel et al.

ICML 2024arXiv:2312.12112

citations

CuTS: Customizable Tabular Synthetic Data Generation

Mark Vero, Mislav Balunovic, Martin Vechev

ICML 2024arXiv:2307.03577

citations

Data-to-Model Distillation: Data-Efficient Learning Framework

Ahmad Sajedi, Samir Khaki, Lucy Z. Liu et al.

ECCV 2024arXiv:2411.12841

citations

Descanning: From Scanned to the Original Images with a Color Correction Diffusion Model

Junghun Cha, Ali Haider, Seoyun Yang et al.

AAAI 2024paperarXiv:2402.05350

citations

DetDiffusion: Synergizing Generative and Perceptive Models for Enhanced Data Generation and Perception

Yibo Wang, Ruiyuan Gao, Kai Chen et al.

CVPR 2024arXiv:2403.13304

citations

Differentially Private Sum-Product Networks

Xenia Heilmann, Mattia Cerrato, Ernst Althaus

ICML 2024

Differentially Private Synthetic Data via Foundation Model APIs 2: Text

Chulin Xie, Zinan Lin, Arturs Backurs et al.

ICML 2024spotlightarXiv:2403.01749

citations

DiffuMatting: Synthesizing Arbitrary Objects with Matting-level Annotation

Xiaobin Hu, Xu Peng, Donghao Luo et al.

ECCV 2024arXiv:2403.06168

citations

Distill Gold from Massive Ores: Bi-level Data Pruning towards Efficient Dataset Distillation

YUE XU, Yong-Lu Li, Kaitong Cui et al.

ECCV 2024arXiv:2305.18381

citations

DreamStruct: Understanding Slides and User Interfaces via Synthetic Data Generation

Yi-Hao Peng, Faria Huq, Yue Jiang et al.

ECCV 2024arXiv:2410.00201

citations

EgoGen: An Egocentric Synthetic Data Generator

Gen Li, Kaifeng Zhao, Siwei Zhang et al.

CVPR 2024arXiv:2401.08739

citations

FlexKBQA: A Flexible LLM-Powered Framework for Few-Shot Knowledge Base Question Answering

Zhenyu Li, Sunqi Fan, Yu Gu et al.

AAAI 2024paperarXiv:2308.12060

125

citations

GLoRe: When, Where, and How to Improve LLM Reasoning via Global and Local Refinements

Alexander Havrilla, Sharath Chandra Raparthy, Christoforos Nalmpantis et al.

ICML 2024arXiv:2402.10963

102

citations

Human Pose Recognition via Occlusion-Preserving Abstract Images

Saad Manzur, Wayne B Hayes

ECCV 2024

citations

Image Captioning with Multi-Context Synthetic Data

Feipeng Ma, Y. Zhou, Fengyun Rao et al.

AAAI 2024paperarXiv:2305.18072

citations

PEGASUS: Personalized Generative 3D Avatars with Composable Attributes

Hyunsoo Cha, Byungjun Kim, Hanbyul Joo

CVPR 2024arXiv:2402.10636

citations

Position: Will we run out of data? Limits of LLM scaling based on human-generated data

Pablo Villalobos, Anson Ho, Jaime Sevilla et al.

ICML 2024

PrE-Text: Training Language Models on Private Federated Data in the Age of LLMs

Charlie Hou, Akshat Shrivastava, Hongyuan Zhan et al.

ICML 2024arXiv:2406.02958

citations

Reliability in Semantic Segmentation: Can We Use Synthetic Data?

Thibaut Loiseau, Tuan Hung Vu, Mickael Chen et al.

ECCV 2024arXiv:2312.09231

citations

Sharpness-Aware Data Generation for Zero-shot Quantization

Hoang Dung, Cuong Pham, Trung Le et al.

ICML 2024arXiv:2510.07018

citations

← Previous

1 2 3