🧬Generative Models

Generative Modeling

General generative model theory and methods

100 papers6,239 total citations
Compare with other topics
Mar '24 Feb '26929 papers
Also includes: generative models, generative modeling, density estimation

Top Papers

#1

VBench: Comprehensive Benchmark Suite for Video Generative Models

Ziqi Huang, Yinan He, Jiashuo Yu et al.

CVPR 2024arXiv:2311.17982
996
citations
#2

A Generalist Agent

Jackie Kay, Sergio Gómez Colmenarejo, Mahyar Bordbar et al.

ICLR 2024
978
citations
#3

Generative Multimodal Models are In-Context Learners

Quan Sun, Yufeng Cui, Xiaosong Zhang et al.

CVPR 2024arXiv:2312.13286
422
citations
#4

Model Tells You What to Discard: Adaptive KV Cache Compression for LLMs

Suyu Ge, Yunan Zhang, Liyuan Liu et al.

ICLR 2024arXiv:2310.01801
372
citations
#5

Generative Verifiers: Reward Modeling as Next-Token Prediction

Lunjun Zhang, Arian Hosseini, Hritik Bansal et al.

ICLR 2025
348
citations
#6

Generative Representational Instruction Tuning

Niklas Muennighoff, Hongjin SU, Liang Wang et al.

ICLR 2025arXiv:2402.09906
instruction tuningtext embeddinggenerative tasksretrieval-augmented generation+3
212
citations
#7

GART: Gaussian Articulated Template Models

Jiahui Lei, Yufu Wang, Georgios Pavlakos et al.

CVPR 2024arXiv:2311.16099
129
citations
#8

GenSim: Generating Robotic Simulation Tasks via Large Language Models

Lirui Wang, Yiyang Ling, Zhecheng Yuan et al.

ICLR 2024arXiv:2310.01361
120
citations
#9

Generative Image Dynamics

Zhengqi Li, Richard Tucker, Noah Snavely et al.

CVPR 2024arXiv:2309.07906
93
citations
#10

DreamBench++: A Human-Aligned Benchmark for Personalized Image Generation

Yuang Peng, Yuxin Cui, Haomiao Tang et al.

ICLR 2025
91
citations
#11

EMAGE: Towards Unified Holistic Co-Speech Gesture Generation via Expressive Masked Audio Gesture Modeling

Haiyang Liu, Zihao Zhu, Giorgio Becherini et al.

CVPR 2024arXiv:2401.00374
78
citations
#12

Improving Diffusion Models for Authentic Virtual Try-on in the Wild

Choi Yisol, Sangkyung Kwak, Kyungmin Lee et al.

ECCV 2024
72
citations
#13

MaskBit: Embedding-free Image Generation via Bit Tokens

Mark Weber, Lijun Yu, Qihang Yu et al.

ICLR 2025arXiv:2409.16211
72
citations
#14

Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation

Fanqing Meng, Jiaqi Liao, Xinyu Tan et al.

ICML 2025arXiv:2410.05363
72
citations
#15

OmniGlue: Generalizable Feature Matching with Foundation Model Guidance

Hanwen Jiang, Arjun Karpur, Bingyi Cao et al.

CVPR 2024arXiv:2405.12979
68
citations
#16

GIVT: Generative Infinite-Vocabulary Transformers

Michael Tschannen, Cian Eastwood, Fabian Mentzer

ECCV 2024arXiv:2312.02116
generative transformersvector sequence generationmultivariate gaussian mixture modelslatent diffusion models+4
63
citations
#17

ChartMimic: Evaluating LMM's Cross-Modal Reasoning Capability via Chart-to-Code Generation

Cheng Yang, Chufan Shi, Yaxin Liu et al.

ICLR 2025arXiv:2406.09961
chart-to-code generationmultimodal reasoning evaluationvisual understandingcode generation+4
63
citations
#18

Proteina: Scaling Flow-based Protein Structure Generative Models

Tomas Geffner, Kieran Didi, Zuobai Zhang et al.

ICLR 2025
53
citations
#19

IntrinsicAnything: Learning Diffusion Priors for Inverse Rendering Under Unknown Illumination

Xi Chen, Sida Peng, Dongchen Yang et al.

ECCV 2024arXiv:2404.11593
inverse renderingmaterial recoverydiffusion priorsunknown illumination+4
52
citations
#20

AvatarGPT: All-in-One Framework for Motion Understanding Planning Generation and Beyond

Zixiang Zhou, Yu Wan, Baoyuan Wang

CVPR 2024
52
citations
#21

Intriguing Properties of Generative Classifiers

Priyank Jaini, Kevin Clark, Robert Geirhos

ICLR 2024arXiv:2309.16779
51
citations
#22

Physics-Informed Diffusion Models

Jan-Hendrik Bastek, WaiChing Sun, Dennis Kochmann

ICLR 2025arXiv:2403.14404
diffusion modelsphysics-informed learningpartial differential equationsgenerative modeling+4
50
citations
#23

From Zero to Turbulence: Generative Modeling for 3D Flow Simulation

Marten Lienen, David Lüdke, Jan Hansen-Palmus et al.

ICLR 2024arXiv:2306.01776
49
citations
#24

DGInStyle: Domain-Generalizable Semantic Segmentation with Image Diffusion Models and Stylized Semantic Control

Yuru Jia, Lukas Hoyer, Shengyu Huang et al.

ECCV 2024
48
citations
#25

Local Search GFlowNets

Minsu Kim, Yun Taeyoung, Emmanuel Bengio et al.

ICLR 2024arXiv:2310.02710
48
citations
#26

Energy-Based Diffusion Language Models for Text Generation

Minkai Xu, Tomas Geffner, Karsten Kreis et al.

ICLR 2025arXiv:2410.21357
48
citations
#27

GAIA: Zero-shot Talking Avatar Generation

Tianyu He, Junliang Guo, Runyi Yu et al.

ICLR 2024
46
citations
#28

DrivingGPT: Unifying Driving World Modeling and Planning with Multi-modal Autoregressive Transformers

Yuntao Chen, Yuqi Wang, Zhaoxiang Zhang

ICCV 2025arXiv:2412.18607
44
citations
#29

Generative Proxemics: A Prior for 3D Social Interaction from Images

Vickie Ye, Vickie Ye, Georgios Pavlakos et al.

CVPR 2024arXiv:2306.09337
41
citations
#30

Generator Matching: Generative modeling with arbitrary Markov processes

Peter Holderrieth, Marton Havasi, Jason Yim et al.

ICLR 2025arXiv:2410.20587
generative modelingmarkov processesdiffusion modelsflow matching+4
40
citations
#31

DetDiffusion: Synergizing Generative and Perceptive Models for Enhanced Data Generation and Perception

Yibo Wang, Ruiyuan Gao, Kai Chen et al.

CVPR 2024arXiv:2403.13304
39
citations
#32

TestGenEval: A Real World Unit Test Generation and Test Completion Benchmark

Kush Jain, Gabriel Synnaeve, Baptiste Roziere

ICLR 2025arXiv:2410.00752
39
citations
#33

UGG: Unified Generative Grasping

Jiaxin Lu, Hao Kang, Haoxiang Li et al.

ECCV 2024
35
citations
#34

Active Generalized Category Discovery

Shijie Ma, Fei Zhu, Zhun Zhong et al.

CVPR 2024arXiv:2403.04272
34
citations
#35

Rethinking Generalizable Face Anti-spoofing via Hierarchical Prototype-guided Distribution Refinement in Hyperbolic Space

Chengyang Hu, Ke-Yue Zhang, Taiping Yao et al.

CVPR 2024
32
citations
#36

Rethinking Graph Masked Autoencoders through Alignment and Uniformity

Liang Wang, Xiang Tao, Qiang Liu et al.

AAAI 2024arXiv:2402.07225
graph masked autoencodersgraph contrastive learningself-supervised learningalignment and uniformity+3
32
citations
#37

Jointly Training Large Autoregressive Multimodal Models

Emanuele Aiello, Lili Yu, Yixin Nie et al.

ICLR 2024arXiv:2309.15564
32
citations
#38

PosterLlama: Bridging Design Ability of Langauge Model to Content-Aware Layout Generation

Jaejung Seol, Seojun Kim, Jaejun Yoo

ECCV 2024arXiv:2404.00995
layout generationcontent-aware designhtml code generationlanguage model knowledge+4
30
citations
#39

Masked Generative Video-to-Audio Transformers with Enhanced Synchronicity

Santiago Pascual, Chunghsin YEH, Ioannis Tsiamas et al.

ECCV 2024arXiv:2407.10387
video-to-audio generationaudio-visual synchronizationgenerative audio codecmasked generative model+2
30
citations
#40

LANTERN: Accelerating Visual Autoregressive Models with Relaxed Speculative Decoding

Doohyuk Jang, Sihwan Park, June Yong Yang et al.

ICLR 2025arXiv:2410.03355
29
citations
#41

OmniPhysGS: 3D Constitutive Gaussians for General Physics-Based Dynamics Generation

Yuchen Lin, Chenguo Lin, Jianjin Xu et al.

ICLR 2025
29
citations
#42

Generalization Analysis of Machine Learning Algorithms via the Worst-Case Data-Generating Probability Measure

Xinying Zou, Samir Perlaza, Inaki Esnaola et al.

AAAI 2024arXiv:2312.12236
worst-case probability measuregeneralization gap analysisgibbs probability measureexpected loss sensitivity+4
26
citations
#43

Generative Rendering: Controllable 4D-Guided Video Generation with 2D Diffusion Models

Shengqu Cai, Duygu Ceylan, Matheus Gadelha et al.

CVPR 2024arXiv:2312.01409
26
citations
#44

Efficient and Scalable Graph Generation through Iterative Local Expansion

Andreas Bergmeister, Karolis Martinkus, Nathanaël Perraudin et al.

ICLR 2024arXiv:2312.11529
26
citations
#45

EgoGen: An Egocentric Synthetic Data Generator

Gen Li, Kaifeng Zhao, Siwei Zhang et al.

CVPR 2024arXiv:2401.08739
24
citations
#46

Whole-Song Hierarchical Generation of Symbolic Music Using Cascaded Diffusion Models

Ziyu Wang, Lejun Min, Gus Xia

ICLR 2024arXiv:2405.09901
24
citations
#47

Template Free Reconstruction of Human-object Interaction with Procedural Interaction Generation

Xianghui Xie, Bharat Lal Bhatnagar, Jan Lenssen et al.

CVPR 2024arXiv:2312.07063
24
citations
#48

Semantic-Guided Generative Image Augmentation Method with Diffusion Models for Image Classification

Bohan Li, Xiao Xu, Xinghao Wang et al.

AAAI 2024arXiv:2302.02070
image augmentationdiffusion modelssemantic consistencyimage classification+2
24
citations
#49

JetFormer: An autoregressive generative model of raw images and text

Michael Tschannen, André Susano Pinto, Alexander Kolesnikov

ICLR 2025
23
citations
#50

SGFormer: Semantic Graph Transformer for Point Cloud-Based 3D Scene Graph Generation

Changsheng Lv, Mengshi Qi, Xia Li et al.

AAAI 2024arXiv:2303.11048
3d scene graph generationpoint cloud parsingsemantic graph transformerglobal information passing+4
23
citations
#51

Would Deep Generative Models Amplify Bias in Future Models?

Tianwei Chen, Yusuke Hirota, Mayu Otani et al.

CVPR 2024arXiv:2404.03242
23
citations
#52

DIM: Dyadic Interaction Modeling for Social Behavior Generation

Minh Tran, Di Chang, Maksim Siniukov et al.

ECCV 2024
dyadic interaction modelingsocial behavior generation3d facial motioncontrastive learning+4
22
citations
#53

A Multi-Modal Contrastive Diffusion Model for Therapeutic Peptide Generation

Yongkang Wang, Xuan Liu, Feng Huang et al.

AAAI 2024arXiv:2312.15665
therapeutic peptide generationmulti-modal fusioncontrastive learningdiffusion models+3
22
citations
#54

GlyphDraw2: Automatic Generation of Complex Glyph Posters with Diffusion Models and Large Language Models

Jian Ma, Yonglin Deng, Chen Chen et al.

AAAI 2025arXiv:2407.02252
22
citations
#55

SLEDGE: Synthesizing Driving Environments with Generative Models and Rule-Based Traffic

Kashyap Chitta, Daniel Dauner, Andreas Geiger

ECCV 2024arXiv:2403.17933
driving environment synthesisvehicle motion planninggenerative simulationlane graph representation+4
21
citations
#56

DataGen: Unified Synthetic Dataset Generation via Large Language Models

Yue Huang, Siyuan Wu, Chujie Gao et al.

ICLR 2025arXiv:2406.18966
synthetic data generationlarge language modelsretrieval-augmented generationlabel verification+4
21
citations
#57

Generative Video Propagation

Shaoteng Liu, Tianyu Wang, Jui-Hsien Wang et al.

CVPR 2025
20
citations
#58

Towards Open Domain Text-Driven Synthesis of Multi-Person Motions

Shan Mengyi, Lu Dong, Yutao Han et al.

ECCV 2024
20
citations
#59

ExeDec: Execution Decomposition for Compositional Generalization in Neural Program Synthesis

Kensen Shi, Joey Hong, Yinlin Deng et al.

ICLR 2024arXiv:2307.13883
20
citations
#60

On the Closed-Form of Flow Matching: Generalization Does Not Arise from Target Stochasticity

Quentin Bertrand, Anne Gagneux, Mathurin Massias et al.

NeurIPS 2025
19
citations
#61

Distinguished In Uniform: Self-Attention Vs. Virtual Nodes

Eran Rosenbluth, Jan Tönshoff, Martin Ritzert et al.

ICLR 2024
19
citations
#62

To Grok or not to Grok: Disentangling Generalization and Memorization on Corrupted Algorithmic Datasets

Darshil Doshi, Aritra Das, Tianyu He et al.

ICLR 2024
19
citations
#63

Adaptive Message Passing: A General Framework to Mitigate Oversmoothing, Oversquashing, and Underreaching

Federico Errica, Henrik Christiansen, Viktor Zaverkin et al.

ICML 2025
19
citations
#64

Neuroformer: Multimodal and Multitask Generative Pretraining for Brain Data

Antonis Antoniades, Yiyi Yu, Joe Canzano et al.

ICLR 2024arXiv:2311.00136
19
citations
#65

Exploring Diverse Representations for Open Set Recognition

Yu Wang, Junxian Mu, Pengfei Zhu et al.

AAAI 2024arXiv:2401.06521
open set recognitionattention diversity regularizationmulti-expert fusiondiscriminative models+4
18
citations
#66

GEVRM: Goal-Expressive Video Generation Model For Robust Visual Manipulation

Hongyin Zhang, Pengxiang Ding, Shangke Lyu et al.

ICLR 2025
17
citations
#67

SynFlowNet: Design of Diverse and Novel Molecules with Synthesis Constraints

Miruna Cretu, Charles Harris, Ilia Igashov et al.

ICLR 2025arXiv:2405.01155
generative modelscomputer-aided drug designsynthetic accessibilitychemical reaction space+4
17
citations
#68

Controlling Language and Diffusion Models by Transporting Activations

Pau Rodriguez, Arno Blaas, Michal Klein et al.

ICLR 2025arXiv:2410.23054
17
citations
#69

Towards Effective Usage of Human-Centric Priors in Diffusion Models for Text-based Human Image Generation

Junyan Wang, Zhenhong Sun, Stewart Tan et al.

CVPR 2024arXiv:2403.05239
17
citations
#70

Neighboring Autoregressive Modeling for Efficient Visual Generation

Yefei He, Yuanyu He, Shaoxuan He et al.

ICCV 2025arXiv:2503.10696
16
citations
#71

A Unified Comparative Study with Generalized Conformity Scores for Multi-Output Conformal Regression

Victor Dheur, Matteo Fontana, Yorick Estievenart et al.

ICML 2025
16
citations
#72

Learning Interpretable Hierarchical Dynamical Systems Models from Time Series Data

Manuel Brenner, Elias Weber, Georgia Koppe et al.

ICLR 2025arXiv:2410.04814
dynamical systems reconstructionhierarchical modelingtime series analysismulti-domain learning+4
16
citations
#73

Is Artificial Intelligence Generated Image Detection a Solved Problem?

Ziqiang Li, Jiazhen Yan, Ziwen He et al.

NeurIPS 2025arXiv:2505.12335
ai-generated image detectiongenerative model robustnessdeepfake detectionimage forensics+3
15
citations
#74

Boosting Generative Image Modeling via Joint Image-Feature Synthesis

Theodoros Kouzelis, Efstathios Karypidis, Ioannis Kakogeorgiou et al.

NeurIPS 2025
15
citations
#75

Multimarginal Generative Modeling with Stochastic Interpolants

Michael Albergo, Nicholas Boffi, Michael Lindsey et al.

ICLR 2024
15
citations
#76

AutoToM: Scaling Model-based Mental Inference via Automated Agent Modeling

Zhining Zhang, Chuanyang Jin, Mung Yao Jia et al.

NeurIPS 2025arXiv:2502.15676
15
citations
#77

Neural-Symbolic Recursive Machine for Systematic Generalization

Qing Li, Yixin Zhu, Yitao Liang et al.

ICLR 2024arXiv:2210.01603
14
citations
#78

AutoPartGen: Autoregressive 3D Part Generation and Discovery

Minghao Chen, Jianyuan Wang, Roman Shapovalov et al.

NeurIPS 2025
autoregressive 3d generationpart-based generation3d part discoverycompositional 3d reconstruction+3
14
citations
#79

Generative Flows on Synthetic Pathway for Drug Design

Seonghwan Seo, Minsu Kim, Tony Shen et al.

ICLR 2025arXiv:2410.04542
generative flow networksdrug designmolecular building blockschemical reaction templates+4
14
citations
#80

Low-Light Image Enhancement via Generative Perceptual Priors

Han Zhou, Wei Dong, Xiaohong Liu et al.

AAAI 2025arXiv:2412.20916
14
citations
#81

Weak-to-Strong Generalization Through the Data-Centric Lens

Changho Shin, John Cooper, Frederic Sala

ICLR 2025arXiv:2412.03881
weak-to-strong generalizationoverlap densitydata-centric mechanismsuperalignment+2
14
citations
#82

MERGE: Fast Private Text Generation

Zi Liang, Pinghui Wang, Ruofei Zhang et al.

AAAI 2024arXiv:2305.15769
private inferencetransformer-based modelsnatural language generationcloud model deployment+4
14
citations
#83

Improving Geo-diversity of Generated Images with Contextualized Vendi Score Guidance

Reyhane Askari Hemmat, Melissa Hall, Alicia Yi Sun et al.

ECCV 2024arXiv:2406.04551
text-to-image generationlatent diffusion modelsgeo-diversity biasinference time intervention+4
13
citations
#84

A Periodic Bayesian Flow for Material Generation

Hanlin Wu, Yuxuan Song, Jingjing Gong et al.

ICLR 2025
13
citations
#85

Towards a Theoretical Understanding of Synthetic Data in LLM Post-Training: A Reverse-Bottleneck Perspective

Zeyu Gan, Yong Liu

ICLR 2025arXiv:2410.01720
synthetic data generationlarge language modelspost-training optimizationgeneralization capability+3
13
citations
#86

MaxFusion: Plug&Play Multi-Modal Generation in Text-to-Image Diffusion Models

Nithin Gopalakrishnan Nair, Jeya Maria Jose Valanarasu, Vishal Patel

ECCV 2024
13
citations
#87

Roll the dice & look before you leap: Going beyond the creative limits of next-token prediction

Vaishnavh Nagarajan, Chen Wu, Charles Ding et al.

ICML 2025arXiv:2504.15266
13
citations
#88

Identifiable Exchangeable Mechanisms for Causal Structure and Representation Learning

Patrik Reizinger, Siyuan Guo, Ferenc Huszar et al.

ICLR 2025arXiv:2406.14302
13
citations
#89

Force Prompting: Video Generation Models Can Learn And Generalize Physics-based Control Signals

Nate Gillman, Charles Herrmann, Michael Freeman et al.

NeurIPS 2025
13
citations
#90

Ctrl-U: Robust Conditional Image Generation via Uncertainty-aware Reward Modeling

Guiyu Zhang, Huan-ang Gao, Zijian Jiang et al.

ICLR 2025arXiv:2410.11236
13
citations
#91

Real Appearance Modeling for More General Deepfake Detection

Jiahe Tian, Yu Cai, Xi Wang et al.

ECCV 2024
deepfake detectiongeneralizable detectionreal appearance modelingface disturbance+2
12
citations
#92

∞-Brush: Controllable Large Image Synthesis with Diffusion Models in Infinite Dimensions

Minh Quan Le, Alexandros Graikos, Srikar Yellapragada et al.

ECCV 2024arXiv:2407.14709
diffusion modelsinfinite dimensionslarge image synthesiscross-attention neural operator+4
12
citations
#93

Which Model Generated This Image? A Model-Agnostic Approach for Origin Attribution

Fengyuan Liu, Haochen Luo, Yiming Li et al.

ECCV 2024
12
citations
#94

Direct Discriminative Optimization: Your Likelihood-Based Visual Generative Model is Secretly a GAN Discriminator

Kaiwen Zheng, Yongxin Chen, Huayu Chen et al.

ICML 2025arXiv:2503.01103
12
citations
#95

Generative Classifiers Avoid Shortcut Solutions

Alexander Li, Ananya Kumar, Deepak Pathak

ICLR 2025arXiv:2512.25034
12
citations
#96

Unbounded: A Generative Infinite Game of Character Life Simulation

Jialu Li, Yuanzhen Li, Neal Wadhwa et al.

ICLR 2025
12
citations
#97

How to Synthesize Text Data without Model Collapse?

Xuekai Zhu, Daixuan Cheng, Hengli Li et al.

ICML 2025arXiv:2412.14689
12
citations
#98

RealGeneral: Unifying Visual Generation via Temporal In-Context Learning with Video Models

Yijing Lin, Mengqi Huang, Shuhan Zhuang et al.

ICCV 2025arXiv:2503.10406
temporal in-context learningconditional frame predictionunified visual generationvideo models+4
12
citations
#99

Neural Causal Abstractions

Kevin Xia, Elias Bareinboim

AAAI 2024arXiv:2401.02602
causal abstractions theorycausal inference tasksneural causal modelsrepresentation learning+4
12
citations
#100

Glad: A Streaming Scene Generator for Autonomous Driving

Bin Xie, Yingfei Liu, Tiancai Wang et al.

ICLR 2025arXiv:2503.00045
11
citations