Chunyuan Li
38
Papers
1,381
Total Citations
Papers (38)
Variational Autoencoder for Deep Learning of Images, Labels and Captions
NeurIPS 2016, arXiv
813
citations
ALICE: Towards Understanding Adversarial Learning for Joint Distribution Matching
NeurIPS 2017, arXiv
225
citations
Triangle Generative Adversarial Networks
NeurIPS 2017, arXiv
141
citations
Adversarial Symmetric Variational Autoencoder
NeurIPS 2017, arXiv
79
citations
Visual In-Context Prompting
CVPR 2024
52
citations
Graphic Design with Large Multimodal Model
AAAI 2025
27
citations
Stochastic Gradient MCMC with Stale Gradients
NeurIPS 2016, arXiv
23
citations
Painting with Words: Elevating Detailed Image Captioning with Benchmark and Alignment Learning
ICLR 2025
15
citations
VAE Learning via Stein Variational Gradient Descent
NeurIPS 2017, arXiv
6
citations
Unified Contrastive Learning in Image-Text-Label Space
CVPR 2022, arXiv
0
citations
Learning Customized Visual Models With Retrieval-Augmented Knowledge
CVPR 2023, arXiv
0
citations
GLIGEN: Open-Set Grounded Text-to-Image Generation
CVPR 2023, arXiv
0
citations
Generalized Decoding for Pixel, Image, and Language
CVPR 2023, arXiv
0
citations
Exploring Robustness of Unsupervised Domain Adaptation in Semantic Segmentation
ICCV 2021, arXiv
0
citations
A Simple Framework for Open-Vocabulary Segmentation and Detection
ICCV 2023, arXiv
0
citations
Structure-Aware Human-Action Generation
ECCV 2020
0
citations
Oscar: Object-Semantics Aligned Pre-training for Vision-Language Tasks
ECCV 2020
0
citations
Deep Temporal Sigmoid Belief Networks for Sequence Modeling
NeurIPS 2015
0
citations
LLaVA-Critic: Learning to Evaluate Multimodal Models
CVPR 2025
0
citations
Improved Baselines with Visual Instruction Tuning
CVPR 2024
0
citations
Position: TrustLLM: Trustworthiness in Large Language Models
ICML 2024
0
citations
Learning Weight Uncertainty With Stochastic Gradient MCMC for Shape Classification
CVPR 2016
0
citations
Towards Learning a Generic Agent for Vision-and-Language Navigation via Pre-Training
CVPR 2020, arXiv
0
citations
Partition-Guided GANs
CVPR 2021, arXiv
0
citations
Grounded Language-Image Pre-Training
CVPR 2022, arXiv
0
citations
RegionCLIP: Region-Based Language-Image Pretraining
CVPR 2022, arXiv
0
citations
Towards Language-Free Training for Text-to-Image Generation
CVPR 2022, arXiv
0
citations
Twin Auxiliary Classifiers GAN
NeurIPS 2019
0
citations
Focal Attention for Long-Range Interactions in Vision Transformers
NeurIPS 2021
0
citations
Focal Modulation Networks
NeurIPS 2022
0
citations
ELEVATER: A Benchmark and Toolkit for Evaluating Language-Augmented Visual Models
NeurIPS 2022
0
citations
K-LITE: Learning Transferable Visual Models with External Knowledge
NeurIPS 2022
0
citations
LLaVA-Med: Training a Large Language-and-Vision Assistant for Biomedicine in One Day
NeurIPS 2023
0
citations
Visual Instruction Tuning
NeurIPS 2023
0
citations
Large Language Models are Visual Reasoning Coordinators
NeurIPS 2023
0
citations
Adversarial Time-to-Event Modeling
ICML 2018
0
citations
Continuous-Time Flows for Efficient Inference and Density Estimation
ICML 2018
0
citations
Policy Optimization as Wasserstein Gradient Flows
ICML 2018
0
citations