Chunyuan Li

38
Papers
1,381
Total Citations

Papers (38)

Variational Autoencoder for Deep Learning of Images, Labels and Captions

NeurIPS 2016arXiv
813
citations

ALICE: Towards Understanding Adversarial Learning for Joint Distribution Matching

NeurIPS 2017arXiv
225
citations

Triangle Generative Adversarial Networks

NeurIPS 2017arXiv
141
citations

Adversarial Symmetric Variational Autoencoder

NeurIPS 2017arXiv
79
citations

Visual In-Context Prompting

CVPR 2024
52
citations

Graphic Design with Large Multimodal Model

AAAI 2025
27
citations

Stochastic Gradient MCMC with Stale Gradients

NeurIPS 2016arXiv
23
citations

Painting with Words: Elevating Detailed Image Captioning with Benchmark and Alignment Learning

ICLR 2025
15
citations

VAE Learning via Stein Variational Gradient Descent

NeurIPS 2017arXiv
6
citations

Unified Contrastive Learning in Image-Text-Label Space

CVPR 2022arXiv
0
citations

Learning Customized Visual Models With Retrieval-Augmented Knowledge

CVPR 2023arXiv
0
citations

GLIGEN: Open-Set Grounded Text-to-Image Generation

CVPR 2023arXiv
0
citations

Generalized Decoding for Pixel, Image, and Language

CVPR 2023arXiv
0
citations

Exploring Robustness of Unsupervised Domain Adaptation in Semantic Segmentation

ICCV 2021arXiv
0
citations

A Simple Framework for Open-Vocabulary Segmentation and Detection

ICCV 2023arXiv
0
citations

Structure-Aware Human-Action Generation

ECCV 2020
0
citations

Oscar: Object-Semantics Aligned Pre-training for Vision-Language Tasks

ECCV 2020
0
citations

Deep Temporal Sigmoid Belief Networks for Sequence Modeling

NeurIPS 2015
0
citations

LLaVA-Critic: Learning to Evaluate Multimodal Models

CVPR 2025
0
citations

Improved Baselines with Visual Instruction Tuning

CVPR 2024
0
citations

Position: TrustLLM: Trustworthiness in Large Language Models

ICML 2024
0
citations

Learning Weight Uncertainty With Stochastic Gradient MCMC for Shape Classification

CVPR 2016
0
citations

Towards Learning a Generic Agent for Vision-and-Language Navigation via Pre-Training

CVPR 2020arXiv
0
citations

Partition-Guided GANs

CVPR 2021arXiv
0
citations

Grounded Language-Image Pre-Training

CVPR 2022arXiv
0
citations

RegionCLIP: Region-Based Language-Image Pretraining

CVPR 2022arXiv
0
citations

Towards Language-Free Training for Text-to-Image Generation

CVPR 2022arXiv
0
citations

Twin Auxilary Classifiers GAN

NeurIPS 2019
0
citations

Focal Attention for Long-Range Interactions in Vision Transformers

NeurIPS 2021
0
citations

Focal Modulation Networks

NeurIPS 2022
0
citations

ELEVATER: A Benchmark and Toolkit for Evaluating Language-Augmented Visual Models

NeurIPS 2022
0
citations

K-LITE: Learning Transferable Visual Models with External Knowledge

NeurIPS 2022
0
citations

LLaVA-Med: Training a Large Language-and-Vision Assistant for Biomedicine in One Day

NeurIPS 2023
0
citations

Visual Instruction Tuning

NeurIPS 2023
0
citations

Large Language Models are Visual Reasoning Coordinators

NeurIPS 2023
0
citations

Adversarial Time-to-Event Modeling

ICML 2018
0
citations

Continuous-Time Flows for Efficient Inference and Density Estimation

ICML 2018
0
citations

Policy Optimization as Wasserstein Gradient Flows

ICML 2018
0
citations