Xi Chen

71
Papers
16,944
Total Citations
1
Affiliations

Affiliations

Google Research

Papers (71)

Improved Techniques for Training GANs

NeurIPS 2016arXiv
9,891
citations

InfoGAN: Interpretable Representation Learning by Information Maximizing Generative Adversarial Nets

NeurIPS 2016arXiv
4,424
citations

Improved Variational Inference with Inverse Autoregressive Flow

NeurIPS 2016arXiv
1,936
citations

On Scaling Up a Multilingual Vision and Language Model

CVPR 2024
254
citations

OpenIns3D: Snap and Lookup for 3D Open-vocabulary Instance Segmentation

ECCV 2024arXiv
82
citations

VIME: Variational Information Maximizing Exploration

NeurIPS 2016arXiv
80
citations

UniReal: Universal Image Generation and Editing via Learning Real-world Dynamics

CVPR 2025
70
citations

PolyVoice: Language Models for Speech to Speech Translation

ICLR 2024
29
citations

Learning Disentangled Identifiers for Action-Customized Text-to-Image Generation

CVPR 2024
23
citations

EnvGS: Modeling View-Dependent Appearance with Environment Gaussian

CVPR 2025
16
citations

ViLLa: Video Reasoning Segmentation with Large Language Model

ICCV 2025
16
citations

GOPlan: Goal-conditioned Offline Reinforcement Learning by Planning with Learned Models

ICLR 2025
15
citations

On the Recursive Teaching Dimension of VC Classes

NeurIPS 2016
15
citations

Dual-Window Multiscale Transformer for Hyperspectral Snapshot Compressive Imaging

AAAI 2024
13
citations

MotionStone: Decoupled Motion Intensity Modulation with Diffusion Transformer for Image-to-Video Generation

CVPR 2025
13
citations

Embodied VideoAgent: Persistent Memory from Egocentric Videos and Embodied Sensors Enables Dynamic Scene Understanding

ICCV 2025
11
citations

NoT: Federated Unlearning via Weight Negation

CVPR 2025
11
citations

ObjectMover: Generative Object Movement with Video Prior

CVPR 2025
10
citations

Online Video Understanding: OVBench and VideoChat-Online

CVPR 2025arXiv
9
citations

Asynchronous Federated Clustering with Unknown Number of Clusters

AAAI 2025
8
citations

ROSE: Remove Objects with Side Effects in Videos

NeurIPS 2025
4
citations

Unleashing Diffusion Transformers for Visual Correspondence by Modulating Massive Activations

NeurIPS 2025
4
citations

Exploiting Symmetric Temporally Sparse BPTT for Efficient RNN Training

AAAI 2024arXiv
4
citations

Less or More From Teacher: Exploiting Trilateral Geometry For Knowledge Distillation

ICLR 2024
3
citations

PlayerOne: Egocentric World Simulator

NeurIPS 2025
3
citations

The Distributional Reward Critic Framework for Reinforcement Learning Under Perturbed Rewards

AAAI 2025
0
citations

Decoupling Metacognition from Cognition: A Framework for Quantifying Metacognitive Ability in LLMs

AAAI 2025
0
citations

Disentangled Modeling of Preferences and Social Influence for Group Recommendation

AAAI 2025
0
citations

Calibrated One Round Federated Learning with Bayesian Inference in the Predictive Space

AAAI 2024
0
citations

Reverse Region-to-Entity Annotation for Pixel-Level Visual Entity Linking

AAAI 2025
0
citations

AnyDoor: Zero-shot Object-level Image Customization

CVPR 2024
0
citations

HFF-Tracker: A Hierarchical Fine-grained Fusion Tracker for Referring Multi-Object Tracking

AAAI 2025
0
citations

VTG-LLM: Integrating Timestamp Knowledge into Video LLMs for Enhanced Video Temporal Grounding

AAAI 2025
0
citations

PredToken: Predicting Unknown Tokens and Beyond with Coarse-to-Fine Iterative Decoding

CVPR 2024
0
citations

TC-LLaVA: Rethinking the Transfer of LLava from Image to Video Understanding with Temporal Considerations

AAAI 2025
0
citations

Zero-shot Denoising via Neural Compression: Theoretical and algorithmic framework

NeurIPS 2025
0
citations

Bagged Deep Image Prior for Recovering Images in the Presence of Speckle Noise

ICML 2024
0
citations

Rethinking Generative Large Language Model Evaluation for Semantic Comprehension

ICML 2024
0
citations

Understanding the Training Speedup from Sampling with Approximate Losses

ICML 2024
0
citations

Resolution Adaptive Networks for Efficient Inference

CVPR 2020arXiv
0
citations

State-Aware Tracker for Real-Time Video Object Segmentation

CVPR 2020arXiv
0
citations

FocalClick: Towards Practical Interactive Image Segmentation

CVPR 2022arXiv
0
citations

Dynamically Instance-Guided Adaptation: A Backward-Free Approach for Test-Time Domain Adaptive Semantic Segmentation

CVPR 2023
0
citations

Improving Robust Generalization by Direct PAC-Bayesian Bound Minimization

CVPR 2023arXiv
0
citations

Detecting Everything in the Open World: Towards Universal Object Detection

CVPR 2023arXiv
0
citations

Conditional Diffusion for Interactive Segmentation

ICCV 2021
0
citations

Open-vocabulary Panoptic Segmentation with Embedding Modulation

ICCV 2023arXiv
0
citations

Understanding Hessian Alignment for Domain Generalization

ICCV 2023arXiv
0
citations

PreSTU: Pre-Training for Scene-Text Understanding

ICCV 2023arXiv
0
citations

Autoregressive Uncertainty Modeling for 3D Bounding Box Prediction

ECCV 2022
0
citations

PACTran: PAC-Bayesian Metrics for Estimating the Transferability of Pretrained Models to Classification Tasks

ECCV 2022
0
citations

GenieBlue: Integrating both Linguistic and Multimodal Capabilities for Large Language Models on Mobile Devices

ICCV 2025
0
citations

DiffDoctor: Diagnosing Image Diffusion Models Before Treating

ICCV 2025
0
citations

UniGraspTransformer: Simplified Policy Distillation for Scalable Dexterous Robotic Grasping

CVPR 2025
0
citations

EIDT-V: Exploiting Intersections in Diffusion Trajectories for Model-Agnostic, Zero-Shot, Training-Free Text-to-Video Generation

CVPR 2025
0
citations

MangaNinja: Line Art Colorization with Precise Reference Following

CVPR 2025
0
citations

Near-Optimal Policies for Dynamic Multinomial Logit Assortment Selection Models

NeurIPS 2018
0
citations

Online EXP3 Learning in Adversarial Bandits with Delayed Feedback

NeurIPS 2019
0
citations

Information Theoretic Counterfactual Learning from Missing-Not-At-Random Feedback

NeurIPS 2020
0
citations

Fixed-Support Wasserstein Barycenters: Computational Hardness and Fast Algorithm

NeurIPS 2020
0
citations

Hedging in games: Faster convergence of external and swap regrets

NeurIPS 2020
0
citations

Generalized DataWeighting via Class-Level Gradient Manipulation

NeurIPS 2021
0
citations

Bridging the Gap Between Practice and PAC-Bayes Theory in Few-Shot Meta-Learning

NeurIPS 2021
0
citations

LAPO: Latent-Variable Advantage-Weighted Policy Optimization for Offline Reinforcement Learning

NeurIPS 2022
0
citations

TMT-VIS: Taxonomy-aware Multi-dataset Joint Training for Video Instance Segmentation

NeurIPS 2023
0
citations

Uni3DETR: Unified 3D Detection Transformer

NeurIPS 2023
0
citations

Large-Scale Markov Decision Problems with KL Control Cost and its Application to Crowdsourcing

ICML 2015
0
citations

Benchmarking Deep Reinforcement Learning for Continuous Control

ICML 2016
0
citations

Adaptive Multiple-Arm Identification

ICML 2017
0
citations

Flow++: Improving Flow-Based Generative Models with Variational Dequantization and Architecture Design

ICML 2019
0
citations

Population Based Augmentation: Efficient Learning of Augmentation Policy Schedules

ICML 2019
0
citations