Xi Chen
71
Papers
16,944
Total Citations
1
Affiliations
Affiliations
Google Research
Papers (71)
Improved Techniques for Training GANs
NeurIPS 2016arXiv
9,891
citations
InfoGAN: Interpretable Representation Learning by Information Maximizing Generative Adversarial Nets
NeurIPS 2016arXiv
4,424
citations
Improved Variational Inference with Inverse Autoregressive Flow
NeurIPS 2016arXiv
1,936
citations
On Scaling Up a Multilingual Vision and Language Model
CVPR 2024
254
citations
OpenIns3D: Snap and Lookup for 3D Open-vocabulary Instance Segmentation
ECCV 2024arXiv
82
citations
VIME: Variational Information Maximizing Exploration
NeurIPS 2016arXiv
80
citations
UniReal: Universal Image Generation and Editing via Learning Real-world Dynamics
CVPR 2025
70
citations
PolyVoice: Language Models for Speech to Speech Translation
ICLR 2024
29
citations
Learning Disentangled Identifiers for Action-Customized Text-to-Image Generation
CVPR 2024
23
citations
EnvGS: Modeling View-Dependent Appearance with Environment Gaussian
CVPR 2025
16
citations
ViLLa: Video Reasoning Segmentation with Large Language Model
ICCV 2025
16
citations
GOPlan: Goal-conditioned Offline Reinforcement Learning by Planning with Learned Models
ICLR 2025
15
citations
On the Recursive Teaching Dimension of VC Classes
NeurIPS 2016
15
citations
Dual-Window Multiscale Transformer for Hyperspectral Snapshot Compressive Imaging
AAAI 2024
13
citations
MotionStone: Decoupled Motion Intensity Modulation with Diffusion Transformer for Image-to-Video Generation
CVPR 2025
13
citations
Embodied VideoAgent: Persistent Memory from Egocentric Videos and Embodied Sensors Enables Dynamic Scene Understanding
ICCV 2025
11
citations
NoT: Federated Unlearning via Weight Negation
CVPR 2025
11
citations
ObjectMover: Generative Object Movement with Video Prior
CVPR 2025
10
citations
Online Video Understanding: OVBench and VideoChat-Online
CVPR 2025arXiv
9
citations
Asynchronous Federated Clustering with Unknown Number of Clusters
AAAI 2025
8
citations
ROSE: Remove Objects with Side Effects in Videos
NeurIPS 2025
4
citations
Unleashing Diffusion Transformers for Visual Correspondence by Modulating Massive Activations
NeurIPS 2025
4
citations
Exploiting Symmetric Temporally Sparse BPTT for Efficient RNN Training
AAAI 2024arXiv
4
citations
Less or More From Teacher: Exploiting Trilateral Geometry For Knowledge Distillation
ICLR 2024
3
citations
PlayerOne: Egocentric World Simulator
NeurIPS 2025
3
citations
The Distributional Reward Critic Framework for Reinforcement Learning Under Perturbed Rewards
AAAI 2025
0
citations
Decoupling Metacognition from Cognition: A Framework for Quantifying Metacognitive Ability in LLMs
AAAI 2025
0
citations
Disentangled Modeling of Preferences and Social Influence for Group Recommendation
AAAI 2025
0
citations
Calibrated One Round Federated Learning with Bayesian Inference in the Predictive Space
AAAI 2024
0
citations
Reverse Region-to-Entity Annotation for Pixel-Level Visual Entity Linking
AAAI 2025
0
citations
AnyDoor: Zero-shot Object-level Image Customization
CVPR 2024
0
citations
HFF-Tracker: A Hierarchical Fine-grained Fusion Tracker for Referring Multi-Object Tracking
AAAI 2025
0
citations
VTG-LLM: Integrating Timestamp Knowledge into Video LLMs for Enhanced Video Temporal Grounding
AAAI 2025
0
citations
PredToken: Predicting Unknown Tokens and Beyond with Coarse-to-Fine Iterative Decoding
CVPR 2024
0
citations
TC-LLaVA: Rethinking the Transfer of LLava from Image to Video Understanding with Temporal Considerations
AAAI 2025
0
citations
Zero-shot Denoising via Neural Compression: Theoretical and algorithmic framework
NeurIPS 2025
0
citations
Bagged Deep Image Prior for Recovering Images in the Presence of Speckle Noise
ICML 2024
0
citations
Rethinking Generative Large Language Model Evaluation for Semantic Comprehension
ICML 2024
0
citations
Understanding the Training Speedup from Sampling with Approximate Losses
ICML 2024
0
citations
Resolution Adaptive Networks for Efficient Inference
CVPR 2020arXiv
0
citations
State-Aware Tracker for Real-Time Video Object Segmentation
CVPR 2020arXiv
0
citations
FocalClick: Towards Practical Interactive Image Segmentation
CVPR 2022arXiv
0
citations
Dynamically Instance-Guided Adaptation: A Backward-Free Approach for Test-Time Domain Adaptive Semantic Segmentation
CVPR 2023
0
citations
Improving Robust Generalization by Direct PAC-Bayesian Bound Minimization
CVPR 2023arXiv
0
citations
Detecting Everything in the Open World: Towards Universal Object Detection
CVPR 2023arXiv
0
citations
Conditional Diffusion for Interactive Segmentation
ICCV 2021
0
citations
Open-vocabulary Panoptic Segmentation with Embedding Modulation
ICCV 2023arXiv
0
citations
Understanding Hessian Alignment for Domain Generalization
ICCV 2023arXiv
0
citations
PreSTU: Pre-Training for Scene-Text Understanding
ICCV 2023arXiv
0
citations
Autoregressive Uncertainty Modeling for 3D Bounding Box Prediction
ECCV 2022
0
citations
PACTran: PAC-Bayesian Metrics for Estimating the Transferability of Pretrained Models to Classification Tasks
ECCV 2022
0
citations
GenieBlue: Integrating both Linguistic and Multimodal Capabilities for Large Language Models on Mobile Devices
ICCV 2025
0
citations
DiffDoctor: Diagnosing Image Diffusion Models Before Treating
ICCV 2025
0
citations
UniGraspTransformer: Simplified Policy Distillation for Scalable Dexterous Robotic Grasping
CVPR 2025
0
citations
EIDT-V: Exploiting Intersections in Diffusion Trajectories for Model-Agnostic, Zero-Shot, Training-Free Text-to-Video Generation
CVPR 2025
0
citations
MangaNinja: Line Art Colorization with Precise Reference Following
CVPR 2025
0
citations
Near-Optimal Policies for Dynamic Multinomial Logit Assortment Selection Models
NeurIPS 2018
0
citations
Online EXP3 Learning in Adversarial Bandits with Delayed Feedback
NeurIPS 2019
0
citations
Information Theoretic Counterfactual Learning from Missing-Not-At-Random Feedback
NeurIPS 2020
0
citations
Fixed-Support Wasserstein Barycenters: Computational Hardness and Fast Algorithm
NeurIPS 2020
0
citations
Hedging in games: Faster convergence of external and swap regrets
NeurIPS 2020
0
citations
Generalized DataWeighting via Class-Level Gradient Manipulation
NeurIPS 2021
0
citations
Bridging the Gap Between Practice and PAC-Bayes Theory in Few-Shot Meta-Learning
NeurIPS 2021
0
citations
LAPO: Latent-Variable Advantage-Weighted Policy Optimization for Offline Reinforcement Learning
NeurIPS 2022
0
citations
TMT-VIS: Taxonomy-aware Multi-dataset Joint Training for Video Instance Segmentation
NeurIPS 2023
0
citations
Uni3DETR: Unified 3D Detection Transformer
NeurIPS 2023
0
citations
Large-Scale Markov Decision Problems with KL Control Cost and its Application to Crowdsourcing
ICML 2015
0
citations
Benchmarking Deep Reinforcement Learning for Continuous Control
ICML 2016
0
citations
Adaptive Multiple-Arm Identification
ICML 2017
0
citations
Flow++: Improving Flow-Based Generative Models with Variational Dequantization and Architecture Design
ICML 2019
0
citations
Population Based Augmentation: Efficient Learning of Augmentation Policy Schedules
ICML 2019
0
citations