Yu Liu

105
Papers
636
Total Citations

Papers (105)

Combinatorial Multi-Armed Bandit with General Reward Functions

NeurIPS 2016arXiv
146
citations

Ranni: Taming Text-to-Image Diffusion for Accurate Instruction Following

CVPR 2024
78
citations

Space Group Constrained Crystal Generation

ICLR 2024
60
citations

Learning Where to Focus for Efficient Video Object Detection

ECCV 2020
60
citations

Universal Actions for Enhanced Embodied Foundation Models

CVPR 2025
42
citations

SmartRefine: A Scenario-Adaptive Refinement Framework for Efficient Motion Prediction

CVPR 2024
38
citations

Rethinking the Spatial Inconsistency in Classifier-Free Diffusion Guidance

CVPR 2024
35
citations

A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning

AAAI 2024arXiv
25
citations

Fast Context-Based Low-Light Image Enhancement via Neural Implicit Representations

ECCV 2024
24
citations

Learning Disentangled Identifiers for Action-Customized Text-to-Image Generation

CVPR 2024
23
citations

Lipschitz Singularities in Diffusion Models

ICLR 2024
21
citations

Improved Video VAE for Latent Video Diffusion Model

CVPR 2025arXiv
19
citations

Decompositional Neural Scene Reconstruction with Generative Diffusion Prior

CVPR 2025
18
citations

Novel Class Discovery for Ultra-Fine-Grained Visual Categorization

CVPR 2024
13
citations

TACO: Taming Diffusion for in-the-wild Video Amodal Completion

ICCV 2025
9
citations

Elegantly Written: Disentangling Writer and Character Styles for Enhancing Online Chinese Handwriting

ECCV 2024
6
citations

Unsupervised Sequence Classification using Sequential Output Statistics

NeurIPS 2017arXiv
5
citations

IDEA-Bench: How Far are Generative Models from Professional Designing?

CVPR 2025
4
citations

BACON: Improving Clarity of Image Captions via Bag-of-Concept Graphs

CVPR 2025
3
citations

NTR-Gaussian: Nighttime Dynamic Thermal Reconstruction with 4D Gaussian Splatting Based on Thermodynamics

CVPR 2025arXiv
2
citations

AUC Optimization from Multiple Unlabeled Datasets

AAAI 2024arXiv
2
citations

See Further When Clear: Curriculum Consistency Model

CVPR 2025
2
citations

Effect Size Estimation for Duration Recommendation in Online Experiments: Leveraging Hierarchical Models and Objective Utility Approaches

AAAI 2024arXiv
1
citations

StrokeNUWA—Tokenizing Strokes for Vector Graphic Synthesis

ICML 2024
0
citations

CCM: Real-Time Controllable Visual Content Creation Using Text-to-Image Consistency Models

ICML 2024
0
citations

DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning

ICML 2024
0
citations

Learning Relaxed Deep Supervision for Better Edge Detection

CVPR 2016
0
citations

Quality Aware Network for Set to Set Recognition

CVPR 2017arXiv
0
citations

Scale-Aware Face Detection

CVPR 2017arXiv
0
citations

Hyperparameter Optimization for Tracking With Continuous Deep Q-Learning

CVPR 2018
0
citations

MoNet: Deep Motion Exploitation for Video Object Segmentation

CVPR 2018
0
citations

Exploring Disentangled Feature Representation Beyond Face Identification

CVPR 2018arXiv
0
citations

Beyond Trade-Off: Accelerate FCN-Based Face Detector With Higher Accuracy

CVPR 2018arXiv
0
citations

RGBD Based Dimensional Decomposition Residual Network for 3D Semantic Scene Completion

CVPR 2019
0
citations

Conditional Adversarial Generative Flow for Controllable Image Synthesis

CVPR 2019
0
citations

Anisotropic Convolutional Networks for 3D Semantic Scene Completion

CVPR 2020arXiv
0
citations

Rotate-and-Render: Unsupervised Photorealistic Face Rotation From Single-View Images

CVPR 2020
0
citations

Search to Distill: Pearls Are Everywhere but Not the Eyes

CVPR 2020arXiv
0
citations

DPGN: Distribution Propagation Graph Network for Few-Shot Learning

CVPR 2020arXiv
0
citations

Revisiting the Sibling Head in Object Detector

CVPR 2020arXiv
0
citations

Communication Efficient SGD via Gradient Sampling With Bayes Prior

CVPR 2021
0
citations

Actor-Context-Actor Relation Network for Spatio-Temporal Action Localization

CVPR 2021arXiv
0
citations

Lifelong Person Re-Identification via Adaptive Knowledge Accumulation

CVPR 2021arXiv
0
citations

Self-Supervised Video Representation Learning by Context and Motion Decoupling

CVPR 2021arXiv
0
citations

Segment, Magnify and Reiterate: Detecting Camouflaged Objects the Hard Way

CVPR 2022
0
citations

MixMAE: Mixed and Masked Autoencoder for Efficient Pretraining of Hierarchical Vision Transformers

CVPR 2023arXiv
0
citations

Long-Term Visual Localization With Mobile Sensors

CVPR 2023arXiv
0
citations

Dimensionality-Varying Diffusion Process

CVPR 2023arXiv
0
citations

ReasonNet: End-to-End Driving With Temporal and Global Reasoning

CVPR 2023
0
citations

Recurrent Scale Approximation for Object Detection in CNN

ICCV 2017arXiv
0
citations

Learning a Recurrent Residual Fusion Network for Multimodal Matching

ICCV 2017
0
citations

Knowledge Distillation via Route Constrained Optimization

ICCV 2019
0
citations

Exploiting Temporal Consistency for Real-Time Video Depth Estimation

ICCV 2019
0
citations

Differentiable Kernel Evolution

ICCV 2019
0
citations

Correlation Congruence for Knowledge Distillation

ICCV 2019
0
citations

Scalable Place Recognition Under Appearance Change for Autonomous Driving

ICCV 2019
0
citations

Switchable K-Class Hyperplanes for Noise-Robust Representation Learning

ICCV 2021
0
citations

DETRs with Collaborative Hybrid Assignments Training

ICCV 2023arXiv
0
citations

Decoupled DETR: Spatially Disentangling Localization and Classification for Improved End-to-End Object Detection

ICCV 2023
0
citations

Generating Dynamic Kernels via Transformers for Lane Detection

ICCV 2023
0
citations

GeoMIM: Towards Better 3D Knowledge Transfer via Masked Image Modeling for Multi-view 3D Understanding

ICCV 2023arXiv
0
citations

UniKD: Universal Knowledge Distillation for Mimicking Homogeneous or Heterogeneous Object Detectors

ICCV 2023
0
citations

Efficient-VQGAN: Towards High-Resolution Image Generation with Efficient Vision Transformers

ICCV 2023
0
citations

3D Semantic Subspace Traverser: Empowering 3D Generative Model with Shape Editing Capability

ICCV 2023arXiv
0
citations

Deep Active Contours for Real-time 6-DoF Object Tracking

ICCV 2023
0
citations

Temporal Enhanced Training of Multi-view 3D Object Detector via Historical Object Prediction

ICCV 2023arXiv
0
citations

Discriminability Distillation in Group Representation Learning

ECCV 2020
0
citations

More Classifiers, Less Forgetting: A Generic Multi-classifier Paradigm for Incremental Learning

ECCV 2020
0
citations

Camera Auto-Calibration from the Steiner Conic of the Fundamental Matrix

ECCV 2022
0
citations

Unifying Visual Perception by Dispersible Points Learning

ECCV 2022
0
citations

Self-Slimmed Vision Transformer

ECCV 2022
0
citations

Rethinking Robust Representation Learning under Fine-Grained Noisy Faces

ECCV 2022
0
citations

Towards Robust Face Recognition with Comprehensive Search

ECCV 2022
0
citations

GeoAug: Data Augmentation for Few-Shot NeRF with Geometry Constraints

ECCV 2022
0
citations

"UniNet: Unified Architecture Search with Convolution, Transformer, and MLP"

ECCV 2022
0
citations

TokenMix: Rethinking Image Mixing for Data Augmentation in Vision Transformers

ECCV 2022
0
citations

Masked Autoencoders Are Stronger Knowledge Distillers

ICCV 2023
0
citations

MOVIS: Enhancing Multi-Object Novel View Synthesis for Indoor Scenes

CVPR 2025
0
citations

MangaNinja: Line Art Colorization with Precise Reference Following

CVPR 2025
0
citations

DiffDoctor: Diagnosing Image Diffusion Models Before Treating

ICCV 2025
0
citations

Is Meta-Learning Out? Rethinking Unsupervised Few-Shot Classification with Limited Entropy

ICCV 2025
0
citations

ICE-Bench: A Unified and Comprehensive Benchmark for Image Creating and Editing

ICCV 2025
0
citations

VACE: All-in-One Video Creation and Editing

ICCV 2025
0
citations

Pretrained Reversible Generation as Unsupervised Visual Representation Learning

ICCV 2025
0
citations

LoD-Loc v2: Aerial Visual Localization over Low Level-of-Detail City Models using Explicit Silhouette Alignment

ICCV 2025
0
citations

UniFuse: A Unified All-in-One Framework for Multi-Modal Medical Image Fusion Under Diverse Degradations and Misalignments

ICCV 2025
0
citations

Improving Pointing Accuracy for 3D Target Selection in Virtual Reality through Depth Perception Biases Correction

ISMAR 2025
0
citations

As Pseudo-Label Free as Possible: Leveraging Adaptive Feature Generation for Sparsely Annotated Object Detection

AAAI 2025
0
citations

CI-STHPAN: Pre-trained Attention Network for Stock Selection with Channel-Independent Spatio-Temporal Hypergraph

AAAI 2024
0
citations

Critic-Guided Decision Transformer for Offline Reinforcement Learning

AAAI 2024
0
citations

GMP-AR: Granularity Message Passing and Adaptive Reconciliation for Temporal Hierarchy Forecasting

AAAI 2024arXiv
0
citations

Causality-Inspired Invariant Representation Learning for Text-Based Person Retrieval

AAAI 2024
0
citations

Check Locate Rectify: A Training-Free Layout Calibration System for Text-to-Image Generation

CVPR 2024
0
citations

AnyDoor: Zero-shot Object-level Image Customization

CVPR 2024
0
citations

GLID: Pre-training a Generalist Encoder-Decoder Vision Model

CVPR 2024
0
citations

EasyDrag: Efficient Point-based Manipulation on Diffusion Models

CVPR 2024
0
citations

LMDrive: Closed-Loop End-to-End Driving with Large Language Models

CVPR 2024
0
citations

CPGA: Coding Priors-Guided Aggregation Network for Compressed Video Quality Enhancement

CVPR 2024
0
citations

DreamVideo: Composing Your Dream Videos with Customized Subject and Motion

CVPR 2024
0
citations

Derivative Estimation in Random Design

NeurIPS 2018
0
citations

Large-batch Optimization for Dense Visual Predictions: Training Faster R-CNN in 4.2 Minutes

NeurIPS 2022
0
citations

LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios

NeurIPS 2023
0
citations

RAPHAEL: Text-to-Image Generation via Large Mixture of Diffusion Paths

NeurIPS 2023
0
citations

Customizable Image Synthesis with Multiple Subjects

NeurIPS 2023
0
citations

K-Means Clustering with Distributed Dimensions

ICML 2016
0
citations