Yu Liu
105
Papers
636
Total Citations
Papers (105)
Combinatorial Multi-Armed Bandit with General Reward Functions
NeurIPS 2016arXiv
146
citations
Ranni: Taming Text-to-Image Diffusion for Accurate Instruction Following
CVPR 2024
78
citations
Space Group Constrained Crystal Generation
ICLR 2024
60
citations
Learning Where to Focus for Efficient Video Object Detection
ECCV 2020
60
citations
Universal Actions for Enhanced Embodied Foundation Models
CVPR 2025
42
citations
SmartRefine: A Scenario-Adaptive Refinement Framework for Efficient Motion Prediction
CVPR 2024
38
citations
Rethinking the Spatial Inconsistency in Classifier-Free Diffusion Guidance
CVPR 2024
35
citations
A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning
AAAI 2024arXiv
25
citations
Fast Context-Based Low-Light Image Enhancement via Neural Implicit Representations
ECCV 2024
24
citations
Learning Disentangled Identifiers for Action-Customized Text-to-Image Generation
CVPR 2024
23
citations
Lipschitz Singularities in Diffusion Models
ICLR 2024
21
citations
Improved Video VAE for Latent Video Diffusion Model
CVPR 2025arXiv
19
citations
Decompositional Neural Scene Reconstruction with Generative Diffusion Prior
CVPR 2025
18
citations
Novel Class Discovery for Ultra-Fine-Grained Visual Categorization
CVPR 2024
13
citations
TACO: Taming Diffusion for in-the-wild Video Amodal Completion
ICCV 2025
9
citations
Elegantly Written: Disentangling Writer and Character Styles for Enhancing Online Chinese Handwriting
ECCV 2024
6
citations
Unsupervised Sequence Classification using Sequential Output Statistics
NeurIPS 2017arXiv
5
citations
IDEA-Bench: How Far are Generative Models from Professional Designing?
CVPR 2025
4
citations
BACON: Improving Clarity of Image Captions via Bag-of-Concept Graphs
CVPR 2025
3
citations
NTR-Gaussian: Nighttime Dynamic Thermal Reconstruction with 4D Gaussian Splatting Based on Thermodynamics
CVPR 2025arXiv
2
citations
AUC Optimization from Multiple Unlabeled Datasets
AAAI 2024arXiv
2
citations
See Further When Clear: Curriculum Consistency Model
CVPR 2025
2
citations
Effect Size Estimation for Duration Recommendation in Online Experiments: Leveraging Hierarchical Models and Objective Utility Approaches
AAAI 2024arXiv
1
citations
StrokeNUWA—Tokenizing Strokes for Vector Graphic Synthesis
ICML 2024
0
citations
CCM: Real-Time Controllable Visual Content Creation Using Text-to-Image Consistency Models
ICML 2024
0
citations
DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning
ICML 2024
0
citations
Learning Relaxed Deep Supervision for Better Edge Detection
CVPR 2016
0
citations
Quality Aware Network for Set to Set Recognition
CVPR 2017arXiv
0
citations
Scale-Aware Face Detection
CVPR 2017arXiv
0
citations
Hyperparameter Optimization for Tracking With Continuous Deep Q-Learning
CVPR 2018
0
citations
MoNet: Deep Motion Exploitation for Video Object Segmentation
CVPR 2018
0
citations
Exploring Disentangled Feature Representation Beyond Face Identification
CVPR 2018arXiv
0
citations
Beyond Trade-Off: Accelerate FCN-Based Face Detector With Higher Accuracy
CVPR 2018arXiv
0
citations
RGBD Based Dimensional Decomposition Residual Network for 3D Semantic Scene Completion
CVPR 2019
0
citations
Conditional Adversarial Generative Flow for Controllable Image Synthesis
CVPR 2019
0
citations
Anisotropic Convolutional Networks for 3D Semantic Scene Completion
CVPR 2020arXiv
0
citations
Rotate-and-Render: Unsupervised Photorealistic Face Rotation From Single-View Images
CVPR 2020
0
citations
Search to Distill: Pearls Are Everywhere but Not the Eyes
CVPR 2020arXiv
0
citations
DPGN: Distribution Propagation Graph Network for Few-Shot Learning
CVPR 2020arXiv
0
citations
Revisiting the Sibling Head in Object Detector
CVPR 2020arXiv
0
citations
Communication Efficient SGD via Gradient Sampling With Bayes Prior
CVPR 2021
0
citations
Actor-Context-Actor Relation Network for Spatio-Temporal Action Localization
CVPR 2021arXiv
0
citations
Lifelong Person Re-Identification via Adaptive Knowledge Accumulation
CVPR 2021arXiv
0
citations
Self-Supervised Video Representation Learning by Context and Motion Decoupling
CVPR 2021arXiv
0
citations
Segment, Magnify and Reiterate: Detecting Camouflaged Objects the Hard Way
CVPR 2022
0
citations
MixMAE: Mixed and Masked Autoencoder for Efficient Pretraining of Hierarchical Vision Transformers
CVPR 2023arXiv
0
citations
Long-Term Visual Localization With Mobile Sensors
CVPR 2023arXiv
0
citations
Dimensionality-Varying Diffusion Process
CVPR 2023arXiv
0
citations
ReasonNet: End-to-End Driving With Temporal and Global Reasoning
CVPR 2023
0
citations
Recurrent Scale Approximation for Object Detection in CNN
ICCV 2017arXiv
0
citations
Learning a Recurrent Residual Fusion Network for Multimodal Matching
ICCV 2017
0
citations
Knowledge Distillation via Route Constrained Optimization
ICCV 2019
0
citations
Exploiting Temporal Consistency for Real-Time Video Depth Estimation
ICCV 2019
0
citations
Differentiable Kernel Evolution
ICCV 2019
0
citations
Correlation Congruence for Knowledge Distillation
ICCV 2019
0
citations
Scalable Place Recognition Under Appearance Change for Autonomous Driving
ICCV 2019
0
citations
Switchable K-Class Hyperplanes for Noise-Robust Representation Learning
ICCV 2021
0
citations
DETRs with Collaborative Hybrid Assignments Training
ICCV 2023arXiv
0
citations
Decoupled DETR: Spatially Disentangling Localization and Classification for Improved End-to-End Object Detection
ICCV 2023
0
citations
Generating Dynamic Kernels via Transformers for Lane Detection
ICCV 2023
0
citations
GeoMIM: Towards Better 3D Knowledge Transfer via Masked Image Modeling for Multi-view 3D Understanding
ICCV 2023arXiv
0
citations
UniKD: Universal Knowledge Distillation for Mimicking Homogeneous or Heterogeneous Object Detectors
ICCV 2023
0
citations
Efficient-VQGAN: Towards High-Resolution Image Generation with Efficient Vision Transformers
ICCV 2023
0
citations
3D Semantic Subspace Traverser: Empowering 3D Generative Model with Shape Editing Capability
ICCV 2023arXiv
0
citations
Deep Active Contours for Real-time 6-DoF Object Tracking
ICCV 2023
0
citations
Temporal Enhanced Training of Multi-view 3D Object Detector via Historical Object Prediction
ICCV 2023arXiv
0
citations
Discriminability Distillation in Group Representation Learning
ECCV 2020
0
citations
More Classifiers, Less Forgetting: A Generic Multi-classifier Paradigm for Incremental Learning
ECCV 2020
0
citations
Camera Auto-Calibration from the Steiner Conic of the Fundamental Matrix
ECCV 2022
0
citations
Unifying Visual Perception by Dispersible Points Learning
ECCV 2022
0
citations
Self-Slimmed Vision Transformer
ECCV 2022
0
citations
Rethinking Robust Representation Learning under Fine-Grained Noisy Faces
ECCV 2022
0
citations
Towards Robust Face Recognition with Comprehensive Search
ECCV 2022
0
citations
GeoAug: Data Augmentation for Few-Shot NeRF with Geometry Constraints
ECCV 2022
0
citations
"UniNet: Unified Architecture Search with Convolution, Transformer, and MLP"
ECCV 2022
0
citations
TokenMix: Rethinking Image Mixing for Data Augmentation in Vision Transformers
ECCV 2022
0
citations
Masked Autoencoders Are Stronger Knowledge Distillers
ICCV 2023
0
citations
MOVIS: Enhancing Multi-Object Novel View Synthesis for Indoor Scenes
CVPR 2025
0
citations
MangaNinja: Line Art Colorization with Precise Reference Following
CVPR 2025
0
citations
DiffDoctor: Diagnosing Image Diffusion Models Before Treating
ICCV 2025
0
citations
Is Meta-Learning Out? Rethinking Unsupervised Few-Shot Classification with Limited Entropy
ICCV 2025
0
citations
ICE-Bench: A Unified and Comprehensive Benchmark for Image Creating and Editing
ICCV 2025
0
citations
VACE: All-in-One Video Creation and Editing
ICCV 2025
0
citations
Pretrained Reversible Generation as Unsupervised Visual Representation Learning
ICCV 2025
0
citations
LoD-Loc v2: Aerial Visual Localization over Low Level-of-Detail City Models using Explicit Silhouette Alignment
ICCV 2025
0
citations
UniFuse: A Unified All-in-One Framework for Multi-Modal Medical Image Fusion Under Diverse Degradations and Misalignments
ICCV 2025
0
citations
Improving Pointing Accuracy for 3D Target Selection in Virtual Reality through Depth Perception Biases Correction
ISMAR 2025
0
citations
As Pseudo-Label Free as Possible: Leveraging Adaptive Feature Generation for Sparsely Annotated Object Detection
AAAI 2025
0
citations
CI-STHPAN: Pre-trained Attention Network for Stock Selection with Channel-Independent Spatio-Temporal Hypergraph
AAAI 2024
0
citations
Critic-Guided Decision Transformer for Offline Reinforcement Learning
AAAI 2024
0
citations
GMP-AR: Granularity Message Passing and Adaptive Reconciliation for Temporal Hierarchy Forecasting
AAAI 2024arXiv
0
citations
Causality-Inspired Invariant Representation Learning for Text-Based Person Retrieval
AAAI 2024
0
citations
Check Locate Rectify: A Training-Free Layout Calibration System for Text-to-Image Generation
CVPR 2024
0
citations
AnyDoor: Zero-shot Object-level Image Customization
CVPR 2024
0
citations
GLID: Pre-training a Generalist Encoder-Decoder Vision Model
CVPR 2024
0
citations
EasyDrag: Efficient Point-based Manipulation on Diffusion Models
CVPR 2024
0
citations
LMDrive: Closed-Loop End-to-End Driving with Large Language Models
CVPR 2024
0
citations
CPGA: Coding Priors-Guided Aggregation Network for Compressed Video Quality Enhancement
CVPR 2024
0
citations
DreamVideo: Composing Your Dream Videos with Customized Subject and Motion
CVPR 2024
0
citations
Derivative Estimation in Random Design
NeurIPS 2018
0
citations
Large-batch Optimization for Dense Visual Predictions: Training Faster R-CNN in 4.2 Minutes
NeurIPS 2022
0
citations
LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios
NeurIPS 2023
0
citations
RAPHAEL: Text-to-Image Generation via Large Mixture of Diffusion Paths
NeurIPS 2023
0
citations
Customizable Image Synthesis with Multiple Subjects
NeurIPS 2023
0
citations
K-Means Clustering with Distributed Dimensions
ICML 2016
0
citations