Yu Liu

45
Papers
594
Total Citations

Papers (45)

VACE: All-in-One Video Creation and Editing

ICCV 2025arXiv
169
citations

Ranni: Taming Text-to-Image Diffusion for Accurate Instruction Following

CVPR 2024
78
citations

Space Group Constrained Crystal Generation

ICLR 2024
60
citations

Universal Actions for Enhanced Embodied Foundation Models

CVPR 2025
42
citations

SmartRefine: A Scenario-Adaptive Refinement Framework for Efficient Motion Prediction

CVPR 2024
38
citations

Rethinking the Spatial Inconsistency in Classifier-Free Diffusion Guidance

CVPR 2024
35
citations

A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning

AAAI 2024arXiv
25
citations

Fast Context-Based Low-Light Image Enhancement via Neural Implicit Representations

ECCV 2024
24
citations

Learning Disentangled Identifiers for Action-Customized Text-to-Image Generation

CVPR 2024
23
citations

Lipschitz Singularities in Diffusion Models

ICLR 2024
21
citations

Improved Video VAE for Latent Video Diffusion Model

CVPR 2025arXiv
19
citations

Decompositional Neural Scene Reconstruction with Generative Diffusion Prior

CVPR 2025
18
citations

Novel Class Discovery for Ultra-Fine-Grained Visual Categorization

CVPR 2024
13
citations

TACO: Taming Diffusion for in-the-wild Video Amodal Completion

ICCV 2025
9
citations

Elegantly Written: Disentangling Writer and Character Styles for Enhancing Online Chinese Handwriting

ECCV 2024
6
citations

IDEA-Bench: How Far are Generative Models from Professional Designing?

CVPR 2025
4
citations

BACON: Improving Clarity of Image Captions via Bag-of-Concept Graphs

CVPR 2025
3
citations

See Further When Clear: Curriculum Consistency Model

CVPR 2025
2
citations

NTR-Gaussian: Nighttime Dynamic Thermal Reconstruction with 4D Gaussian Splatting Based on Thermodynamics

CVPR 2025arXiv
2
citations

AUC Optimization from Multiple Unlabeled Datasets

AAAI 2024arXiv
2
citations

Effect Size Estimation for Duration Recommendation in Online Experiments: Leveraging Hierarchical Models and Objective Utility Approaches

AAAI 2024arXiv
1
citations

CI-STHPAN: Pre-trained Attention Network for Stock Selection with Channel-Independent Spatio-Temporal Hypergraph

AAAI 2024
0
citations

Critic-Guided Decision Transformer for Offline Reinforcement Learning

AAAI 2024
0
citations

GMP-AR: Granularity Message Passing and Adaptive Reconciliation for Temporal Hierarchy Forecasting

AAAI 2024arXiv
0
citations

Causality-Inspired Invariant Representation Learning for Text-Based Person Retrieval

AAAI 2024
0
citations

Pretrained Reversible Generation as Unsupervised Visual Representation Learning

ICCV 2025
0
citations

Check Locate Rectify: A Training-Free Layout Calibration System for Text-to-Image Generation

CVPR 2024
0
citations

AnyDoor: Zero-shot Object-level Image Customization

CVPR 2024
0
citations

ICE-Bench: A Unified and Comprehensive Benchmark for Image Creating and Editing

ICCV 2025
0
citations

GLID: Pre-training a Generalist Encoder-Decoder Vision Model

CVPR 2024
0
citations

EasyDrag: Efficient Point-based Manipulation on Diffusion Models

CVPR 2024
0
citations

LMDrive: Closed-Loop End-to-End Driving with Large Language Models

CVPR 2024
0
citations

CPGA: Coding Priors-Guided Aggregation Network for Compressed Video Quality Enhancement

CVPR 2024
0
citations

Is Meta-Learning Out? Rethinking Unsupervised Few-Shot Classification with Limited Entropy

ICCV 2025
0
citations

DiffDoctor: Diagnosing Image Diffusion Models Before Treating

ICCV 2025
0
citations

DreamVideo: Composing Your Dream Videos with Customized Subject and Motion

CVPR 2024
0
citations

MangaNinja: Line Art Colorization with Precise Reference Following

CVPR 2025
0
citations

MOVIS: Enhancing Multi-Object Novel View Synthesis for Indoor Scenes

CVPR 2025
0
citations

StrokeNUWA—Tokenizing Strokes for Vector Graphic Synthesis

ICML 2024
0
citations

CCM: Real-Time Controllable Visual Content Creation Using Text-to-Image Consistency Models

ICML 2024
0
citations

UniFuse: A Unified All-in-One Framework for Multi-Modal Medical Image Fusion Under Diverse Degradations and Misalignments

ICCV 2025
0
citations

DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning

ICML 2024
0
citations

LoD-Loc v2: Aerial Visual Localization over Low Level-of-Detail City Models using Explicit Silhouette Alignment

ICCV 2025
0
citations

Improving Pointing Accuracy for 3D Target Selection in Virtual Reality through Depth Perception Biases Correction

ISMAR 2025
0
citations

As Pseudo-Label Free as Possible: Leveraging Adaptive Feature Generation for Sparsely Annotated Object Detection

AAAI 2025
0
citations