Yu Liu
45
Papers
594
Total Citations
Papers (45)
VACE: All-in-One Video Creation and Editing
ICCV 2025arXiv
169
citations
Ranni: Taming Text-to-Image Diffusion for Accurate Instruction Following
CVPR 2024
78
citations
Space Group Constrained Crystal Generation
ICLR 2024
60
citations
Universal Actions for Enhanced Embodied Foundation Models
CVPR 2025
42
citations
SmartRefine: A Scenario-Adaptive Refinement Framework for Efficient Motion Prediction
CVPR 2024
38
citations
Rethinking the Spatial Inconsistency in Classifier-Free Diffusion Guidance
CVPR 2024
35
citations
A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning
AAAI 2024arXiv
25
citations
Fast Context-Based Low-Light Image Enhancement via Neural Implicit Representations
ECCV 2024
24
citations
Learning Disentangled Identifiers for Action-Customized Text-to-Image Generation
CVPR 2024
23
citations
Lipschitz Singularities in Diffusion Models
ICLR 2024
21
citations
Improved Video VAE for Latent Video Diffusion Model
CVPR 2025arXiv
19
citations
Decompositional Neural Scene Reconstruction with Generative Diffusion Prior
CVPR 2025
18
citations
Novel Class Discovery for Ultra-Fine-Grained Visual Categorization
CVPR 2024
13
citations
TACO: Taming Diffusion for in-the-wild Video Amodal Completion
ICCV 2025
9
citations
Elegantly Written: Disentangling Writer and Character Styles for Enhancing Online Chinese Handwriting
ECCV 2024
6
citations
IDEA-Bench: How Far are Generative Models from Professional Designing?
CVPR 2025
4
citations
BACON: Improving Clarity of Image Captions via Bag-of-Concept Graphs
CVPR 2025
3
citations
See Further When Clear: Curriculum Consistency Model
CVPR 2025
2
citations
NTR-Gaussian: Nighttime Dynamic Thermal Reconstruction with 4D Gaussian Splatting Based on Thermodynamics
CVPR 2025arXiv
2
citations
AUC Optimization from Multiple Unlabeled Datasets
AAAI 2024arXiv
2
citations
Effect Size Estimation for Duration Recommendation in Online Experiments: Leveraging Hierarchical Models and Objective Utility Approaches
AAAI 2024arXiv
1
citations
CI-STHPAN: Pre-trained Attention Network for Stock Selection with Channel-Independent Spatio-Temporal Hypergraph
AAAI 2024
0
citations
Critic-Guided Decision Transformer for Offline Reinforcement Learning
AAAI 2024
0
citations
GMP-AR: Granularity Message Passing and Adaptive Reconciliation for Temporal Hierarchy Forecasting
AAAI 2024arXiv
0
citations
Causality-Inspired Invariant Representation Learning for Text-Based Person Retrieval
AAAI 2024
0
citations
Pretrained Reversible Generation as Unsupervised Visual Representation Learning
ICCV 2025
0
citations
Check Locate Rectify: A Training-Free Layout Calibration System for Text-to-Image Generation
CVPR 2024
0
citations
AnyDoor: Zero-shot Object-level Image Customization
CVPR 2024
0
citations
ICE-Bench: A Unified and Comprehensive Benchmark for Image Creating and Editing
ICCV 2025
0
citations
GLID: Pre-training a Generalist Encoder-Decoder Vision Model
CVPR 2024
0
citations
EasyDrag: Efficient Point-based Manipulation on Diffusion Models
CVPR 2024
0
citations
LMDrive: Closed-Loop End-to-End Driving with Large Language Models
CVPR 2024
0
citations
CPGA: Coding Priors-Guided Aggregation Network for Compressed Video Quality Enhancement
CVPR 2024
0
citations
Is Meta-Learning Out? Rethinking Unsupervised Few-Shot Classification with Limited Entropy
ICCV 2025
0
citations
DiffDoctor: Diagnosing Image Diffusion Models Before Treating
ICCV 2025
0
citations
DreamVideo: Composing Your Dream Videos with Customized Subject and Motion
CVPR 2024
0
citations
MangaNinja: Line Art Colorization with Precise Reference Following
CVPR 2025
0
citations
MOVIS: Enhancing Multi-Object Novel View Synthesis for Indoor Scenes
CVPR 2025
0
citations
StrokeNUWA—Tokenizing Strokes for Vector Graphic Synthesis
ICML 2024
0
citations
CCM: Real-Time Controllable Visual Content Creation Using Text-to-Image Consistency Models
ICML 2024
0
citations
UniFuse: A Unified All-in-One Framework for Multi-Modal Medical Image Fusion Under Diverse Degradations and Misalignments
ICCV 2025
0
citations
DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning
ICML 2024
0
citations
LoD-Loc v2: Aerial Visual Localization over Low Level-of-Detail City Models using Explicit Silhouette Alignment
ICCV 2025
0
citations
Improving Pointing Accuracy for 3D Target Selection in Virtual Reality through Depth Perception Biases Correction
ISMAR 2025
0
citations
As Pseudo-Label Free as Possible: Leveraging Adaptive Feature Generation for Sparsely Annotated Object Detection
AAAI 2025
0
citations