Wei Liu
33
Papers
621
Total Citations
1
Affiliations
Affiliations
The Hong Kong University of Science and Technology
Papers (33)
LanguageBind: Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment
ICLR 2024
343
citations
BadCLIP: Trigger-Aware Prompt Learning for Backdoor Attacks on CLIP
CVPR 2024
68
citations
Multi-SWE-bench: A Multilingual Benchmark for Issue Resolving
NeurIPS 2025
46
citations
MathAttack: Attacking Large Language Models towards Math Solving Ability
AAAI 2024arXiv
37
citations
IDOL: Instant Photorealistic 3D Human Creation from a Single Image
CVPR 2025arXiv
36
citations
STIV: Scalable Text and Image Conditioned Video Generation
ICCV 2025
20
citations
MotionCraft: Crafting Whole-Body Motion with Plug-and-Play Multimodal Controls
AAAI 2025
19
citations
Local Conditional Controlling for Text-to-Image Diffusion Models
AAAI 2025
13
citations
GIM: A Million-scale Benchmark for Generative Image Manipulation Detection and Localization
AAAI 2025
9
citations
Auto-Regressive Diffusion for Generating 3D Human-Object Interactions
AAAI 2025
6
citations
EBMDock: Neural Probabilistic Protein-Protein Docking via a Differentiable Energy Model
ICLR 2024
5
citations
Adversarial Cooperative Rationalization: The Risk of Spurious Correlations in Even Clean Datasets
ICML 2025
5
citations
Fix-CLIP: Dual-Branch Hierarchical Contrastive Learning via Synthetic Captions for Better Understanding of Long Text
ICCV 2025
5
citations
Enhancing Human Experience in Human-Agent Collaboration: A Human-Centered Modeling Approach Based on Positive Human Gain
ICLR 2024
4
citations
ETVA: Evaluation of Text-to-Video Alignment via Fine-grained Question Generation and Answering
ICCV 2025
3
citations
Quantifying Distributional Invariance in Causal Subgraph for IRM-Free Graph Generalization
NeurIPS 2025
1
citations
Stability and Generalization of Zeroth-Order Decentralized Stochastic Gradient Descent with Changing Topology
AAAI 2025
1
citations
UniMC: Taming Diffusion Transformer for Unified Keypoint-Guided Multi-Class Image Generation
ICML 2025
0
citations
Enhancing Testing-Time Robustness for Trusted Multi-View Classification in the Wild
CVPR 2025
0
citations
WonderPlay: Dynamic 3D Scene Generation from a Single Image and Actions
ICCV 2025
0
citations
GraspCoT: Integrating Physical Property Reasoning for 6-DoF Grasping under Flexible Language Instructions
ICCV 2025
0
citations
HarmonySeg: Tubular Structure Segmentation with Deep-Shallow Feature Fusion and Growth-Suppression Balanced Loss
ICCV 2025
0
citations
ChemVLM: Exploring the Power of Multimodal Large Language Models in Chemistry Area
AAAI 2025
0
citations
Towards More Discriminative Feature Learning in SNNs with Temporal-Self-Erasing Supervision
AAAI 2025
0
citations
Infinite-Canvas: Higher-Resolution Video Outpainting with Extensive Content Generation
AAAI 2025
0
citations
Follow-Your-Click: Open-domain Regional Image Animation via Motion Prompts
AAAI 2025
0
citations
Just a Few Glances: Open-Set Visual Perception with Image Prompt Paradigm
AAAI 2025
0
citations
Modeling All Response Surfaces in One for Conditional Search Spaces
AAAI 2025
0
citations
Enhancing Multi-View Classification Reliability with Adaptive Rejection
AAAI 2025
0
citations
Decoupling Representation and Knowledge for Few-Shot Intent Classification and Slot Filling
AAAI 2024
0
citations
DreamIdentity: Enhanced Editability for Efficient Face-Identity Preserved Image Generation
AAAI 2024
0
citations
SeqGPT: An Out-of-the-Box Large Language Model for Open Domain Sequence Understanding
AAAI 2024arXiv
0
citations
Modality-Agnostic Structural Image Representation Learning for Deformable Multi-Modality Medical Image Registration
CVPR 2024
0
citations