Wei Liu

33
Papers
621
Total Citations
1
Affiliation

Affiliations

The Hong Kong University of Science and Technology

Papers (33)

LanguageBind: Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment

ICLR 2024
343
citations

BadCLIP: Trigger-Aware Prompt Learning for Backdoor Attacks on CLIP

CVPR 2024
68
citations

Multi-SWE-bench: A Multilingual Benchmark for Issue Resolving

NeurIPS 2025
46
citations

MathAttack: Attacking Large Language Models towards Math Solving Ability

AAAI 2024, arXiv
37
citations

IDOL: Instant Photorealistic 3D Human Creation from a Single Image

CVPR 2025, arXiv
36
citations

STIV: Scalable Text and Image Conditioned Video Generation

ICCV 2025
20
citations

MotionCraft: Crafting Whole-Body Motion with Plug-and-Play Multimodal Controls

AAAI 2025
19
citations

Local Conditional Controlling for Text-to-Image Diffusion Models

AAAI 2025
13
citations

GIM: A Million-scale Benchmark for Generative Image Manipulation Detection and Localization

AAAI 2025
9
citations

Auto-Regressive Diffusion for Generating 3D Human-Object Interactions

AAAI 2025
6
citations

EBMDock: Neural Probabilistic Protein-Protein Docking via a Differentiable Energy Model

ICLR 2024
5
citations

Adversarial Cooperative Rationalization: The Risk of Spurious Correlations in Even Clean Datasets

ICML 2025
5
citations

Fix-CLIP: Dual-Branch Hierarchical Contrastive Learning via Synthetic Captions for Better Understanding of Long Text

ICCV 2025
5
citations

Enhancing Human Experience in Human-Agent Collaboration: A Human-Centered Modeling Approach Based on Positive Human Gain

ICLR 2024
4
citations

ETVA: Evaluation of Text-to-Video Alignment via Fine-grained Question Generation and Answering

ICCV 2025
3
citations

Quantifying Distributional Invariance in Causal Subgraph for IRM-Free Graph Generalization

NeurIPS 2025
1
citation

Stability and Generalization of Zeroth-Order Decentralized Stochastic Gradient Descent with Changing Topology

AAAI 2025
1
citation

UniMC: Taming Diffusion Transformer for Unified Keypoint-Guided Multi-Class Image Generation

ICML 2025
0
citations

Enhancing Testing-Time Robustness for Trusted Multi-View Classification in the Wild

CVPR 2025
0
citations

WonderPlay: Dynamic 3D Scene Generation from a Single Image and Actions

ICCV 2025
0
citations

GraspCoT: Integrating Physical Property Reasoning for 6-DoF Grasping under Flexible Language Instructions

ICCV 2025
0
citations

HarmonySeg: Tubular Structure Segmentation with Deep-Shallow Feature Fusion and Growth-Suppression Balanced Loss

ICCV 2025
0
citations

ChemVLM: Exploring the Power of Multimodal Large Language Models in Chemistry Area

AAAI 2025
0
citations

Towards More Discriminative Feature Learning in SNNs with Temporal-Self-Erasing Supervision

AAAI 2025
0
citations

Infinite-Canvas: Higher-Resolution Video Outpainting with Extensive Content Generation

AAAI 2025
0
citations

Follow-Your-Click: Open-domain Regional Image Animation via Motion Prompts

AAAI 2025
0
citations

Just a Few Glances: Open-Set Visual Perception with Image Prompt Paradigm

AAAI 2025
0
citations

Modeling All Response Surfaces in One for Conditional Search Spaces

AAAI 2025
0
citations

Enhancing Multi-View Classification Reliability with Adaptive Rejection

AAAI 2025
0
citations

Decoupling Representation and Knowledge for Few-Shot Intent Classification and Slot Filling

AAAI 2024
0
citations

DreamIdentity: Enhanced Editability for Efficient Face-Identity Preserved Image Generation

AAAI 2024
0
citations

SeqGPT: An Out-of-the-Box Large Language Model for Open Domain Sequence Understanding

AAAI 2024, arXiv
0
citations

Modality-Agnostic Structural Image Representation Learning for Deformable Multi-Modality Medical Image Registration

CVPR 2024
0
citations