Xiang Wang

49
Papers
441
Total Citations

Papers (49)

InstructVideo: Instructing Video Diffusion Models with Human Feedback

CVPR 2024
80
citations

Towards 3D Molecule-Text Interpretation in Language Models

ICLR 2024
73
citations

Animate-X: Universal Character Image Animation with Enhanced Motion Representation

ICLR 2025
59
citations

Hierarchical Spatio-temporal Decoupling for Text-to-Video Generation

CVPR 2024
55
citations

A Recipe for Scaling up Text-to-Video Generation with Text-free Videos

CVPR 2024
53
citations

Enhance Image Classification via Inter-Class Image Mixup with Diffusion Model

CVPR 2024
37
citations

Language Representations Can be What Recommenders Need: Findings and Potentials

ICLR 2025
23
citations

Text-to-Image Generation for Abstract Concepts

AAAI 2024, arXiv
21
citations

Reinforced Lifelong Editing for Language Models

ICML 2025
21
citations

HR-Pro: Point-Supervised Temporal Action Localization via Hierarchical Reliability Propagation

AAAI 2024, arXiv
19
citations

SpikeZIP-TF: Conversion is All You Need for Transformer-based SNN

ICML 2024
0
citations

Improving Object Proposals With Multi-Thresholding Straddling Expansion

CVPR 2015
0
citations

Weakly-Supervised Semantic Segmentation by Iteratively Mining Common Object Features

CVPR 2018, arXiv
0
citations

Temporal Context Aggregation Network for Temporal Action Proposal Refinement

CVPR 2021, arXiv
0
citations

Self-Supervised Learning for Semi-Supervised Temporal Action Proposal

CVPR 2021, arXiv
0
citations

Invariant Grounding for Video Question Answering

CVPR 2022
0
citations

Temporal Feature Alignment and Mutual Information Maximization for Video-Based Human Pose Estimation

CVPR 2022, arXiv
0
citations

Learning From Untrimmed Videos: Self-Supervised Video Representation Learning With Hierarchical Consistency

CVPR 2022, arXiv
0
citations

Hybrid Relation Guided Set Matching for Few-Shot Action Recognition

CVPR 2022, arXiv
0
citations

Revisiting Domain Generalized Stereo Matching Networks From a Feature Consistency Perspective

CVPR 2022, arXiv
0
citations

MoLo: Motion-Augmented Long-Short Contrastive Learning for Few-Shot Action Recognition

CVPR 2023, arXiv
0
citations

Enlarging Instance-Specific and Class-Specific Information for Open-Set Action Recognition

CVPR 2023, arXiv
0
citations

OadTR: Online Action Detection With Transformers

ICCV 2021, arXiv
0
citations

Space-time Prompting for Video Class-incremental Learning

ICCV 2023
0
citations

Discovering Spatio-Temporal Rationales for Video Question Answering

ICCV 2023, arXiv
0
citations

RLIPv2: Fast Scaling of Relational Language-Image Pre-Training

ICCV 2023, arXiv
0
citations

Geometric Alignment and Prior Modulation for View-Guided Point Cloud Completion on Unseen Categories

ICCV 2025
0
citations

Holmes-VAU: Towards Long-term Video Anomaly Understanding at Any Granularity

CVPR 2025
0
citations

PersonalVideo: High ID-Fidelity Video Customization without Dynamic and Semantic Degradation

ICCV 2025
0
citations

DreamRelation: Relation-Centric Video Customization

ICCV 2025
0
citations

FreeScale: Unleashing the Resolution of Diffusion Models via Tuning-Free Scale Fusion

ICCV 2025
0
citations

Intermediate Connectors and Geometric Priors for Language-Guided Affordance Segmentation on Unseen Object Categories

ICCV 2025
0
citations

Precise, Fast, and Low-cost Concept Erasure in Value Space: Orthogonal Complement Matters

CVPR 2025
0
citations

On Efficiency-Effectiveness Trade-off of Diffusion-based Recommenders

NeurIPS 2025
0
citations

LASO: Language-guided Affordance Segmentation on 3D Object

CVPR 2024
0
citations

Auctionformer: A Unified Deep Learning Algorithm for Solving Equilibrium Strategies in Auction Games

ICML 2024
0
citations

Explaining Landscape Connectivity of Low-cost Solutions for Multilayer Nets

NeurIPS 2019
0
citations

Beyond Lazy Training for Over-parameterized Tensor Decomposition

NeurIPS 2020
0
citations

Understanding Deflation Process in Over-parametrized Tensor Decomposition

NeurIPS 2021
0
citations

Towards Multi-Grained Explainability for Graph Neural Networks

NeurIPS 2021
0
citations

Incorporating Bias-aware Margins into Contrastive Loss for Collaborative Filtering

NeurIPS 2022
0
citations

Learning a Condensed Frame for Memory-Efficient Video Class-Incremental Learning

NeurIPS 2022
0
citations

Empowering Collaborative Filtering with Principled Adversarial Contrastive Loss

NeurIPS 2023
0
citations

VideoComposer: Compositional Video Synthesis with Motion Controllability

NeurIPS 2023
0
citations

Unleashing the Power of Graph Data Augmentation on Covariate Distribution Shift

NeurIPS 2023
0
citations

Understanding Contrastive Learning via Distributionally Robust Optimization

NeurIPS 2023
0
citations

Generate What You Prefer: Reshaping Sequential Recommendation via Guided Diffusion

NeurIPS 2023
0
citations

Rethinking Tokenizer and Decoder in Masked Graph Modeling for Molecules

NeurIPS 2023
0
citations

Evaluating Post-hoc Explanations for Graph Neural Networks via Robustness Analysis

NeurIPS 2023
0
citations