Xiang Wang
49
Papers
441
Total Citations
Papers (49)
InstructVideo: Instructing Video Diffusion Models with Human Feedback
CVPR 2024
80
citations
Towards 3D Molecule-Text Interpretation in Language Models
ICLR 2024
73
citations
Animate-X: Universal Character Image Animation with Enhanced Motion Representation
ICLR 2025
59
citations
Hierarchical Spatio-temporal Decoupling for Text-to-Video Generation
CVPR 2024
55
citations
A Recipe for Scaling up Text-to-Video Generation with Text-free Videos
CVPR 2024
53
citations
Enhance Image Classification via Inter-Class Image Mixup with Diffusion Model
CVPR 2024
37
citations
Language Representations Can be What Recommenders Need: Findings and Potentials
ICLR 2025
23
citations
Text-to-Image Generation for Abstract Concepts
AAAI 2024 (arXiv)
21
citations
Reinforced Lifelong Editing for Language Models
ICML 2025
21
citations
HR-Pro: Point-Supervised Temporal Action Localization via Hierarchical Reliability Propagation
AAAI 2024 (arXiv)
19
citations
SpikeZIP-TF: Conversion is All You Need for Transformer-based SNN
ICML 2024
0
citations
Improving Object Proposals With Multi-Thresholding Straddling Expansion
CVPR 2015
0
citations
Weakly-Supervised Semantic Segmentation by Iteratively Mining Common Object Features
CVPR 2018 (arXiv)
0
citations
Temporal Context Aggregation Network for Temporal Action Proposal Refinement
CVPR 2021 (arXiv)
0
citations
Self-Supervised Learning for Semi-Supervised Temporal Action Proposal
CVPR 2021 (arXiv)
0
citations
Invariant Grounding for Video Question Answering
CVPR 2022
0
citations
Temporal Feature Alignment and Mutual Information Maximization for Video-Based Human Pose Estimation
CVPR 2022 (arXiv)
0
citations
Learning From Untrimmed Videos: Self-Supervised Video Representation Learning With Hierarchical Consistency
CVPR 2022 (arXiv)
0
citations
Hybrid Relation Guided Set Matching for Few-Shot Action Recognition
CVPR 2022 (arXiv)
0
citations
Revisiting Domain Generalized Stereo Matching Networks From a Feature Consistency Perspective
CVPR 2022 (arXiv)
0
citations
MoLo: Motion-Augmented Long-Short Contrastive Learning for Few-Shot Action Recognition
CVPR 2023 (arXiv)
0
citations
Enlarging Instance-Specific and Class-Specific Information for Open-Set Action Recognition
CVPR 2023 (arXiv)
0
citations
OadTR: Online Action Detection With Transformers
ICCV 2021 (arXiv)
0
citations
Space-time Prompting for Video Class-incremental Learning
ICCV 2023
0
citations
Discovering Spatio-Temporal Rationales for Video Question Answering
ICCV 2023 (arXiv)
0
citations
RLIPv2: Fast Scaling of Relational Language-Image Pre-Training
ICCV 2023 (arXiv)
0
citations
Geometric Alignment and Prior Modulation for View-Guided Point Cloud Completion on Unseen Categories
ICCV 2025
0
citations
Holmes-VAU: Towards Long-term Video Anomaly Understanding at Any Granularity
CVPR 2025
0
citations
PersonalVideo: High ID-Fidelity Video Customization without Dynamic and Semantic Degradation
ICCV 2025
0
citations
DreamRelation: Relation-Centric Video Customization
ICCV 2025
0
citations
FreeScale: Unleashing the Resolution of Diffusion Models via Tuning-Free Scale Fusion
ICCV 2025
0
citations
Intermediate Connectors and Geometric Priors for Language-Guided Affordance Segmentation on Unseen Object Categories
ICCV 2025
0
citations
Precise, Fast, and Low-cost Concept Erasure in Value Space: Orthogonal Complement Matters
CVPR 2025
0
citations
On Efficiency-Effectiveness Trade-off of Diffusion-based Recommenders
NeurIPS 2025
0
citations
LASO: Language-guided Affordance Segmentation on 3D Object
CVPR 2024
0
citations
Auctionformer: A Unified Deep Learning Algorithm for Solving Equilibrium Strategies in Auction Games
ICML 2024
0
citations
Explaining Landscape Connectivity of Low-cost Solutions for Multilayer Nets
NeurIPS 2019
0
citations
Beyond Lazy Training for Over-parameterized Tensor Decomposition
NeurIPS 2020
0
citations
Understanding Deflation Process in Over-parametrized Tensor Decomposition
NeurIPS 2021
0
citations
Towards Multi-Grained Explainability for Graph Neural Networks
NeurIPS 2021
0
citations
Incorporating Bias-aware Margins into Contrastive Loss for Collaborative Filtering
NeurIPS 2022
0
citations
Learning a Condensed Frame for Memory-Efficient Video Class-Incremental Learning
NeurIPS 2022
0
citations
Empowering Collaborative Filtering with Principled Adversarial Contrastive Loss
NeurIPS 2023
0
citations
VideoComposer: Compositional Video Synthesis with Motion Controllability
NeurIPS 2023
0
citations
Unleashing the Power of Graph Data Augmentation on Covariate Distribution Shift
NeurIPS 2023
0
citations
Understanding Contrastive Learning via Distributionally Robust Optimization
NeurIPS 2023
0
citations
Generate What You Prefer: Reshaping Sequential Recommendation via Guided Diffusion
NeurIPS 2023
0
citations
Rethinking Tokenizer and Decoder in Masked Graph Modeling for Molecules
NeurIPS 2023
0
citations
Evaluating Post-hoc Explanations for Graph Neural Networks via Robustness Analysis
NeurIPS 2023
0
citations