Meng Wang
35
Papers
216
Total Citations
Papers (35)
Prototypical Calibrating Ambiguous Samples for Micro-Action Recognition
AAAI 2025
41
citations
Frequency Decoupling for Motion Magnification via Multi-Level Isomorphic Architecture
CVPR 2024
35
citations
EgoTextVQA: Towards Egocentric Scene-Text Aware Video Question Answering
CVPR 2025
28
citations
Object-Aware Adaptive-Positivity Learning for Audio-Visual Question Answering
AAAI 2024arXiv
24
citations
StyleTokenizer: Defining Image Style by a Single Instance for Controlling Diffusion Models
ECCV 2024
22
citations
A Dual-Way Enhanced Framework from Text Matching Point of View for Multimodal Entity Linking
AAAI 2024arXiv
18
citations
Learning Dynamic Tetrahedra for High-Quality Talking Head Synthesis
CVPR 2024
12
citations
ASAP: Advancing Semantic Alignment Promotes Multi-Modal Manipulation Detecting and Grounding
CVPR 2025
10
citations
TASAR: Transfer-based Attack on Skeletal Action Recognition
ICLR 2025
9
citations
MOL-Mamba: Enhancing Molecular Representation with Structural & Electronic Insights
AAAI 2025
6
citations
Visual-Oriented Fine-Grained Knowledge Editing for MultiModal Large Language Models
ICCV 2025
4
citations
FakeDiffer: Distributional Disparity Learning on Differentiated Reconstruction for Face Forgery Detection
AAAI 2025
3
citations
Boosting Adversarial Transferability via Residual Perturbation Attack
ICCV 2025
2
citations
Towards Efficient General Feature Prediction in Masked Skeleton Modeling
ICCV 2025arXiv
1
citations
GT-Mean Loss: A Simple Yet Effective Solution for Brightness Mismatch in Low-Light Image Enhancement
ICCV 2025
1
citations
Revisiting the Power of Prompt for Visual Tuning
ICML 2024
0
citations
Adaptive Group Personalization for Federated Mutual Transfer Learning
ICML 2024
0
citations
SF-DQN: Provable Knowledge Transfer using Successor Feature for Deep Reinforcement Learning
ICML 2024
0
citations
What Improves the Generalization of Graph Transformers? A Theoretical Dive into the Self-attention and Positional Encoding
ICML 2024
0
citations
Vision-Language Model IP Protection via Prompt-based Learning
CVPR 2025arXiv
0
citations
How Do Nonlinear Transformers Learn and Generalize in In-Context Learning?
ICML 2024
0
citations
Revisiting Audio-Visual Segmentation with Vision-Centric Transformer
CVPR 2025
0
citations
Towards Open-Vocabulary Audio-Visual Event Localization
CVPR 2025
0
citations
SMoLoRA: Exploring and Defying Dual Catastrophic Forgetting in Continual Visual Instruction Tuning
ICCV 2025
0
citations
DistillDrive: End-to-End Multi-Mode Autonomous Driving Distillation by Isomorphic Hetero-Source Planning Model
ICCV 2025
0
citations
An Information-Theoretic Regularizer for Lossy Neural Image Compression
ICCV 2025
0
citations
MMAD: Multi-label Micro-Action Detection in Videos
ICCV 2025
0
citations
PhysDiff: Physiology-based Dynamicity Disentangled Diffusion Model for Remote Physiological Measurement
AAAI 2025
0
citations
VLScene: Vision-Language Guidance Distillation for Camera-Based 3D Semantic Scene Completion
AAAI 2025
0
citations
Thinking in Granularity: Dynamic Quantization for Image Super-Resolution by Intriguing Multi-Granularity Clues
AAAI 2025
0
citations
EulerMormer: Robust Eulerian Motion Magnification via Dynamic Filtering within Transformer
AAAI 2024
0
citations
KPA-Tracker: Towards Robust and Real-Time Category-Level Articulated Object 6D Pose Tracking
AAAI 2024
0
citations
Data-Free Quantization via Pseudo-label Filtering
CVPR 2024
0
citations
Structure Matters: Tackling the Semantic Discrepancy in Diffusion Models for Image Inpainting
CVPR 2024
0
citations
A Provably Effective Method for Pruning Experts in Fine-tuned Sparse Mixture-of-Experts
ICML 2024
0
citations