Meng Wang

35
Papers
216
Total Citations

Papers (35)

Prototypical Calibrating Ambiguous Samples for Micro-Action Recognition

AAAI 2025
41
citations

Frequency Decoupling for Motion Magnification via Multi-Level Isomorphic Architecture

CVPR 2024
35
citations

EgoTextVQA: Towards Egocentric Scene-Text Aware Video Question Answering

CVPR 2025
28
citations

Object-Aware Adaptive-Positivity Learning for Audio-Visual Question Answering

AAAI 2024arXiv
24
citations

StyleTokenizer: Defining Image Style by a Single Instance for Controlling Diffusion Models

ECCV 2024
22
citations

A Dual-Way Enhanced Framework from Text Matching Point of View for Multimodal Entity Linking

AAAI 2024arXiv
18
citations

Learning Dynamic Tetrahedra for High-Quality Talking Head Synthesis

CVPR 2024
12
citations

ASAP: Advancing Semantic Alignment Promotes Multi-Modal Manipulation Detecting and Grounding

CVPR 2025
10
citations

TASAR: Transfer-based Attack on Skeletal Action Recognition

ICLR 2025
9
citations

MOL-Mamba: Enhancing Molecular Representation with Structural & Electronic Insights

AAAI 2025
6
citations

Visual-Oriented Fine-Grained Knowledge Editing for MultiModal Large Language Models

ICCV 2025
4
citations

FakeDiffer: Distributional Disparity Learning on Differentiated Reconstruction for Face Forgery Detection

AAAI 2025
3
citations

Boosting Adversarial Transferability via Residual Perturbation Attack

ICCV 2025
2
citations

Towards Efficient General Feature Prediction in Masked Skeleton Modeling

ICCV 2025arXiv
1
citations

GT-Mean Loss: A Simple Yet Effective Solution for Brightness Mismatch in Low-Light Image Enhancement

ICCV 2025
1
citations

Revisiting the Power of Prompt for Visual Tuning

ICML 2024
0
citations

Adaptive Group Personalization for Federated Mutual Transfer Learning

ICML 2024
0
citations

SF-DQN: Provable Knowledge Transfer using Successor Feature for Deep Reinforcement Learning

ICML 2024
0
citations

What Improves the Generalization of Graph Transformers? A Theoretical Dive into the Self-attention and Positional Encoding

ICML 2024
0
citations

Vision-Language Model IP Protection via Prompt-based Learning

CVPR 2025arXiv
0
citations

How Do Nonlinear Transformers Learn and Generalize in In-Context Learning?

ICML 2024
0
citations

Revisiting Audio-Visual Segmentation with Vision-Centric Transformer

CVPR 2025
0
citations

Towards Open-Vocabulary Audio-Visual Event Localization

CVPR 2025
0
citations

SMoLoRA: Exploring and Defying Dual Catastrophic Forgetting in Continual Visual Instruction Tuning

ICCV 2025
0
citations

DistillDrive: End-to-End Multi-Mode Autonomous Driving Distillation by Isomorphic Hetero-Source Planning Model

ICCV 2025
0
citations

An Information-Theoretic Regularizer for Lossy Neural Image Compression

ICCV 2025
0
citations

MMAD: Multi-label Micro-Action Detection in Videos

ICCV 2025
0
citations

PhysDiff: Physiology-based Dynamicity Disentangled Diffusion Model for Remote Physiological Measurement

AAAI 2025
0
citations

VLScene: Vision-Language Guidance Distillation for Camera-Based 3D Semantic Scene Completion

AAAI 2025
0
citations

Thinking in Granularity: Dynamic Quantization for Image Super-Resolution by Intriguing Multi-Granularity Clues

AAAI 2025
0
citations

EulerMormer: Robust Eulerian Motion Magnification via Dynamic Filtering within Transformer

AAAI 2024
0
citations

KPA-Tracker: Towards Robust and Real-Time Category-Level Articulated Object 6D Pose Tracking

AAAI 2024
0
citations

Data-Free Quantization via Pseudo-label Filtering

CVPR 2024
0
citations

Structure Matters: Tackling the Semantic Discrepancy in Diffusion Models for Image Inpainting

CVPR 2024
0
citations

A Provably Effective Method for Pruning Experts in Fine-tuned Sparse Mixture-of-Experts

ICML 2024
0
citations