Lei Wang

23
Papers
144
Total Citations

Papers (23)

S2WAT: Image Style Transfer via Hierarchical Vision Transformer Using Strips Window Attention

AAAI 2024, arXiv
46
citations

View-decoupled Transformer for Person Re-identification under Aerial-ground Camera Network

CVPR 2024
32
citations

Unlocking Multimodal Mathematical Reasoning via Process Reward Model

NeurIPS 2025, arXiv
29
citations

Learnable Expansion of Graph Operators for Multi-Modal Feature Fusion

ICLR 2025
16
citations

Attention-Driven GUI Grounding: Leveraging Pretrained Multimodal Large Language Models Without Fine-Tuning

AAAI 2025
9
citations

Elegantly Written: Disentangling Writer and Character Styles for Enhancing Online Chinese Handwriting

ECCV 2024
6
citations

Not All Parameters Matter: Masking Diffusion Models for Enhancing Generation Ability

CVPR 2025, arXiv
2
citations

Correlative and Discriminative Label Grouping for Multi-Label Visual Prompt Tuning

CVPR 2025
2
citations

Graph Your Own Prompt

NeurIPS 2025
1
citations

Puzzles: Unbounded Video-Depth Augmentation for Scalable End-to-End 3D Reconstruction

NeurIPS 2025, arXiv
1
citations

AUEditNet: Dual-Branch Facial Action Unit Intensity Manipulation with Implicit Disentanglement

CVPR 2024
0
citations

Ditto: Quantization-aware Secure Inference of Transformers upon MPC

ICML 2024
0
citations

One-Way Ticket: Time-Independent Unified Encoder for Distilling Text-to-Image Diffusion Models

CVPR 2025
0
citations

Taylor Videos for Action Recognition

ICML 2024
0
citations

Text-Driven Fashion Image Editing with Compositional Concept Learning and Counterfactual Abduction

CVPR 2025
0
citations

Visual Representation Learning through Causal Intervention for Controllable Image Editing

CVPR 2025
0
citations

Enhancing Few-Shot Class-Incremental Learning via Training-Free Bi-Level Modality Calibration

CVPR 2025
0
citations

LoRA-FAIR: Federated LoRA Fine-Tuning with Aggregation and Initialization Refinement

ICCV 2025
0
citations

Dynamic Multi-Layer Null Space Projection for Vision-Language Continual Learning

ICCV 2025
0
citations

FedEL: Federated Elastic Learning for Heterogeneous Devices

NeurIPS 2025
0
citations

T-SciQ: Teaching Multimodal Chain-of-Thought Reasoning via Large Language Model Signals for Science Question Answering

AAAI 2024
0
citations

Roll with the Punches: Expansion and Shrinkage of Soft Label Selection for Semi-supervised Fine-Grained Learning

AAAI 2024
0
citations

Exploiting Inter-sample and Inter-feature Relations in Dataset Distillation

CVPR 2024
0
citations