Yanfeng Wang

25
Papers
128
Total Citations

Papers (25)

ReMamber: Referring Image Segmentation with Mamba Twister

ECCV 2024
49
citations

Audio-Visual Segmentation via Unlabeled Frame Exploitation

CVPR 2024
27
citations

Towards Universal Soccer Video Understanding

CVPR 2025
14
citations

Multi-Sentence Grounding for Long-term Instructional Video

ECCV 2024
12
citations

4DGC: Rate-Aware 4D Gaussian Compression for Efficient Streamable Free-Viewpoint Video

CVPR 2025
11
citations

On Harmonizing Implicit Subpopulations

ICLR 2024
8
citations

Learning to Instruct for Visual Instruction Tuning

NeurIPS 2025arXiv
3
citations

Differential-informed Sample Selection Accelerates Multimodal Contrastive Learning

ICCV 2025
2
citations

Fine-tuning with Reserved Majority for Noise Reduction

ICLR 2025
2
citations

Adapting Visual-Language Models for Generalizable Anomaly Detection in Medical Images

CVPR 2024
0
citations

HarmoDT: Harmony Multi-Task Decision Transformer for Offline Reinforcement Learning

ICML 2024
0
citations

Q-value Regularized Transformer for Offline Reinforcement Learning

ICML 2024
0
citations

Locally Estimated Global Perturbations are Better than Local Perturbations for Federated Sharpness-aware Minimization

ICML 2024
0
citations

Self-Alignment of Large Language Models via Monopolylogue-based Social Scene Simulation

ICML 2024
0
citations

Exploring Training on Heterogeneous Data with Mixture of Low-rank Adapters

ICML 2024
0
citations

LamRA: Large Multimodal Model as Your Advanced Retrieval Assistant

CVPR 2025
0
citations

Diversified Batch Selection for Training Acceleration

ICML 2024
0
citations

Advancing Myopia To Holism: Fully Contrastive Language-Image Pre-training

CVPR 2025
0
citations

MRGen: Segmentation Data Engine For Underrepresented MRI Modalities

ICCV 2025
0
citations

RAD: Towards Trustworthy Retrieval-Augmented Multi-modal Clinical Diagnosis

NeurIPS 2025
0
citations

VRVVC: Variable-Rate NeRF-Based Volumetric Video Compression

AAAI 2025
0
citations

Low-Rank Knowledge Decomposition for Medical Foundation Models

CVPR 2024
0
citations

Intelligent Grimm - Open-ended Visual Storytelling via Latent Diffusion Models

CVPR 2024
0
citations

Editable Scene Simulation for Autonomous Driving via Collaborative LLM-Agents

CVPR 2024
0
citations

Mitigating Noisy Correspondence by Geometrical Structure Consistency Learning

CVPR 2024
0
citations