Yanfeng Wang

48
Papers
337
Total Citations

Papers (48)

Bottom-Up Temporal Action Localization with Mutual Regularization

ECCV 2020
209
citations

ReMamber: Referring Image Segmentation with Mamba Twister

ECCV 2024
49
citations

Audio-Visual Segmentation via Unlabeled Frame Exploitation

CVPR 2024
27
citations

Towards Universal Soccer Video Understanding

CVPR 2025
14
citations

Multi-Sentence Grounding for Long-term Instructional Video

ECCV 2024
12
citations

4DGC: Rate-Aware 4D Gaussian Compression for Efficient Streamable Free-Viewpoint Video

CVPR 2025
11
citations

On Harmonizing Implicit Subpopulations

ICLR 2024
8
citations

Learning to Instruct for Visual Instruction Tuning

NeurIPS 2025
3
citations

Differential-informed Sample Selection Accelerates Multimodal Contrastive Learning

ICCV 2025
2
citations

Fine-tuning with Reserved Majority for Noise Reduction

ICLR 2025
2
citations

HarmoDT: Harmony Multi-Task Decision Transformer for Offline Reinforcement Learning

ICML 2024
0
citations

Q-value Regularized Transformer for Offline Reinforcement Learning

ICML 2024
0
citations

Locally Estimated Global Perturbations are Better than Local Perturbations for Federated Sharpness-aware Minimization

ICML 2024
0
citations

Self-Alignment of Large Language Models via Monopolylogue-based Social Scene Simulation

ICML 2024
0
citations

Exploring Training on Heterogeneous Data with Mixture of Low-rank Adapters

ICML 2024
0
citations

Diversified Batch Selection for Training Acceleration

ICML 2024
0
citations

Transferable Interactiveness Knowledge for Human-Object Interaction Detection

CVPR 2019
0
citations

Actional-Structural Graph Convolutional Networks for Skeleton-Based Action Recognition

CVPR 2019
0
citations

Dynamic Multiscale Graph Neural Networks for 3D Skeleton Based Human Motion Prediction

CVPR 2020arXiv
0
citations

Iteratively-Refined Interactive 3D Medical Image Segmentation With Multi-Agent Reinforcement Learning

CVPR 2020arXiv
0
citations

A Fourier-Based Framework for Domain Generalization

CVPR 2021arXiv
0
citations

Distilling Vision-Language Pre-Training To Collaborate With Weakly-Supervised Temporal Action Localization

CVPR 2023arXiv
0
citations

Leapfrog Diffusion Model for Stochastic Trajectory Prediction

CVPR 2023arXiv
0
citations

Collaboration Helps Camera Overtake LiDAR in 3D Detection

CVPR 2023arXiv
0
citations

EqMotion: Equivariant Multi-Agent Motion Prediction With Invariant Interaction Reasoning

CVPR 2023arXiv
0
citations

DR2: Diffusion-Based Robust Degradation Remover for Blind Face Restoration

CVPR 2023arXiv
0
citations

LamRA: Large Multimodal Model as Your Advanced Retrieval Assistant

CVPR 2025
0
citations

Accelerate CNN via Recursive Bayesian Pruning

ICCV 2019
0
citations

H2O: A Benchmark for Visual Human-Human Object Handover Analysis

ICCV 2021arXiv
0
citations

Divide and Conquer for Single-Frame Temporal Action Localization

ICCV 2021
0
citations

MedKLIP: Medical Knowledge Enhanced Language-Image Pre-Training for X-ray Diagnosis

ICCV 2023
0
citations

Joint-Relation Transformer for Multi-Person Motion Prediction

ICCV 2023arXiv
0
citations

Open-vocabulary Object Segmentation with Diffusion Models

ICCV 2023arXiv
0
citations

Auxiliary Tasks Benefit 3D Skeleton-based Human Motion Prediction

ICCV 2023arXiv
0
citations

Federated Domain Generalization With Generalization Adjustment

CVPR 2023
0
citations

Advancing Myopia To Holism: Fully Contrastive Language-Image Pre-training

CVPR 2025
0
citations

MRGen: Segmentation Data Engine For Underrepresented MRI Modalities

ICCV 2025
0
citations

RAD: Towards Trustworthy Retrieval-Augmented Multi-modal Clinical Diagnosis

NeurIPS 2025
0
citations

VRVVC: Variable-Rate NeRF-Based Volumetric Video Compression

AAAI 2025
0
citations

Low-Rank Knowledge Decomposition for Medical Foundation Models

CVPR 2024
0
citations

Intelligent Grimm - Open-ended Visual Storytelling via Latent Diffusion Models

CVPR 2024
0
citations

Editable Scene Simulation for Autonomous Driving via Collaborative LLM-Agents

CVPR 2024
0
citations

Mitigating Noisy Correspondence by Geometrical Structure Consistency Learning

CVPR 2024
0
citations

Adapting Visual-Language Models for Generalizable Anomaly Detection in Medical Images

CVPR 2024
0
citations

AttrSeg: Open-Vocabulary Semantic Segmentation via Attribute Decomposition-Aggregation

NeurIPS 2023
0
citations

Combating Representation Learning Disparity with Geometric Harmonization

NeurIPS 2023
0
citations

Federated Learning with Bilateral Curation for Partially Class-Disjoint Data

NeurIPS 2023
0
citations

Uncovering Prototypical Knowledge for Weakly Open-Vocabulary Semantic Segmentation

NeurIPS 2023
0
citations