Yan Wang

39
Papers
179
Total Citations

Papers (39)

Unleashing the Potential of SAM for Medical Adaptation via Hierarchical Decoding

CVPR 2024
60
citations

Language-Image Models with 3D Understanding

ICLR 2025
27
citations

MEGA: Memory-Efficient 4D Gaussian Splatting for Dynamic Scenes

ICCV 2025
21
citations

Unveiling the Mist over 3D Vision-Language Understanding: Object-centric Evaluation with Chain-of-Analysis

CVPR 2025
17
citations

MambaIC: State Space Models for High-Performance Learned Image Compression

CVPR 2025
14
citations

Rethinking Diffusion Posterior Sampling: From Conditional Score Estimator to Maximizing a Posterior

ICLR 2025
12
citations

Task-Aware Encoder Control for Deep Video Compression

CVPR 2024
8
citations

Probability-Polarized Optimal Transport for Unsupervised Domain Adaptation

AAAI 2024
6
citations

Partial Label Learning with a Partner

AAAI 2024
6
citations

Spatially-Variant Degradation Model for Dataset-free Super-resolution

ECCV 2024
3
citations

LLM4RSR: Large Language Models as Data Correctors for Robust Sequential Recommendation

AAAI 2025
2
citations

Physical-aware Neural Radiance Fields for Efficient Exposure Correction

AAAI 2025
2
citations

Multimodal Hypothetical Summary for Retrieval-based Multi-image Question Answering

AAAI 2025
1
citations

LLMRG: Improving Recommendations through Large Language Model Reasoning Graphs

AAAI 2024
0
citations

Collaborative Consortium of Foundation Models for Open-World Few-Shot Learning

AAAI 2024
0
citations

Object Attribute Matters in Visual Question Answering

AAAI 2024
0
citations

Pixel-level Semantic Correspondence through Layout-aware Representation Learning and Multi-scale Matching Integration

CVPR 2024
0
citations

CAMixerSR: Only Details Need More "Attention"

CVPR 2024
0
citations

Boosting Neural Representations for Videos with a Conditional Decoder

CVPR 2024
0
citations

Causal-CoG: A Causal-Effect Look at Context Generation for Boosting Multi-modal Language Models

CVPR 2024
0
citations

Visual-Augmented Dynamic Semantic Prototype for Generative Zero-Shot Learning

CVPR 2024
0
citations

AdaRevD: Adaptive Patch Exiting Reversible Decoder Pushes the Limit of Image Deblurring

CVPR 2024
0
citations

CogAgent: A Visual Language Model for GUI Agents

CVPR 2024
0
citations

RepAn: Enhanced Annealing through Re-parameterization

CVPR 2024
0
citations

PARA-Drive: Parallelized Architecture for Real-time Autonomous Driving

CVPR 2024
0
citations

Correlation-Decoupled Knowledge Distillation for Multimodal Sentiment Analysis with Incomplete Modalities

CVPR 2024
0
citations

An Embodied Generalist Agent in 3D World

ICML 2024
0
citations

Masked Point-Entity Contrast for Open-Vocabulary 3D Scene Understanding

CVPR 2025
0
citations

PICD: Versatile Perceptual Image Compression with Diffusion Rendering

CVPR 2025
0
citations

D2SP: Dynamic Dual-Stage Purification Framework for Dual Noise Mitigation in Vision-based Affective Recognition.

CVPR 2025
0
citations

Medusa: A Multi-Scale High-order Contrastive Dual-Diffusion Approach for Multi-View Clustering

CVPR 2025
0
citations

Extrapolated Urban View Synthesis Benchmark

ICCV 2025
0
citations

MamV2XCalib: V2X-based Target-less Infrastructure Camera Calibration with State Space Model

ICCV 2025
0
citations

OUS: Bridging Scene Context and Facial Features to Overcome the Rigid Cognitive Problem

AAAI 2025
0
citations

CAMSIC: Content-aware Masked Image Modeling Transformer for Stereo Image Compression

AAAI 2025
0
citations

GapMatch: Bridging Instance and Model Perturbations for Enhanced Semi-Supervised Medical Image Segmentation

AAAI 2025
0
citations

Variable Importance in High-Dimensional Settings Requires Grouping

AAAI 2024
0
citations

Fine-Tuning Large Language Model Based Explainable Recommendation with Explainable Quality Reward

AAAI 2024
0
citations

A User-Friendly Framework for Generating Model-Preferred Prompts in Text-to-Image

AAAI 2024
0
citations