Yan Wang
39
Papers
179
Total Citations
Papers (39)
Unleashing the Potential of SAM for Medical Adaptation via Hierarchical Decoding
CVPR 2024
60
citations
Language-Image Models with 3D Understanding
ICLR 2025
27
citations
MEGA: Memory-Efficient 4D Gaussian Splatting for Dynamic Scenes
ICCV 2025
21
citations
Unveiling the Mist over 3D Vision-Language Understanding: Object-centric Evaluation with Chain-of-Analysis
CVPR 2025
17
citations
MambaIC: State Space Models for High-Performance Learned Image Compression
CVPR 2025
14
citations
Rethinking Diffusion Posterior Sampling: From Conditional Score Estimator to Maximizing a Posterior
ICLR 2025
12
citations
Task-Aware Encoder Control for Deep Video Compression
CVPR 2024
8
citations
Probability-Polarized Optimal Transport for Unsupervised Domain Adaptation
AAAI 2024
6
citations
Partial Label Learning with a Partner
AAAI 2024
6
citations
Spatially-Variant Degradation Model for Dataset-free Super-resolution
ECCV 2024
3
citations
LLM4RSR: Large Language Models as Data Correctors for Robust Sequential Recommendation
AAAI 2025
2
citations
Physical-aware Neural Radiance Fields for Efficient Exposure Correction
AAAI 2025
2
citations
Multimodal Hypothetical Summary for Retrieval-based Multi-image Question Answering
AAAI 2025
1
citation
LLMRG: Improving Recommendations through Large Language Model Reasoning Graphs
AAAI 2024
0
citations
Collaborative Consortium of Foundation Models for Open-World Few-Shot Learning
AAAI 2024
0
citations
Object Attribute Matters in Visual Question Answering
AAAI 2024
0
citations
Pixel-level Semantic Correspondence through Layout-aware Representation Learning and Multi-scale Matching Integration
CVPR 2024
0
citations
CAMixerSR: Only Details Need More "Attention"
CVPR 2024
0
citations
Boosting Neural Representations for Videos with a Conditional Decoder
CVPR 2024
0
citations
Causal-CoG: A Causal-Effect Look at Context Generation for Boosting Multi-modal Language Models
CVPR 2024
0
citations
Visual-Augmented Dynamic Semantic Prototype for Generative Zero-Shot Learning
CVPR 2024
0
citations
AdaRevD: Adaptive Patch Exiting Reversible Decoder Pushes the Limit of Image Deblurring
CVPR 2024
0
citations
CogAgent: A Visual Language Model for GUI Agents
CVPR 2024
0
citations
RepAn: Enhanced Annealing through Re-parameterization
CVPR 2024
0
citations
PARA-Drive: Parallelized Architecture for Real-time Autonomous Driving
CVPR 2024
0
citations
Correlation-Decoupled Knowledge Distillation for Multimodal Sentiment Analysis with Incomplete Modalities
CVPR 2024
0
citations
An Embodied Generalist Agent in 3D World
ICML 2024
0
citations
Masked Point-Entity Contrast for Open-Vocabulary 3D Scene Understanding
CVPR 2025
0
citations
PICD: Versatile Perceptual Image Compression with Diffusion Rendering
CVPR 2025
0
citations
D2SP: Dynamic Dual-Stage Purification Framework for Dual Noise Mitigation in Vision-based Affective Recognition
CVPR 2025
0
citations
Medusa: A Multi-Scale High-order Contrastive Dual-Diffusion Approach for Multi-View Clustering
CVPR 2025
0
citations
Extrapolated Urban View Synthesis Benchmark
ICCV 2025
0
citations
MamV2XCalib: V2X-based Target-less Infrastructure Camera Calibration with State Space Model
ICCV 2025
0
citations
OUS: Bridging Scene Context and Facial Features to Overcome the Rigid Cognitive Problem
AAAI 2025
0
citations
CAMSIC: Content-aware Masked Image Modeling Transformer for Stereo Image Compression
AAAI 2025
0
citations
GapMatch: Bridging Instance and Model Perturbations for Enhanced Semi-Supervised Medical Image Segmentation
AAAI 2025
0
citations
Variable Importance in High-Dimensional Settings Requires Grouping
AAAI 2024
0
citations
Fine-Tuning Large Language Model Based Explainable Recommendation with Explainable Quality Reward
AAAI 2024
0
citations
A User-Friendly Framework for Generating Model-Preferred Prompts in Text-to-Image
AAAI 2024
0
citations