Yan Wang
88
Papers
307
Total Citations
Papers (88)
A Powerful Generative Model Using Random Weights for the Deep Image Representation
NeurIPS 2016arXiv
79
citations
Unleashing the Potential of SAM for Medical Adaptation via Hierarchical Decoding
CVPR 2024
60
citations
Enabling Deep Residual Networks for Weakly Supervised Object Detection
ECCV 2020
49
citations
Language-Image Models with 3D Understanding
ICLR 2025
27
citations
MEGA: Memory-Efficient 4D Gaussian Splatting for Dynamic Scenes
ICCV 2025
21
citations
Unveiling the Mist over 3D Vision-Language Understanding: Object-centric Evaluation with Chain-of-Analysis
CVPR 2025
17
citations
MambaIC: State Space Models for High-Performance Learned Image Compression
CVPR 2025
14
citations
Rethinking Diffusion Posterior Sampling: From Conditional Score Estimator to Maximizing a Posterior
ICLR 2025
12
citations
Task-Aware Encoder Control for Deep Video Compression
CVPR 2024
8
citations
Partial Label Learning with a Partner
AAAI 2024
6
citations
Probability-Polarized Optimal Transport for Unsupervised Domain Adaptation
AAAI 2024
6
citations
Spatially-Variant Degradation Model for Dataset-free Super-resolution
ECCV 2024
3
citations
LLM4RSR: Large Language Models as Data Correctors for Robust Sequential Recommendation
AAAI 2025
2
citations
Physical-aware Neural Radiance Fields for Efficient Exposure Correction
AAAI 2025
2
citations
Multimodal Hypothetical Summary for Retrieval-based Multi-image Question Answering
AAAI 2025
1
citations
Object Attribute Matters in Visual Question Answering
AAAI 2024
0
citations
Pixel-level Semantic Correspondence through Layout-aware Representation Learning and Multi-scale Matching Integration
CVPR 2024
0
citations
CAMixerSR: Only Details Need More "Attention"
CVPR 2024
0
citations
Boosting Neural Representations for Videos with a Conditional Decoder
CVPR 2024
0
citations
Causal-CoG: A Causal-Effect Look at Context Generation for Boosting Multi-modal Language Models
CVPR 2024
0
citations
Visual-Augmented Dynamic Semantic Prototype for Generative Zero-Shot Learning
CVPR 2024
0
citations
AdaRevD: Adaptive Patch Exiting Reversible Decoder Pushes the Limit of Image Deblurring
CVPR 2024
0
citations
CogAgent: A Visual Language Model for GUI Agents
CVPR 2024
0
citations
RepAn: Enhanced Annealing through Re-parameterization
CVPR 2024
0
citations
PARA-Drive: Parallelized Architecture for Real-time Autonomous Driving
CVPR 2024
0
citations
Correlation-Decoupled Knowledge Distillation for Multimodal Sentiment Analysis with Incomplete Modalities
CVPR 2024
0
citations
An Embodied Generalist Agent in 3D World
ICML 2024
0
citations
DeepContour: A Deep Convolutional Feature Learned by Positive-Sharing Loss for Contour Detection
CVPR 2015
0
citations
Object Skeleton Extraction in Natural Images by Fusing Scale-Associated Deep Side Outputs
CVPR 2016
0
citations
Deep Regression Forests for Age Estimation
CVPR 2018arXiv
0
citations
Generative Adversarial Learning Towards Fast Weakly Supervised Detection
CVPR 2018
0
citations
Resource Aware Person Re-Identification Across Multiple Resolutions
CVPR 2018arXiv
0
citations
Recurrent Saliency Transformation Network: Incorporating Multi-Stage Visual Cues for Small Organ Segmentation
CVPR 2018arXiv
0
citations
Cyclic Guidance for Weakly Supervised Joint Detection and Segmentation
CVPR 2019
0
citations
Fully Quantized Network for Object Detection
CVPR 2019
0
citations
Pseudo-LiDAR From Visual Depth Estimation: Bridging the Gap in 3D Object Detection for Autonomous Driving
CVPR 2019
0
citations
Deep Distance Transform for Tubular Structure Segmentation in CT Scans
CVPR 2020arXiv
0
citations
HRank: Filter Pruning Using High-Rank Feature Map
CVPR 2020arXiv
0
citations
End-to-End Pseudo-LiDAR for Image-Based 3D Object Detection
CVPR 2020arXiv
0
citations
Train in Germany, Test in the USA: Making 3D Object Detectors Generalize
CVPR 2020arXiv
0
citations
Checkerboard Context Model for Efficient Learned Image Compression
CVPR 2021arXiv
0
citations
ContrastMask: Contrastive Learning To Segment Every Thing
CVPR 2022arXiv
0
citations
ELIC: Efficient Learned Image Compression With Unevenly Grouped Space-Channel Contextual Adaptive Coding
CVPR 2022arXiv
0
citations
Ithaca365: Dataset and Driving Perception Under Repeated and Challenging Weather Conditions
CVPR 2022
0
citations
Practical Learned Lossless JPEG Recompression With Multi-Level Cross-Channel Entropy Model in the DCT Domain
CVPR 2022arXiv
0
citations
Class Balanced Adaptive Pseudo Labeling for Federated Semi-Supervised Learning
CVPR 2023
0
citations
Privacy-Preserving Adversarial Facial Features
CVPR 2023arXiv
0
citations
MagicNet: Semi-Supervised Multi-Organ Segmentation via Magic-Cube Partition and Recovery
CVPR 2023arXiv
0
citations
Bidirectional Copy-Paste for Semi-Supervised Medical Image Segmentation
CVPR 2023arXiv
0
citations
Meta Architecture for Point Cloud Analysis
CVPR 2023arXiv
0
citations
SORT: Second-Order Response Transform for Visual Recognition
ICCV 2017arXiv
0
citations
Multi-Stage Multi-Recursive-Input Fully Convolutional Networks for Neuronal Boundary Detection
ICCV 2017arXiv
0
citations
Recognition of Action Units in the Wild With Deep Nets and a New Global-Local Loss
ICCV 2017
0
citations
Deep Co-Training With Task Decomposition for Semi-Supervised Domain Adaptation
ICCV 2021arXiv
0
citations
AIDE: A Vision-Driven Multi-View, Multi-Modal, Multi-Tasking Dataset for Assistive Driving Perception
ICCV 2023arXiv
0
citations
Efficient Decision-based Black-box Patch Attacks on Video Recognition
ICCV 2023arXiv
0
citations
Rethinking Safe Semi-supervised Learning: Transferring the Open-set Problem to A Close-set One
ICCV 2023
0
citations
Automatic Network Pruning via Hilbert-Schmidt Independence Criterion Lasso under Information Bottleneck Principle
ICCV 2023
0
citations
RIBAC: Towards Robust and Imperceptible Backdoor Attack against Compact DNN
ECCV 2022
0
citations
Black-Box Dissector: Towards Erasing-Based Hard-Label Model Stealing Attack
ECCV 2022
0
citations
FERV39k: A Large-Scale Multi-Scene Dataset for Facial Expression Recognition in Videos
CVPR 2022arXiv
0
citations
Masked Point-Entity Contrast for Open-Vocabulary 3D Scene Understanding
CVPR 2025
0
citations
PICD: Versatile Perceptual Image Compression with Diffusion Rendering
CVPR 2025
0
citations
D2SP: Dynamic Dual-Stage Purification Framework for Dual Noise Mitigation in Vision-based Affective Recognition.
CVPR 2025
0
citations
Medusa: A Multi-Scale High-order Contrastive Dual-Diffusion Approach for Multi-View Clustering
CVPR 2025
0
citations
Extrapolated Urban View Synthesis Benchmark
ICCV 2025
0
citations
MamV2XCalib: V2X-based Target-less Infrastructure Camera Calibration with State Space Model
ICCV 2025
0
citations
OUS: Bridging Scene Context and Facial Features to Overcome the Rigid Cognitive Problem
AAAI 2025
0
citations
CAMSIC: Content-aware Masked Image Modeling Transformer for Stereo Image Compression
AAAI 2025
0
citations
GapMatch: Bridging Instance and Model Perturbations for Enhanced Semi-Supervised Medical Image Segmentation
AAAI 2025
0
citations
Variable Importance in High-Dimensional Settings Requires Grouping
AAAI 2024
0
citations
Fine-Tuning Large Language Model Based Explainable Recommendation with Explainable Quality Reward
AAAI 2024
0
citations
A User-Friendly Framework for Generating Model-Preferred Prompts in Text-to-Image
AAAI 2024
0
citations
LLMRG: Improving Recommendations through Large Language Model Reasoning Graphs
AAAI 2024
0
citations
Collaborative Consortium of Foundation Models for Open-World Few-Shot Learning
AAAI 2024
0
citations
Variational Structured Semantic Inference for Diverse Image Captioning
NeurIPS 2019
0
citations
Rotated Binary Neural Network
NeurIPS 2020
0
citations
Wasserstein Distances for Stereo Disparity Estimation
NeurIPS 2020
0
citations
Neural Surface Reconstruction of Dynamic Scenes with Monocular RGB-D Camera
NeurIPS 2022
0
citations
Multi-Sample Training for Neural Image Compression
NeurIPS 2022
0
citations
Flexible Neural Image Compression via Code Editing
NeurIPS 2022
0
citations
A Contrastive Framework for Neural Text Generation
NeurIPS 2022
0
citations
Theoretically Guaranteed Bidirectional Data Rectification for Robust Sequential Recommendation
NeurIPS 2023
0
citations
Waymax: An Accelerated, Data-Driven Simulator for Large-Scale Autonomous Driving Research
NeurIPS 2023
0
citations
Idempotent Learned Image Compression with Right-Inverse
NeurIPS 2023
0
citations
Prompt-augmented Temporal Point Process for Streaming Event Sequence
NeurIPS 2023
0
citations
Stability of Random Forests and Coverage of Random-Forest Prediction Intervals
NeurIPS 2023
0
citations
Exploiting Contextual Objects and Relations for 3D Visual Grounding
NeurIPS 2023
0
citations