Yan Wang

Papers: 88
Total Citations: 307

Papers (88)

A Powerful Generative Model Using Random Weights for the Deep Image Representation

NeurIPS 2016 (arXiv)
79
citations

Unleashing the Potential of SAM for Medical Adaptation via Hierarchical Decoding

CVPR 2024
60
citations

Enabling Deep Residual Networks for Weakly Supervised Object Detection

ECCV 2020
49
citations

Language-Image Models with 3D Understanding

ICLR 2025
27
citations

MEGA: Memory-Efficient 4D Gaussian Splatting for Dynamic Scenes

ICCV 2025
21
citations

Unveiling the Mist over 3D Vision-Language Understanding: Object-centric Evaluation with Chain-of-Analysis

CVPR 2025
17
citations

MambaIC: State Space Models for High-Performance Learned Image Compression

CVPR 2025
14
citations

Rethinking Diffusion Posterior Sampling: From Conditional Score Estimator to Maximizing a Posterior

ICLR 2025
12
citations

Task-Aware Encoder Control for Deep Video Compression

CVPR 2024
8
citations

Partial Label Learning with a Partner

AAAI 2024
6
citations

Probability-Polarized Optimal Transport for Unsupervised Domain Adaptation

AAAI 2024
6
citations

Spatially-Variant Degradation Model for Dataset-free Super-resolution

ECCV 2024
3
citations

LLM4RSR: Large Language Models as Data Correctors for Robust Sequential Recommendation

AAAI 2025
2
citations

Physical-aware Neural Radiance Fields for Efficient Exposure Correction

AAAI 2025
2
citations

Multimodal Hypothetical Summary for Retrieval-based Multi-image Question Answering

AAAI 2025
1
citation

Object Attribute Matters in Visual Question Answering

AAAI 2024
0
citations

Pixel-level Semantic Correspondence through Layout-aware Representation Learning and Multi-scale Matching Integration

CVPR 2024
0
citations

CAMixerSR: Only Details Need More "Attention"

CVPR 2024
0
citations

Boosting Neural Representations for Videos with a Conditional Decoder

CVPR 2024
0
citations

Causal-CoG: A Causal-Effect Look at Context Generation for Boosting Multi-modal Language Models

CVPR 2024
0
citations

Visual-Augmented Dynamic Semantic Prototype for Generative Zero-Shot Learning

CVPR 2024
0
citations

AdaRevD: Adaptive Patch Exiting Reversible Decoder Pushes the Limit of Image Deblurring

CVPR 2024
0
citations

CogAgent: A Visual Language Model for GUI Agents

CVPR 2024
0
citations

RepAn: Enhanced Annealing through Re-parameterization

CVPR 2024
0
citations

PARA-Drive: Parallelized Architecture for Real-time Autonomous Driving

CVPR 2024
0
citations

Correlation-Decoupled Knowledge Distillation for Multimodal Sentiment Analysis with Incomplete Modalities

CVPR 2024
0
citations

An Embodied Generalist Agent in 3D World

ICML 2024
0
citations

DeepContour: A Deep Convolutional Feature Learned by Positive-Sharing Loss for Contour Detection

CVPR 2015
0
citations

Object Skeleton Extraction in Natural Images by Fusing Scale-Associated Deep Side Outputs

CVPR 2016
0
citations

Deep Regression Forests for Age Estimation

CVPR 2018 (arXiv)
0
citations

Generative Adversarial Learning Towards Fast Weakly Supervised Detection

CVPR 2018
0
citations

Resource Aware Person Re-Identification Across Multiple Resolutions

CVPR 2018 (arXiv)
0
citations

Recurrent Saliency Transformation Network: Incorporating Multi-Stage Visual Cues for Small Organ Segmentation

CVPR 2018 (arXiv)
0
citations

Cyclic Guidance for Weakly Supervised Joint Detection and Segmentation

CVPR 2019
0
citations

Fully Quantized Network for Object Detection

CVPR 2019
0
citations

Pseudo-LiDAR From Visual Depth Estimation: Bridging the Gap in 3D Object Detection for Autonomous Driving

CVPR 2019
0
citations

Deep Distance Transform for Tubular Structure Segmentation in CT Scans

CVPR 2020 (arXiv)
0
citations

HRank: Filter Pruning Using High-Rank Feature Map

CVPR 2020 (arXiv)
0
citations

End-to-End Pseudo-LiDAR for Image-Based 3D Object Detection

CVPR 2020 (arXiv)
0
citations

Train in Germany, Test in the USA: Making 3D Object Detectors Generalize

CVPR 2020 (arXiv)
0
citations

Checkerboard Context Model for Efficient Learned Image Compression

CVPR 2021 (arXiv)
0
citations

ContrastMask: Contrastive Learning To Segment Every Thing

CVPR 2022 (arXiv)
0
citations

ELIC: Efficient Learned Image Compression With Unevenly Grouped Space-Channel Contextual Adaptive Coding

CVPR 2022 (arXiv)
0
citations

Ithaca365: Dataset and Driving Perception Under Repeated and Challenging Weather Conditions

CVPR 2022
0
citations

Practical Learned Lossless JPEG Recompression With Multi-Level Cross-Channel Entropy Model in the DCT Domain

CVPR 2022 (arXiv)
0
citations

Class Balanced Adaptive Pseudo Labeling for Federated Semi-Supervised Learning

CVPR 2023
0
citations

Privacy-Preserving Adversarial Facial Features

CVPR 2023 (arXiv)
0
citations

MagicNet: Semi-Supervised Multi-Organ Segmentation via Magic-Cube Partition and Recovery

CVPR 2023 (arXiv)
0
citations

Bidirectional Copy-Paste for Semi-Supervised Medical Image Segmentation

CVPR 2023 (arXiv)
0
citations

Meta Architecture for Point Cloud Analysis

CVPR 2023 (arXiv)
0
citations

SORT: Second-Order Response Transform for Visual Recognition

ICCV 2017 (arXiv)
0
citations

Multi-Stage Multi-Recursive-Input Fully Convolutional Networks for Neuronal Boundary Detection

ICCV 2017 (arXiv)
0
citations

Recognition of Action Units in the Wild With Deep Nets and a New Global-Local Loss

ICCV 2017
0
citations

Deep Co-Training With Task Decomposition for Semi-Supervised Domain Adaptation

ICCV 2021 (arXiv)
0
citations

AIDE: A Vision-Driven Multi-View, Multi-Modal, Multi-Tasking Dataset for Assistive Driving Perception

ICCV 2023 (arXiv)
0
citations

Efficient Decision-based Black-box Patch Attacks on Video Recognition

ICCV 2023 (arXiv)
0
citations

Rethinking Safe Semi-supervised Learning: Transferring the Open-set Problem to A Close-set One

ICCV 2023
0
citations

Automatic Network Pruning via Hilbert-Schmidt Independence Criterion Lasso under Information Bottleneck Principle

ICCV 2023
0
citations

RIBAC: Towards Robust and Imperceptible Backdoor Attack against Compact DNN

ECCV 2022
0
citations

Black-Box Dissector: Towards Erasing-Based Hard-Label Model Stealing Attack

ECCV 2022
0
citations

FERV39k: A Large-Scale Multi-Scene Dataset for Facial Expression Recognition in Videos

CVPR 2022 (arXiv)
0
citations

Masked Point-Entity Contrast for Open-Vocabulary 3D Scene Understanding

CVPR 2025
0
citations

PICD: Versatile Perceptual Image Compression with Diffusion Rendering

CVPR 2025
0
citations

D2SP: Dynamic Dual-Stage Purification Framework for Dual Noise Mitigation in Vision-based Affective Recognition

CVPR 2025
0
citations

Medusa: A Multi-Scale High-order Contrastive Dual-Diffusion Approach for Multi-View Clustering

CVPR 2025
0
citations

Extrapolated Urban View Synthesis Benchmark

ICCV 2025
0
citations

MamV2XCalib: V2X-based Target-less Infrastructure Camera Calibration with State Space Model

ICCV 2025
0
citations

OUS: Bridging Scene Context and Facial Features to Overcome the Rigid Cognitive Problem

AAAI 2025
0
citations

CAMSIC: Content-aware Masked Image Modeling Transformer for Stereo Image Compression

AAAI 2025
0
citations

GapMatch: Bridging Instance and Model Perturbations for Enhanced Semi-Supervised Medical Image Segmentation

AAAI 2025
0
citations

Variable Importance in High-Dimensional Settings Requires Grouping

AAAI 2024
0
citations

Fine-Tuning Large Language Model Based Explainable Recommendation with Explainable Quality Reward

AAAI 2024
0
citations

A User-Friendly Framework for Generating Model-Preferred Prompts in Text-to-Image

AAAI 2024
0
citations

LLMRG: Improving Recommendations through Large Language Model Reasoning Graphs

AAAI 2024
0
citations

Collaborative Consortium of Foundation Models for Open-World Few-Shot Learning

AAAI 2024
0
citations

Variational Structured Semantic Inference for Diverse Image Captioning

NeurIPS 2019
0
citations

Rotated Binary Neural Network

NeurIPS 2020
0
citations

Wasserstein Distances for Stereo Disparity Estimation

NeurIPS 2020
0
citations

Neural Surface Reconstruction of Dynamic Scenes with Monocular RGB-D Camera

NeurIPS 2022
0
citations

Multi-Sample Training for Neural Image Compression

NeurIPS 2022
0
citations

Flexible Neural Image Compression via Code Editing

NeurIPS 2022
0
citations

A Contrastive Framework for Neural Text Generation

NeurIPS 2022
0
citations

Theoretically Guaranteed Bidirectional Data Rectification for Robust Sequential Recommendation

NeurIPS 2023
0
citations

Waymax: An Accelerated, Data-Driven Simulator for Large-Scale Autonomous Driving Research

NeurIPS 2023
0
citations

Idempotent Learned Image Compression with Right-Inverse

NeurIPS 2023
0
citations

Prompt-augmented Temporal Point Process for Streaming Event Sequence

NeurIPS 2023
0
citations

Stability of Random Forests and Coverage of Random-Forest Prediction Intervals

NeurIPS 2023
0
citations

Exploiting Contextual Objects and Relations for 3D Visual Grounding

NeurIPS 2023
0
citations