Chunhua Shen
154
Papers
1,217
Total Citations
Papers (154)
Depth and Surface Normal Estimation From Monocular Images Using Regression on Deep Features and Hierarchical CRFs
CVPR 2015
628
citations
Efficient Semantic Video Segmentation with Per-frame Inference
ECCV 2020
138
citations
8976 PointAttN: You Only Need Attention for Point Cloud Completion
AAAI 2024
92
citations
Image Restoration Using Very Deep Convolutional Encoder-Decoder Networks with Symmetric Skip Connections
NeurIPS 2016arXiv
85
citations
Weighing Counts: Sequential Crowd Counting by Reinforcement Learning
ECCV 2020
80
citations
What Matters When Repurposing Diffusion Models for General Dense Perception Tasks?
ICLR 2025arXiv
56
citations
Aether: Geometric-Aware Unified World Modeling
ICCV 2025
47
citations
Representative Graph Neural Network
ECCV 2020
44
citations
Omni-R1: Reinforcement Learning for Omnimodal Reasoning via Two-System Collaboration
NeurIPS 2025
12
citations
FreeCompose: Generic Zero-Shot Image Composition with Diffusion Prior
ECCV 2024
12
citations
SegAgent: Exploring Pixel Understanding Capabilities in MLLMs by Imitating Human Annotator Trajectories
CVPR 2025
11
citations
TG-LLaVA: Text Guided LLaVA via Learnable Latent Embeddings
AAAI 2025
8
citations
Revisiting Convolution Architecture in the Realm of DNA Foundation Models
ICLR 2025arXiv
4
citations
On the Trajectory Regularity of ODE-based Diffusion Sampling
ICML 2024
0
citations
Supervised Discrete Hashing
CVPR 2015
0
citations
Mid-Level Deep Pattern Mining
CVPR 2015
0
citations
Learning to Rank in Person Re-Identification With Metric Ensembles
CVPR 2015
0
citations
Efficient SDP Inference for Fully-Connected CRFs Based on Low-Rank Decomposition
CVPR 2015
0
citations
Learning Graph Structure for Multi-Label Image Classification via Clique Generation
CVPR 2015
0
citations
The Treasure Beneath Convolutional Layers: Cross-Convolutional-Layer Pooling for Image Classification
CVPR 2015
0
citations
Deep Convolutional Neural Fields for Depth Estimation From a Single Image
CVPR 2015
0
citations
What Value Do Explicit High Level Concepts Have in Vision to Language Problems?
CVPR 2016
0
citations
What's Wrong With That Object? Identifying Images of Unusual Objects by Modelling the Detection Score Distribution
CVPR 2016
0
citations
Less Is More: Zero-Shot Learning From Online Textual Documents With Noise Suppression
CVPR 2016
0
citations
Efficient Piecewise Training of Deep Structured Models for Semantic Segmentation
CVPR 2016
0
citations
Ask Me Anything: Free-Form Visual Question Answering Based on Knowledge From External Sources
CVPR 2016
0
citations
Fast Training of Triplet-Based Deep Binary Embedding Networks
CVPR 2016
0
citations
The VQA-Machine: Learning How to Use Existing Vision Algorithms to Answer New Questions
CVPR 2017
0
citations
Sequential Person Recognition in Photo Albums With a Recurrent Network
CVPR 2017arXiv
0
citations
Attend in Groups: A Weakly-Supervised Deep Learning Framework for Learning From Web Data
CVPR 2017arXiv
0
citations
RefineNet: Multi-Path Refinement Networks for High-Resolution Semantic Segmentation
CVPR 2017arXiv
0
citations
From Motion Blur to Motion Flow: A Deep Learning Solution for Removing Heterogeneous Motion Blur
CVPR 2017arXiv
0
citations
Multi-Attention Network for One Shot Learning
CVPR 2017
0
citations
Monocular Relative Depth Perception With Web Stereo Data Supervision
CVPR 2018
0
citations
Bootstrapping the Performance of Webly Supervised Semantic Segmentation
CVPR 2018
0
citations
FSRNet: End-to-End Learning Face Super-Resolution With Facial Priors
CVPR 2018arXiv
0
citations
Parallel Attention: A Unified Framework for Visual Object Discovery Through Dialogs and Queries
CVPR 2018arXiv
0
citations
An End-to-End TextSpotter With Explicit Alignment and Attention
CVPR 2018arXiv
0
citations
Are You Talking to Me? Reasoned Visual Dialog Generation Through Adversarial Learning
CVPR 2018arXiv
0
citations
Visual Question Answering With Memory-Augmented Networks
CVPR 2018arXiv
0
citations
Repulsion Loss: Detecting Pedestrians in a Crowd
CVPR 2018arXiv
0
citations
Towards Effective Low-Bitwidth Convolutional Neural Networks
CVPR 2018arXiv
0
citations
VITAL: VIsual Tracking via Adversarial Learning
CVPR 2018arXiv
0
citations
Structured Binary Neural Networks for Accurate Image Classification and Semantic Segmentation
CVPR 2019
0
citations
Knowledge Adaptation for Efficient Semantic Segmentation
CVPR 2019
0
citations
Attention-Guided Network for Ghost-Free High Dynamic Range Imaging
CVPR 2019
0
citations
Neighbourhood Watch: Referring Expression Comprehension via Language-Guided Graph Attention Networks
CVPR 2019
0
citations
Mind Your Neighbours: Image Annotation With Metadata Neighbourhood Graph Co-Attention Networks
CVPR 2019
0
citations
Decoders Matter for Semantic Segmentation: Data-Dependent Decoding Enables Flexible Feature Aggregation
CVPR 2019
0
citations
Associatively Segmenting Instances and Semantics in Point Clouds
CVPR 2019
0
citations
CANet: Class-Agnostic Segmentation Networks With Iterative Refinement and Attentive Few-Shot Learning
CVPR 2019
0
citations
Visual Question Answering as Reading Comprehension
CVPR 2019
0
citations
Fast Neural Architecture Search of Compact Semantic Segmentation Models via Auxiliary Cells
CVPR 2019
0
citations
Training Quantized Neural Networks With a Full-Precision Auxiliary Module
CVPR 2020arXiv
0
citations
Memory-Efficient Hierarchical Neural Architecture Search for Image Denoising
CVPR 2020arXiv
0
citations
BlendMask: Top-Down Meets Bottom-Up for Instance Segmentation
CVPR 2020arXiv
0
citations
DeepEMD: Few-Shot Image Classification With Differentiable Earth Mover's Distance and Structured Classifiers
CVPR 2020
0
citations
ABCNet: Real-Time Scene Text Spotting With Adaptive Bezier-Curve Network
CVPR 2020arXiv
0
citations
Context Prior for Scene Segmentation
CVPR 2020arXiv
0
citations
Mask Encoding for Single Shot Instance Segmentation
CVPR 2020arXiv
0
citations
NAS-FCOS: Fast Neural Architecture Search for Object Detection
CVPR 2020
0
citations
On the General Value of Evidence, and Bilingual Scene-Text Visual Question Answering
CVPR 2020arXiv
0
citations
REVERIE: Remote Embodied Visual Referring Expression in Real Indoor Environments
CVPR 2020arXiv
0
citations
Self-Trained Deep Ordinal Regression for End-to-End Video Anomaly Detection
CVPR 2020arXiv
0
citations
PolarMask: Single Shot Instance Segmentation With Polar Representation
CVPR 2020arXiv
0
citations
DoDNet: Learning To Segment Multi-Organ and Tumors From Multiple Partially Labeled Datasets
CVPR 2021arXiv
0
citations
Learning To Recover 3D Scene Shape From a Single Image
CVPR 2021arXiv
0
citations
Graph Attention Tracking
CVPR 2021arXiv
0
citations
AQD: Towards Accurate Quantized Object Detection
CVPR 2021arXiv
0
citations
Generic Perceptual Loss for Modeling Structured Output Dependencies
CVPR 2021arXiv
0
citations
DyCo3D: Robust Instance Segmentation of 3D Point Clouds Through Dynamic Convolution
CVPR 2021arXiv
0
citations
Learning Spatial-Semantic Relationship for Facial Attribute Recognition With Limited Labeled Data
CVPR 2021
0
citations
Feature Decomposition and Reconstruction Learning for Effective Facial Expression Recognition
CVPR 2021arXiv
0
citations
End-to-End Video Instance Segmentation With Transformers
CVPR 2021arXiv
0
citations
Dense Contrastive Learning for Self-Supervised Visual Pre-Training
CVPR 2021arXiv
0
citations
FCPose: Fully Convolutional Multi-Person Pose Estimation With Dynamic Instance-Aware Convolutions
CVPR 2021arXiv
0
citations
BoxInst: High-Performance Instance Segmentation With Box Annotations
CVPR 2021arXiv
0
citations
HCRF-Flow: Scene Flow From Point Clouds With Continuous High-Order CRFs and Position-Aware Flow Embedding
CVPR 2021
0
citations
Learning Affinity-Aware Upsampling for Deep Image Matting
CVPR 2021arXiv
0
citations
FreeSOLO: Learning To Segment Objects Without Annotations
CVPR 2022arXiv
0
citations
RigidFlow: Self-Supervised Scene Flow Learning on Point Clouds by Local Rigidity Prior
CVPR 2022
0
citations
Catching Both Gray and Black Swans: Open-Set Supervised Anomaly Detection
CVPR 2022arXiv
0
citations
Retrieval Augmented Classification for Long-Tail Visual Recognition
CVPR 2022arXiv
0
citations
Boosting Robustness of Image Matting With Context Assembling and Strong Data Augmentation
CVPR 2022arXiv
0
citations
TopFormer: Token Pyramid Transformer for Mobile Semantic Segmentation
CVPR 2022arXiv
0
citations
Learning Conditional Attributes for Compositional Zero-Shot Learning
CVPR 2023arXiv
0
citations
Images Speak in Images: A Generalist Painter for In-Context Visual Learning
CVPR 2023arXiv
0
citations
Hyperspectral Compressive Sensing Using Manifold-Structured Sparsity Prior
ICCV 2015
0
citations
Towards Context-Aware Interaction Recognition for Visual Relationship Detection
ICCV 2017
0
citations
When Unsupervised Domain Adaptation Meets Tensor Representations
ICCV 2017arXiv
0
citations
Adversarial PoseNet: A Structure-Aware Convolutional Network for Human Pose Estimation
ICCV 2017arXiv
0
citations
Towards End-To-End Text Spotting With Convolutional Recurrent Neural Networks
ICCV 2017arXiv
0
citations
Exploiting Temporal Consistency for Real-Time Video Depth Estimation
ICCV 2019
0
citations
Indices Matter: Learning to Index for Deep Image Matting
ICCV 2019
0
citations
Enforcing Geometric Constraints of Virtual Normal for Depth Prediction
ICCV 2019
0
citations
Self-Training With Progressive Augmentation for Unsupervised Cross-Domain Person Re-Identification
ICCV 2019
0
citations
From Open Set to Closed Set: Counting Objects by Spatial Divide-and-Conquer
ICCV 2019
0
citations
Efficient and Accurate Arbitrary-Shaped Text Detection With Pixel Aggregation Network
ICCV 2019
0
citations
FCOS: Fully Convolutional One-Stage Object Detection
ICCV 2019
0
citations
FATNN: Fast and Accurate Ternary Neural Networks
ICCV 2021arXiv
0
citations
BV-Person: A Large-Scale Dataset for Bird-View Person Re-Identification
ICCV 2021
0
citations
Channel-Wise Knowledge Distillation for Dense Prediction
ICCV 2021arXiv
0
citations
A Simple Baseline for Semi-Supervised Semantic Segmentation With Strong Data Augmentation
ICCV 2021arXiv
0
citations
Meta Navigator: Search for a Good Adaptation Policy for Few-Shot Learning
ICCV 2021arXiv
0
citations
Occluded Person Re-Identification With Single-Scale Global Representations
ICCV 2021
0
citations
DiffuMask: Synthesizing Images with Pixel-level Annotations for Semantic Segmentation Using Diffusion Models
ICCV 2023arXiv
0
citations
SegGPT: Towards Segmenting Everything in Context
ICCV 2023
0
citations
Robust Geometry-Preserving Depth Estimation Using Differentiable Rendering
ICCV 2023arXiv
0
citations
CTVIS: Consistent Training for Online Video Instance Segmentation
ICCV 2023arXiv
0
citations
Generative Prompt Model for Weakly Supervised Object Localization
ICCV 2023arXiv
0
citations
Zolly: Zoom Focal Length Correctly for Perspective-Distorted Human Mesh Reconstruction
ICCV 2023arXiv
0
citations
FrozenRecon: Pose-free 3D Scene Reconstruction with Frozen Depth Models
ICCV 2023arXiv
0
citations
SegPrompt: Boosting Open-World Segmentation via Category-Level Prompt Learning
ICCV 2023arXiv
0
citations
Conditional Convolutions for Instance Segmentation
ECCV 2020
0
citations
Soft Expert Reward Learning for Vision-and-Language Navigation
ECCV 2020
0
citations
Scene Text Image Super-resolution in the wild
ECCV 2020
0
citations
Segmenting Transparent Objects in the Wild
ECCV 2020
0
citations
AE TextSpotter: Learning Visual and Linguistic Representation for Ambiguous Text Spotting
ECCV 2020
0
citations
Learning and Memorizing Representative Prototypes for 3D Point Cloud Semantic and Instance Segmentation
ECCV 2020
0
citations
SOLO: Segmenting Objects by Locations
ECCV 2020
0
citations
Instance-Aware Embedding for Point Cloud Instance Segmentation
ECCV 2020
0
citations
PointInst3D: Segmenting 3D Instances by Points
ECCV 2022
0
citations
Poseur: Direct Human Pose Regression with Transformers
ECCV 2022
0
citations
Efficient Decoder-Free Object Detection with Transformers
ECCV 2022
0
citations
DisCo: Remedying Self-Supervised Learning on Lightweight Models with Distilled Contrastive Learning
ECCV 2022
0
citations
Deeply Learning the Messages in Message Passing Inference
NeurIPS 2015arXiv
0
citations
Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image
ICCV 2023arXiv
0
citations
MovieBench: A Hierarchical Movie Level Dataset for Long Video Generation
CVPR 2025
0
citations
POMATO: Marrying Pointmap Matching with Temporal Motions for Dynamic 3D Reconstruction
ICCV 2025
0
citations
Fine-grained Abnormality Prompt Learning for Zero-shot Anomaly Detection
ICCV 2025
0
citations
SMSTracker: Tri-path Score Mask Sigma Fusion for Multi-Modal Tracking
ICCV 2025
0
citations
SurfaceSplat: Connecting Surface Reconstruction and Gaussian Splatting
ICCV 2025
0
citations
Unified Open-World Segmentation with Multi-Modal Prompts
ICCV 2025
0
citations
Retrieval-Augmented Primitive Representations for Compositional Zero-Shot Learning
AAAI 2024
0
citations
DiverGen: Improving Instance Segmentation by Learning Wider Data Distribution with More Diverse Generative Data
CVPR 2024
0
citations
Traffic Scene Parsing through the TSP6K Dataset
CVPR 2024
0
citations
FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept Composition
CVPR 2024
0
citations
Floating Anchor Diffusion Model for Multi-motif Scaffolding
ICML 2024
0
citations
Generative Active Learning for Long-tailed Instance Segmentation
ICML 2024
0
citations
Unsupervised Scale-consistent Depth and Ego-motion Learning from Monocular Video
NeurIPS 2019
0
citations
Multi-marginal Wasserstein GAN
NeurIPS 2019
0
citations
SOLOv2: Dynamic and Fast Instance Segmentation
NeurIPS 2020
0
citations
Twins: Revisiting the Design of Spatial Attention in Vision Transformers
NeurIPS 2021
0
citations
Dynamic Neural Representational Decoders for High-Resolution Semantic Segmentation
NeurIPS 2021
0
citations
SegViT: Semantic Segmentation with Plain Vision Transformers
NeurIPS 2022
0
citations
Hierarchical Normalization for Robust Monocular Depth Estimation
NeurIPS 2022
0
citations
Multi-dataset Training of Transformers for Robust Action Recognition
NeurIPS 2022
0
citations
DENSE: Data-Free One-Shot Federated Learning
NeurIPS 2022
0
citations
Adv-Attribute: Inconspicuous and Transferable Adversarial Attack on Face Recognition
NeurIPS 2022
0
citations
Fully Convolutional One-Stage 3D Object Detection on LiDAR Range Images
NeurIPS 2022
0
citations
PyramidCLIP: Hierarchical Feature Alignment for Vision-language Model Pretraining
NeurIPS 2022
0
citations
Text-Adaptive Multiple Visual Prototype Matching for Video-Text Retrieval
NeurIPS 2022
0
citations
DatasetDM: Synthesizing Data with Perception Annotations Using Diffusion Models
NeurIPS 2023
0
citations
Adversarial Learning with Local Coordinate Coding
ICML 2018
0
citations