Chunhua Shen

154
Papers
1,217
Total Citations

Papers (154)

Depth and Surface Normal Estimation From Monocular Images Using Regression on Deep Features and Hierarchical CRFs

CVPR 2015
628
citations

Efficient Semantic Video Segmentation with Per-frame Inference

ECCV 2020
138
citations

PointAttN: You Only Need Attention for Point Cloud Completion

AAAI 2024
92
citations

Image Restoration Using Very Deep Convolutional Encoder-Decoder Networks with Symmetric Skip Connections

NeurIPS 2016
85
citations

Weighing Counts: Sequential Crowd Counting by Reinforcement Learning

ECCV 2020
80
citations

What Matters When Repurposing Diffusion Models for General Dense Perception Tasks?

ICLR 2025
56
citations

Aether: Geometric-Aware Unified World Modeling

ICCV 2025
47
citations

Representative Graph Neural Network

ECCV 2020
44
citations

Omni-R1: Reinforcement Learning for Omnimodal Reasoning via Two-System Collaboration

NeurIPS 2025
12
citations

FreeCompose: Generic Zero-Shot Image Composition with Diffusion Prior

ECCV 2024
12
citations

SegAgent: Exploring Pixel Understanding Capabilities in MLLMs by Imitating Human Annotator Trajectories

CVPR 2025
11
citations

TG-LLaVA: Text Guided LLaVA via Learnable Latent Embeddings

AAAI 2025
8
citations

Revisiting Convolution Architecture in the Realm of DNA Foundation Models

ICLR 2025
4
citations

On the Trajectory Regularity of ODE-based Diffusion Sampling

ICML 2024
0
citations

Supervised Discrete Hashing

CVPR 2015
0
citations

Mid-Level Deep Pattern Mining

CVPR 2015
0
citations

Learning to Rank in Person Re-Identification With Metric Ensembles

CVPR 2015
0
citations

Efficient SDP Inference for Fully-Connected CRFs Based on Low-Rank Decomposition

CVPR 2015
0
citations

Learning Graph Structure for Multi-Label Image Classification via Clique Generation

CVPR 2015
0
citations

The Treasure Beneath Convolutional Layers: Cross-Convolutional-Layer Pooling for Image Classification

CVPR 2015
0
citations

Deep Convolutional Neural Fields for Depth Estimation From a Single Image

CVPR 2015
0
citations

What Value Do Explicit High Level Concepts Have in Vision to Language Problems?

CVPR 2016
0
citations

What's Wrong With That Object? Identifying Images of Unusual Objects by Modelling the Detection Score Distribution

CVPR 2016
0
citations

Less Is More: Zero-Shot Learning From Online Textual Documents With Noise Suppression

CVPR 2016
0
citations

Efficient Piecewise Training of Deep Structured Models for Semantic Segmentation

CVPR 2016
0
citations

Ask Me Anything: Free-Form Visual Question Answering Based on Knowledge From External Sources

CVPR 2016
0
citations

Fast Training of Triplet-Based Deep Binary Embedding Networks

CVPR 2016
0
citations

The VQA-Machine: Learning How to Use Existing Vision Algorithms to Answer New Questions

CVPR 2017
0
citations

Sequential Person Recognition in Photo Albums With a Recurrent Network

CVPR 2017
0
citations

Attend in Groups: A Weakly-Supervised Deep Learning Framework for Learning From Web Data

CVPR 2017
0
citations

RefineNet: Multi-Path Refinement Networks for High-Resolution Semantic Segmentation

CVPR 2017
0
citations

From Motion Blur to Motion Flow: A Deep Learning Solution for Removing Heterogeneous Motion Blur

CVPR 2017
0
citations

Multi-Attention Network for One Shot Learning

CVPR 2017
0
citations

Monocular Relative Depth Perception With Web Stereo Data Supervision

CVPR 2018
0
citations

Bootstrapping the Performance of Webly Supervised Semantic Segmentation

CVPR 2018
0
citations

FSRNet: End-to-End Learning Face Super-Resolution With Facial Priors

CVPR 2018
0
citations

Parallel Attention: A Unified Framework for Visual Object Discovery Through Dialogs and Queries

CVPR 2018
0
citations

An End-to-End TextSpotter With Explicit Alignment and Attention

CVPR 2018
0
citations

Are You Talking to Me? Reasoned Visual Dialog Generation Through Adversarial Learning

CVPR 2018
0
citations

Visual Question Answering With Memory-Augmented Networks

CVPR 2018
0
citations

Repulsion Loss: Detecting Pedestrians in a Crowd

CVPR 2018
0
citations

Towards Effective Low-Bitwidth Convolutional Neural Networks

CVPR 2018
0
citations

VITAL: VIsual Tracking via Adversarial Learning

CVPR 2018
0
citations

Structured Binary Neural Networks for Accurate Image Classification and Semantic Segmentation

CVPR 2019
0
citations

Knowledge Adaptation for Efficient Semantic Segmentation

CVPR 2019
0
citations

Attention-Guided Network for Ghost-Free High Dynamic Range Imaging

CVPR 2019
0
citations

Neighbourhood Watch: Referring Expression Comprehension via Language-Guided Graph Attention Networks

CVPR 2019
0
citations

Mind Your Neighbours: Image Annotation With Metadata Neighbourhood Graph Co-Attention Networks

CVPR 2019
0
citations

Decoders Matter for Semantic Segmentation: Data-Dependent Decoding Enables Flexible Feature Aggregation

CVPR 2019
0
citations

Associatively Segmenting Instances and Semantics in Point Clouds

CVPR 2019
0
citations

CANet: Class-Agnostic Segmentation Networks With Iterative Refinement and Attentive Few-Shot Learning

CVPR 2019
0
citations

Visual Question Answering as Reading Comprehension

CVPR 2019
0
citations

Fast Neural Architecture Search of Compact Semantic Segmentation Models via Auxiliary Cells

CVPR 2019
0
citations

Training Quantized Neural Networks With a Full-Precision Auxiliary Module

CVPR 2020
0
citations

Memory-Efficient Hierarchical Neural Architecture Search for Image Denoising

CVPR 2020
0
citations

BlendMask: Top-Down Meets Bottom-Up for Instance Segmentation

CVPR 2020
0
citations

DeepEMD: Few-Shot Image Classification With Differentiable Earth Mover's Distance and Structured Classifiers

CVPR 2020
0
citations

ABCNet: Real-Time Scene Text Spotting With Adaptive Bezier-Curve Network

CVPR 2020
0
citations

Context Prior for Scene Segmentation

CVPR 2020
0
citations

Mask Encoding for Single Shot Instance Segmentation

CVPR 2020
0
citations

NAS-FCOS: Fast Neural Architecture Search for Object Detection

CVPR 2020
0
citations

On the General Value of Evidence, and Bilingual Scene-Text Visual Question Answering

CVPR 2020
0
citations

REVERIE: Remote Embodied Visual Referring Expression in Real Indoor Environments

CVPR 2020
0
citations

Self-Trained Deep Ordinal Regression for End-to-End Video Anomaly Detection

CVPR 2020
0
citations

PolarMask: Single Shot Instance Segmentation With Polar Representation

CVPR 2020
0
citations

DoDNet: Learning To Segment Multi-Organ and Tumors From Multiple Partially Labeled Datasets

CVPR 2021
0
citations

Learning To Recover 3D Scene Shape From a Single Image

CVPR 2021
0
citations

Graph Attention Tracking

CVPR 2021
0
citations

AQD: Towards Accurate Quantized Object Detection

CVPR 2021
0
citations

Generic Perceptual Loss for Modeling Structured Output Dependencies

CVPR 2021
0
citations

DyCo3D: Robust Instance Segmentation of 3D Point Clouds Through Dynamic Convolution

CVPR 2021
0
citations

Learning Spatial-Semantic Relationship for Facial Attribute Recognition With Limited Labeled Data

CVPR 2021
0
citations

Feature Decomposition and Reconstruction Learning for Effective Facial Expression Recognition

CVPR 2021
0
citations

End-to-End Video Instance Segmentation With Transformers

CVPR 2021
0
citations

Dense Contrastive Learning for Self-Supervised Visual Pre-Training

CVPR 2021
0
citations

FCPose: Fully Convolutional Multi-Person Pose Estimation With Dynamic Instance-Aware Convolutions

CVPR 2021
0
citations

BoxInst: High-Performance Instance Segmentation With Box Annotations

CVPR 2021
0
citations

HCRF-Flow: Scene Flow From Point Clouds With Continuous High-Order CRFs and Position-Aware Flow Embedding

CVPR 2021
0
citations

Learning Affinity-Aware Upsampling for Deep Image Matting

CVPR 2021
0
citations

FreeSOLO: Learning To Segment Objects Without Annotations

CVPR 2022
0
citations

RigidFlow: Self-Supervised Scene Flow Learning on Point Clouds by Local Rigidity Prior

CVPR 2022
0
citations

Catching Both Gray and Black Swans: Open-Set Supervised Anomaly Detection

CVPR 2022
0
citations

Retrieval Augmented Classification for Long-Tail Visual Recognition

CVPR 2022
0
citations

Boosting Robustness of Image Matting With Context Assembling and Strong Data Augmentation

CVPR 2022
0
citations

TopFormer: Token Pyramid Transformer for Mobile Semantic Segmentation

CVPR 2022
0
citations

Learning Conditional Attributes for Compositional Zero-Shot Learning

CVPR 2023
0
citations

Images Speak in Images: A Generalist Painter for In-Context Visual Learning

CVPR 2023
0
citations

Hyperspectral Compressive Sensing Using Manifold-Structured Sparsity Prior

ICCV 2015
0
citations

Towards Context-Aware Interaction Recognition for Visual Relationship Detection

ICCV 2017
0
citations

When Unsupervised Domain Adaptation Meets Tensor Representations

ICCV 2017
0
citations

Adversarial PoseNet: A Structure-Aware Convolutional Network for Human Pose Estimation

ICCV 2017
0
citations

Towards End-To-End Text Spotting With Convolutional Recurrent Neural Networks

ICCV 2017
0
citations

Exploiting Temporal Consistency for Real-Time Video Depth Estimation

ICCV 2019
0
citations

Indices Matter: Learning to Index for Deep Image Matting

ICCV 2019
0
citations

Enforcing Geometric Constraints of Virtual Normal for Depth Prediction

ICCV 2019
0
citations

Self-Training With Progressive Augmentation for Unsupervised Cross-Domain Person Re-Identification

ICCV 2019
0
citations

From Open Set to Closed Set: Counting Objects by Spatial Divide-and-Conquer

ICCV 2019
0
citations

Efficient and Accurate Arbitrary-Shaped Text Detection With Pixel Aggregation Network

ICCV 2019
0
citations

FCOS: Fully Convolutional One-Stage Object Detection

ICCV 2019
0
citations

FATNN: Fast and Accurate Ternary Neural Networks

ICCV 2021
0
citations

BV-Person: A Large-Scale Dataset for Bird-View Person Re-Identification

ICCV 2021
0
citations

Channel-Wise Knowledge Distillation for Dense Prediction

ICCV 2021
0
citations

A Simple Baseline for Semi-Supervised Semantic Segmentation With Strong Data Augmentation

ICCV 2021
0
citations

Meta Navigator: Search for a Good Adaptation Policy for Few-Shot Learning

ICCV 2021
0
citations

Occluded Person Re-Identification With Single-Scale Global Representations

ICCV 2021
0
citations

DiffuMask: Synthesizing Images with Pixel-level Annotations for Semantic Segmentation Using Diffusion Models

ICCV 2023
0
citations

SegGPT: Towards Segmenting Everything in Context

ICCV 2023
0
citations

Robust Geometry-Preserving Depth Estimation Using Differentiable Rendering

ICCV 2023
0
citations

CTVIS: Consistent Training for Online Video Instance Segmentation

ICCV 2023
0
citations

Generative Prompt Model for Weakly Supervised Object Localization

ICCV 2023
0
citations

Zolly: Zoom Focal Length Correctly for Perspective-Distorted Human Mesh Reconstruction

ICCV 2023
0
citations

FrozenRecon: Pose-free 3D Scene Reconstruction with Frozen Depth Models

ICCV 2023
0
citations

SegPrompt: Boosting Open-World Segmentation via Category-Level Prompt Learning

ICCV 2023
0
citations

Conditional Convolutions for Instance Segmentation

ECCV 2020
0
citations

Soft Expert Reward Learning for Vision-and-Language Navigation

ECCV 2020
0
citations

Scene Text Image Super-Resolution in the Wild

ECCV 2020
0
citations

Segmenting Transparent Objects in the Wild

ECCV 2020
0
citations

AE TextSpotter: Learning Visual and Linguistic Representation for Ambiguous Text Spotting

ECCV 2020
0
citations

Learning and Memorizing Representative Prototypes for 3D Point Cloud Semantic and Instance Segmentation

ECCV 2020
0
citations

SOLO: Segmenting Objects by Locations

ECCV 2020
0
citations

Instance-Aware Embedding for Point Cloud Instance Segmentation

ECCV 2020
0
citations

PointInst3D: Segmenting 3D Instances by Points

ECCV 2022
0
citations

Poseur: Direct Human Pose Regression with Transformers

ECCV 2022
0
citations

Efficient Decoder-Free Object Detection with Transformers

ECCV 2022
0
citations

DisCo: Remedying Self-Supervised Learning on Lightweight Models with Distilled Contrastive Learning

ECCV 2022
0
citations

Deeply Learning the Messages in Message Passing Inference

NeurIPS 2015
0
citations

Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image

ICCV 2023
0
citations

MovieBench: A Hierarchical Movie Level Dataset for Long Video Generation

CVPR 2025
0
citations

POMATO: Marrying Pointmap Matching with Temporal Motions for Dynamic 3D Reconstruction

ICCV 2025
0
citations

Fine-grained Abnormality Prompt Learning for Zero-shot Anomaly Detection

ICCV 2025
0
citations

SMSTracker: Tri-path Score Mask Sigma Fusion for Multi-Modal Tracking

ICCV 2025
0
citations

SurfaceSplat: Connecting Surface Reconstruction and Gaussian Splatting

ICCV 2025
0
citations

Unified Open-World Segmentation with Multi-Modal Prompts

ICCV 2025
0
citations

Retrieval-Augmented Primitive Representations for Compositional Zero-Shot Learning

AAAI 2024
0
citations

DiverGen: Improving Instance Segmentation by Learning Wider Data Distribution with More Diverse Generative Data

CVPR 2024
0
citations

Traffic Scene Parsing through the TSP6K Dataset

CVPR 2024
0
citations

FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept Composition

CVPR 2024
0
citations

Floating Anchor Diffusion Model for Multi-motif Scaffolding

ICML 2024
0
citations

Generative Active Learning for Long-tailed Instance Segmentation

ICML 2024
0
citations

Unsupervised Scale-consistent Depth and Ego-motion Learning from Monocular Video

NeurIPS 2019
0
citations

Multi-marginal Wasserstein GAN

NeurIPS 2019
0
citations

SOLOv2: Dynamic and Fast Instance Segmentation

NeurIPS 2020
0
citations

Twins: Revisiting the Design of Spatial Attention in Vision Transformers

NeurIPS 2021
0
citations

Dynamic Neural Representational Decoders for High-Resolution Semantic Segmentation

NeurIPS 2021
0
citations

SegViT: Semantic Segmentation with Plain Vision Transformers

NeurIPS 2022
0
citations

Hierarchical Normalization for Robust Monocular Depth Estimation

NeurIPS 2022
0
citations

Multi-dataset Training of Transformers for Robust Action Recognition

NeurIPS 2022
0
citations

DENSE: Data-Free One-Shot Federated Learning

NeurIPS 2022
0
citations

Adv-Attribute: Inconspicuous and Transferable Adversarial Attack on Face Recognition

NeurIPS 2022
0
citations

Fully Convolutional One-Stage 3D Object Detection on LiDAR Range Images

NeurIPS 2022
0
citations

PyramidCLIP: Hierarchical Feature Alignment for Vision-language Model Pretraining

NeurIPS 2022
0
citations

Text-Adaptive Multiple Visual Prototype Matching for Video-Text Retrieval

NeurIPS 2022
0
citations

DatasetDM: Synthesizing Data with Perception Annotations Using Diffusion Models

NeurIPS 2023
0
citations

Adversarial Learning with Local Coordinate Coding

ICML 2018
0
citations