Kaiming He

46
Papers
6,103
Total Citations

Papers (46)

R-FCN: Object Detection via Region-based Fully Convolutional Networks

NeurIPS 2016arXiv
5,936
citations

Transformers without Normalization

CVPR 2025arXiv
96
citations

A Decade's Battle on Dataset Bias: Are We There Yet?

ICLR 2025
52
citations

Is Noise Conditioning Necessary for Denoising Generative Models?

ICML 2025
19
citations

Convolutional Feature Masking for Joint Object and Stuff Segmentation

CVPR 2015
0
citations

Convolutional Neural Networks at Constrained Time Cost

CVPR 2015
0
citations

Deep Residual Learning for Image Recognition

CVPR 2016
0
citations

Instance-Aware Semantic Segmentation via Multi-Task Network Cascades

CVPR 2016
0
citations

ScribbleSup: Scribble-Supervised Convolutional Networks for Semantic Segmentation

CVPR 2016
0
citations

Aggregated Residual Transformations for Deep Neural Networks

CVPR 2017arXiv
0
citations

Feature Pyramid Networks for Object Detection

CVPR 2017arXiv
0
citations

Data Distillation: Towards Omni-Supervised Learning

CVPR 2018arXiv
0
citations

Learning to Segment Every Thing

CVPR 2018arXiv
0
citations

Non-Local Neural Networks

CVPR 2018arXiv
0
citations

Detecting and Recognizing Human-Object Interactions

CVPR 2018arXiv
0
citations

Long-Term Feature Banks for Detailed Video Understanding

CVPR 2019
0
citations

Feature Denoising for Improving Adversarial Robustness

CVPR 2019
0
citations

Panoptic Feature Pyramid Networks

CVPR 2019
0
citations

Panoptic Segmentation

CVPR 2019
0
citations

A Multigrid Method for Efficiently Training Video Models

CVPR 2020arXiv
0
citations

PointRend: Image Segmentation As Rendering

CVPR 2020arXiv
0
citations

Momentum Contrast for Unsupervised Visual Representation Learning

CVPR 2020arXiv
0
citations

Designing Network Design Spaces

CVPR 2020arXiv
0
citations

Exploring Simple Siamese Representation Learning

CVPR 2021arXiv
0
citations

A Large-Scale Study on Unsupervised Spatiotemporal Representation Learning

CVPR 2021arXiv
0
citations

Masked Autoencoders Are Scalable Vision Learners

CVPR 2022arXiv
0
citations

Scaling Language-Image Pre-Training via Masking

CVPR 2023arXiv
0
citations

Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification

ICCV 2015
0
citations

BoxSup: Exploiting Bounding Boxes to Supervise Convolutional Networks for Semantic Segmentation

ICCV 2015
0
citations

Transitive Invariance for Self-Supervised Visual Representation Learning

ICCV 2017arXiv
0
citations

Mask R-CNN

ICCV 2017arXiv
0
citations

Focal Loss for Dense Object Detection

ICCV 2017arXiv
0
citations

Exploring Randomly Wired Neural Networks for Image Recognition

ICCV 2019
0
citations

Rethinking ImageNet Pre-Training

ICCV 2019
0
citations

SlowFast Networks for Video Recognition

ICCV 2019
0
citations

Deep Hough Voting for 3D Object Detection in Point Clouds

ICCV 2019
0
citations

An Empirical Study of Training Self-Supervised Vision Transformers

ICCV 2021arXiv
0
citations

Are Labels Necessary for Neural Architecture Search?

ECCV 2020
0
citations

Exploring Plain Vision Transformer Backbones for Object Detection

ECCV 2022
0
citations

Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks

NeurIPS 2015
0
citations

TensorMask: A Foundation for Dense Object Segmentation

ICCV 2019
0
citations

A Geodesic-Preserving Method for Image Warping

CVPR 2015
0
citations

Efficient and Accurate Approximations of Nonlinear Convolutional Networks

CVPR 2015
0
citations

Sparse Projections for High-Dimensional Binary Codes

CVPR 2015
0
citations

GLoMo: Unsupervised Learning of Transferable Relational Graphs

NeurIPS 2018
0
citations

Masked Autoencoders As Spatiotemporal Learners

NeurIPS 2022
0
citations