Ross Girshick

48
Papers
1,726
Total Citations

Papers (48)

Hypercolumns for Object Segmentation and Fine-Grained Localization

CVPR 2015
1,630
citations

Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Vision-Language Models

CVPR 2025
96
citations

Aligning 3D Models to RGB-D Images of Cluttered Scenes

CVPR 2015
0
citations

Training Region-Based Object Detectors With Online Hard Example Mining

CVPR 2016
0
citations

You Only Look Once: Unified, Real-Time Object Detection

CVPR 2016
0
citations

Inside-Outside Net: Detecting Objects in Context With Skip Pooling and Recurrent Neural Networks

CVPR 2016
0
citations

Seeing Through the Human Reporting Bias: Visual Classifiers From Noisy Human-Centric Labels

CVPR 2016
0
citations

Aggregated Residual Transformations for Deep Neural Networks

CVPR 2017arXiv
0
citations

Feature Pyramid Networks for Object Detection

CVPR 2017arXiv
0
citations

Learning Features by Watching Objects Move

CVPR 2017arXiv
0
citations

CLEVR: A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning

CVPR 2017arXiv
0
citations

Learning by Asking Questions

CVPR 2018arXiv
0
citations

Data Distillation: Towards Omni-Supervised Learning

CVPR 2018arXiv
0
citations

Learning to Segment Every Thing

CVPR 2018arXiv
0
citations

Low-Shot Learning From Imaginary Data

CVPR 2018arXiv
0
citations

Non-Local Neural Networks

CVPR 2018arXiv
0
citations

Detecting and Recognizing Human-Object Interactions

CVPR 2018arXiv
0
citations

Long-Term Feature Banks for Detailed Video Understanding

CVPR 2019
0
citations

LVIS: A Dataset for Large Vocabulary Instance Segmentation

CVPR 2019
0
citations

Panoptic Feature Pyramid Networks

CVPR 2019
0
citations

Panoptic Segmentation

CVPR 2019
0
citations

A Multigrid Method for Efficiently Training Video Models

CVPR 2020arXiv
0
citations

PointRend: Image Segmentation As Rendering

CVPR 2020arXiv
0
citations

Momentum Contrast for Unsupervised Visual Representation Learning

CVPR 2020arXiv
0
citations

Designing Network Design Spaces

CVPR 2020arXiv
0
citations

Boundary IoU: Improving Object-Centric Image Segmentation Evaluation

CVPR 2021arXiv
0
citations

A Large-Scale Study on Unsupervised Spatiotemporal Representation Learning

CVPR 2021arXiv
0
citations

Fast and Accurate Model Scaling

CVPR 2021arXiv
0
citations

Revisiting Weakly Supervised Pre-Training of Visual Perception Models

CVPR 2022arXiv
0
citations

Masked Autoencoders Are Scalable Vision Learners

CVPR 2022arXiv
0
citations

Contextual Action Recognition With R*CNN

ICCV 2015
0
citations

Fast R-CNN

ICCV 2015
0
citations

Actions and Attributes From Wholes and Parts

ICCV 2015
0
citations

Mask R-CNN

ICCV 2017arXiv
0
citations

Inferring and Executing Programs for Visual Reasoning

ICCV 2017arXiv
0
citations

Low-Shot Visual Recognition by Shrinking and Hallucinating Features

ICCV 2017arXiv
0
citations

Exploring Randomly Wired Neural Networks for Image Recognition

ICCV 2019
0
citations

TensorMask: A Foundation for Dense Object Segmentation

ICCV 2019
0
citations

Rethinking ImageNet Pre-Training

ICCV 2019
0
citations

The Effectiveness of MAE Pre-Pretraining for Billion-Scale Pretraining

ICCV 2023arXiv
0
citations

Segment Anything

ICCV 2023arXiv
0
citations

Are Labels Necessary for Neural Architecture Search?

ECCV 2020
0
citations

Exploring Plain Vision Transformer Backbones for Object Detection

ECCV 2022
0
citations

Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks

NeurIPS 2015
0
citations

Focal Loss for Dense Object Detection

ICCV 2017arXiv
0
citations

Deformable Part Models are Convolutional Neural Networks

CVPR 2015
0
citations

PHYRE: A New Benchmark for Physical Reasoning

NeurIPS 2019
0
citations

Unsupervised Deep Embedding for Clustering Analysis

ICML 2016
0
citations