Ross Girshick
48
Papers
1,726
Total Citations
Papers (48)
Hypercolumns for Object Segmentation and Fine-Grained Localization
CVPR 2015
1,630
citations
Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Vision-Language Models
CVPR 2025
96
citations
Aligning 3D Models to RGB-D Images of Cluttered Scenes
CVPR 2015
0
citations
Training Region-Based Object Detectors With Online Hard Example Mining
CVPR 2016
0
citations
You Only Look Once: Unified, Real-Time Object Detection
CVPR 2016
0
citations
Inside-Outside Net: Detecting Objects in Context With Skip Pooling and Recurrent Neural Networks
CVPR 2016
0
citations
Seeing Through the Human Reporting Bias: Visual Classifiers From Noisy Human-Centric Labels
CVPR 2016
0
citations
Aggregated Residual Transformations for Deep Neural Networks
CVPR 2017arXiv
0
citations
Feature Pyramid Networks for Object Detection
CVPR 2017arXiv
0
citations
Learning Features by Watching Objects Move
CVPR 2017arXiv
0
citations
CLEVR: A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning
CVPR 2017arXiv
0
citations
Learning by Asking Questions
CVPR 2018arXiv
0
citations
Data Distillation: Towards Omni-Supervised Learning
CVPR 2018arXiv
0
citations
Learning to Segment Every Thing
CVPR 2018arXiv
0
citations
Low-Shot Learning From Imaginary Data
CVPR 2018arXiv
0
citations
Non-Local Neural Networks
CVPR 2018arXiv
0
citations
Detecting and Recognizing Human-Object Interactions
CVPR 2018arXiv
0
citations
Long-Term Feature Banks for Detailed Video Understanding
CVPR 2019
0
citations
LVIS: A Dataset for Large Vocabulary Instance Segmentation
CVPR 2019
0
citations
Panoptic Feature Pyramid Networks
CVPR 2019
0
citations
Panoptic Segmentation
CVPR 2019
0
citations
A Multigrid Method for Efficiently Training Video Models
CVPR 2020arXiv
0
citations
PointRend: Image Segmentation As Rendering
CVPR 2020arXiv
0
citations
Momentum Contrast for Unsupervised Visual Representation Learning
CVPR 2020arXiv
0
citations
Designing Network Design Spaces
CVPR 2020arXiv
0
citations
Boundary IoU: Improving Object-Centric Image Segmentation Evaluation
CVPR 2021arXiv
0
citations
A Large-Scale Study on Unsupervised Spatiotemporal Representation Learning
CVPR 2021arXiv
0
citations
Fast and Accurate Model Scaling
CVPR 2021arXiv
0
citations
Revisiting Weakly Supervised Pre-Training of Visual Perception Models
CVPR 2022arXiv
0
citations
Masked Autoencoders Are Scalable Vision Learners
CVPR 2022arXiv
0
citations
Contextual Action Recognition With R*CNN
ICCV 2015
0
citations
Fast R-CNN
ICCV 2015
0
citations
Actions and Attributes From Wholes and Parts
ICCV 2015
0
citations
Mask R-CNN
ICCV 2017arXiv
0
citations
Inferring and Executing Programs for Visual Reasoning
ICCV 2017arXiv
0
citations
Low-Shot Visual Recognition by Shrinking and Hallucinating Features
ICCV 2017arXiv
0
citations
Exploring Randomly Wired Neural Networks for Image Recognition
ICCV 2019
0
citations
TensorMask: A Foundation for Dense Object Segmentation
ICCV 2019
0
citations
Rethinking ImageNet Pre-Training
ICCV 2019
0
citations
The Effectiveness of MAE Pre-Pretraining for Billion-Scale Pretraining
ICCV 2023arXiv
0
citations
Segment Anything
ICCV 2023arXiv
0
citations
Are Labels Necessary for Neural Architecture Search?
ECCV 2020
0
citations
Exploring Plain Vision Transformer Backbones for Object Detection
ECCV 2022
0
citations
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
NeurIPS 2015
0
citations
Focal Loss for Dense Object Detection
ICCV 2017arXiv
0
citations
Deformable Part Models are Convolutional Neural Networks
CVPR 2015
0
citations
PHYRE: A New Benchmark for Physical Reasoning
NeurIPS 2019
0
citations
Unsupervised Deep Embedding for Clustering Analysis
ICML 2016
0
citations