Yin Cui

23
Papers
88
Total Citations

Papers (23)

Describe Anything: Detailed Localized Image and Video Captioning

ICCV 2025
49
citations

Visual Fact Checker: Enabling High-Fidelity Detailed Caption Generation

CVPR 2024
33
citations

ArtiScene: Language-Driven Artistic 3D Scene Generation Through Image Intermediary

CVPR 2025
6
citations

Kernel Pooling for Convolutional Neural Networks

CVPR 2017
0
citations

Large Scale Fine-Grained Categorization and Domain-Specific Transfer Learning

CVPR 2018arXiv
0
citations

Learning to Evaluate Image Captioning

CVPR 2018arXiv
0
citations

The INaturalist Species Classification and Detection Dataset

CVPR 2018arXiv
0
citations

Class-Balanced Loss Based on Effective Number of Samples

CVPR 2019
0
citations

SpineNet: Learning Scale-Permuted Backbone for Recognition and Localization

CVPR 2020arXiv
0
citations

Spatiotemporal Contrastive Video Representation Learning

CVPR 2021arXiv
0
citations

Contextualized Spatio-Temporal Contrastive Learning With Self-Supervision

CVPR 2022arXiv
0
citations

Train-Once-for-All Personalization

CVPR 2023
0
citations

Unified Visual Relationship Detection with Vision and Language Models

ICCV 2023arXiv
0
citations

Fashionpedia: Ontology, Segmentation, and an Attribute Localization Dataset

ECCV 2020
0
citations

Scaling Open-Vocabulary Image Segmentation with Image-Level Labels

ECCV 2022
0
citations

Simple Copy-Paste Is a Strong Data Augmentation Method for Instance Segmentation

CVPR 2021arXiv
0
citations

Learning Deep Representations for Ground-to-Aerial Geolocalization

CVPR 2015
0
citations

Fine-Grained Categorization and Dataset Bootstrapping Using Deep Metric Learning With Humans in the Loop

CVPR 2016
0
citations

Rethinking Pre-training and Self-training

NeurIPS 2020
0
citations

VATT: Transformers for Multimodal Self-Supervised Learning from Raw Video, Audio and Text

NeurIPS 2021
0
citations

DaTaSeg: Taming a Universal Multi-Dataset Multi-Task Segmentation Model

NeurIPS 2023
0
citations

Module-wise Adaptive Distillation for Multimodality Foundation Models

NeurIPS 2023
0
citations

Alternating Gradient Descent and Mixture-of-Experts for Integrated Multimodal Perception

NeurIPS 2023
0
citations