Yin Cui
23
Papers
88
Total Citations
Papers (23)
Describe Anything: Detailed Localized Image and Video Captioning
ICCV 2025
49
citations
Visual Fact Checker: Enabling High-Fidelity Detailed Caption Generation
CVPR 2024
33
citations
ArtiScene: Language-Driven Artistic 3D Scene Generation Through Image Intermediary
CVPR 2025
6
citations
Kernel Pooling for Convolutional Neural Networks
CVPR 2017
0
citations
Large Scale Fine-Grained Categorization and Domain-Specific Transfer Learning
CVPR 2018arXiv
0
citations
Learning to Evaluate Image Captioning
CVPR 2018arXiv
0
citations
The INaturalist Species Classification and Detection Dataset
CVPR 2018arXiv
0
citations
Class-Balanced Loss Based on Effective Number of Samples
CVPR 2019
0
citations
SpineNet: Learning Scale-Permuted Backbone for Recognition and Localization
CVPR 2020arXiv
0
citations
Spatiotemporal Contrastive Video Representation Learning
CVPR 2021arXiv
0
citations
Contextualized Spatio-Temporal Contrastive Learning With Self-Supervision
CVPR 2022arXiv
0
citations
Train-Once-for-All Personalization
CVPR 2023
0
citations
Unified Visual Relationship Detection with Vision and Language Models
ICCV 2023arXiv
0
citations
Fashionpedia: Ontology, Segmentation, and an Attribute Localization Dataset
ECCV 2020
0
citations
Scaling Open-Vocabulary Image Segmentation with Image-Level Labels
ECCV 2022
0
citations
Simple Copy-Paste Is a Strong Data Augmentation Method for Instance Segmentation
CVPR 2021arXiv
0
citations
Learning Deep Representations for Ground-to-Aerial Geolocalization
CVPR 2015
0
citations
Fine-Grained Categorization and Dataset Bootstrapping Using Deep Metric Learning With Humans in the Loop
CVPR 2016
0
citations
Rethinking Pre-training and Self-training
NeurIPS 2020
0
citations
VATT: Transformers for Multimodal Self-Supervised Learning from Raw Video, Audio and Text
NeurIPS 2021
0
citations
DaTaSeg: Taming a Universal Multi-Dataset Multi-Task Segmentation Model
NeurIPS 2023
0
citations
Module-wise Adaptive Distillation for Multimodality Foundation Models
NeurIPS 2023
0
citations
Alternating Gradient Descent and Mixture-of-Experts for Integrated Multimodal Perception
NeurIPS 2023
0
citations