Mark Yatskar
12
Papers
146
Total Citations
Papers (12)
Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Vision-Language Models
CVPR 2025
96
citations
CoMo: Controllable Motion Generation through Language Guided Pose Code Editing
ECCV 2024arXiv
48
citations
ViUniT: Visual Unit Tests for More Robust Visual Programming
CVPR 2025
2
citations
Commonly Uncommon: Semantic Sparsity in Situation Recognition
CVPR 2017arXiv
0
citations
Neural Motifs: Scene Graph Parsing With Global Context
CVPR 2018arXiv
0
citations
RoboTHOR: An Open Simulation-to-Real Embodied AI Platform
CVPR 2020arXiv
0
citations
Visual Semantic Role Labeling for Video Understanding
CVPR 2021arXiv
0
citations
Language in a Bottle: Language Model Guided Concept Bottlenecks for Interpretable Image Classification
CVPR 2023arXiv
0
citations
Balanced Datasets Are Not Enough: Estimating and Mitigating Gender Bias in Deep Image Representations
ICCV 2019
0
citations
Grounded Situation Recognition
ECCV 2020
0
citations
Holodeck: Language Guided Generation of 3D Embodied AI Environments
CVPR 2024
0
citations
Situation Recognition: Visual Semantic Role Labeling for Image Understanding
CVPR 2016
0
citations