Roei Herzig
16
Papers
199
Total Citations
Papers (16)
Compositional Chain-of-Thought Prompting for Large Multimodal Models
CVPR 2024
167
citations
Pre-training Auto-regressive Robotic Models with 4D Representations
ICML 2025
19
citations
Recursive Visual Programming
ECCV 2024
10
citations
Enhancing Few-Shot Vision-Language Classification with Large Multimodal Model Features
ICCV 2025
3
citations
DETReg: Unsupervised Pretraining With Region Priors for Object Detection
CVPR 2022arXiv
0
citations
Object-Region Video Transformers
CVPR 2022
0
citations
Teaching Structured Vision & Language Concepts to Vision & Language Models
CVPR 2023
0
citations
Learning Canonical Representations for Scene Graph to Image Generation
ECCV 2020
0
citations
Unsupervised Domain Generalization by Learning a Bridge Across Domains
CVPR 2022arXiv
0
citations
Unsupervised Universal Image Segmentation
CVPR 2024
0
citations
Precise Detection in Densely Packed Scenes
CVPR 2019
0
citations
Something-Else: Compositional Action Recognition With Spatial-Temporal Interaction Networks
CVPR 2020
0
citations
Mapping Images to Scene Graphs with Permutation-Invariant Structured Prediction
NeurIPS 2018
0
citations
Bringing Image Scene Structure to Video via Frame-Clip Consistency of Object Tokens
NeurIPS 2022
0
citations
FETA: Towards Specializing Foundational Models for Expert Task Applications
NeurIPS 2022
0
citations
Dense and Aligned Captions (DAC) Promote Compositional Reasoning in VL Models
NeurIPS 2023
0
citations