Alireza Fathi
6
Papers
45
Total Citations
Papers (6)
Language-Guided Image Tokenization for Generation
CVPR 2025arXiv
23
citations
Temporal Chain of Thought: Long-Video Understanding by Thinking in Frames
NeurIPS 2025arXiv
8
citations
FirePlace: Geometric Refinements of LLM Common Sense Reasoning for 3D Object Placement
CVPR 2025
7
citations
A Generative Approach for Wikipedia-Scale Visual Entity Recognition
CVPR 2024
7
citations
Visual Lexicon: Rich Image Features in Language Space
CVPR 2025
0
citations
SceneCraft: An LLM Agent for Synthesizing 3D Scenes as Blender Code
ICML 2024
0
citations