Irfan Essa
7 Papers
827 Total Citations
Papers (7)
Language Model Beats Diffusion - Tokenizer is key to visual generation
ICLR 2024
525 citations
Photorealistic Video Generation with Diffusion Models
ECCV 2024
264 citations
Calibrated Multi-Preference Optimization for Aligning Diffusion Models
CVPR 2025
24 citations
Limitations in Employing Natural Language Supervision for Sensor-Based Human Activity Recognition - And Ways to Overcome Them
AAAI 2025
9 citations
Cropper: Vision-Language Model for Image Cropping through In-Context Learning
CVPR 2025
5 citations
Prompt-Free Diffusion: Taking “Text” out of Text-to-Image Diffusion Models
CVPR 2024
0 citations
VideoPoet: A Large Language Model for Zero-Shot Video Generation
ICML 2024
0 citations