Zaid Khan
7
Papers
35
Total Citations
Papers (7)
Consistency and Uncertainty: Identifying Unreliable Responses From Black-Box Vision-Language Models for Selective Visual Question Answering
CVPR 2024
21
citations
DataEnvGym: Data Generation Agents in Teacher Environments with Student Feedback
ICLR 2025arXiv
8
citations
DWIM: Towards Tool-aware Visual Reasoning via Discrepancy-aware Workflow Generation & Instruct-Masking Tuning
ICCV 2025arXiv
6
citations
Self-Training Large Language Models for Improved Visual Program Synthesis With Visual Reinforcement
CVPR 2024
0
citations
Q: How To Specialize Large Vision-Language Models to Data-Scarce VQA Tasks? A: Self-Train on Unlabeled Images!
CVPR 2023
0
citations
Single-Stream Multi-level Alignment for Vision-Language Pretraining
ECCV 2022
0
citations
Exploring Question Decomposition for Zero-Shot VQA
NeurIPS 2023
0
citations