Letian Zhang
6 Papers
11 Total Citations
Papers (6)
Oasis: One Image is All You Need for Multimodal Instruction Data Synthesis
ICCV 2025
6 citations
EvdCLIP: Improving Vision-Language Retrieval with Entity Visual Descriptions from Large Language Models
AAAI 2025
5 citations
LoRA-FAIR: Federated LoRA Fine-Tuning with Aggregation and Initialization Refinement
ICCV 2025
0 citations
FedEL: Federated Elastic Learning for Heterogeneous Devices
NeurIPS 2025
0 citations
Pre-Trained Vision-Language Models as Noisy Partial Annotators
AAAI 2025
0 citations
What If the TV Was Off? Examining Counterfactual Reasoning Abilities of Multi-modal Language Models
CVPR 2024
0 citations