Jing Shi
10
Papers
422
Total Citations
Papers (10)
InstantBooth: Personalized Text-to-Image Generation without Test-Time Finetuning
CVPR 2024
369
citations
FINECAPTION: Compositional Image Captioning Focusing on Wherever You Want at Any Granularity
CVPR 2025
17
citations
FineMatch: Aspect-based Fine-grained Image and Text Mismatch Detection and Correction
ECCV 2024arXiv
14
citations
VIXEN: Visual Text Comparison Network for Image Difference Captioning
AAAI 2024arXiv
9
citations
Visual Persona: Foundation Model for Full-Body Human Customization
CVPR 2025
6
citations
Poplar: Efficient Scaling of Distributed DNN Training on Heterogeneous GPU Clusters
AAAI 2025
5
citations
The Photographer's Eye: Teaching Multimodal Large Language Models to See, and Critique Like Photographers
CVPR 2025
1
citations
Improving Large Vision and Language Models by Learning from a Panel of Peers
ICCV 2025
1
citations
DiffTell: A High-Quality Dataset for Describing Image Manipulation Changes
ICCV 2025
0
citations
Yo’Chameleon: Personalized Vision and Language Generation
CVPR 2025
0
citations