Shitian Zhao
4
Papers
26
Total Citations
Papers (4)
PixWizard: Versatile Image-to-Image Visual Assistant with Open-Language Instructions
ICLR 2025
26
citations
FontAnimate: High Quality Few-shot Font Generation via Animating Font Transfer Process
ICCV 2025
0
citations
Causal-CoG: A Causal-Effect Look at Context Generation for Boosting Multi-modal Language Models
CVPR 2024
0
citations
SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models
ICML 2024
0
citations