Zhihong Zhu
7 Papers
125 Total Citations
Papers (7)
MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?
NeurIPS 2025, arXiv
57
citations
Harnessing Large Language Models for Knowledge Graph Question Answering via Adaptive Multi-Aspect Retrieval-Augmentation
AAAI 2025
31
citations
DisPose: Disentangling Pose Guidance for Controllable Human Image Animation
ICLR 2025
23
citations
Exploiting Auxiliary Caption for Video Grounding
AAAI 2024, arXiv
14
citations
VASparse: Towards Efficient Visual Hallucination Mitigation via Visual-Aware Token Sparsification
CVPR 2025
0
citations
Aligner$^2$: Enhancing Joint Multiple Intent Detection and Slot Filling via Adjustive and Forced Cross-Task Alignment
AAAI 2024
0
citations
Towards Multi-Intent Spoken Language Understanding via Hierarchical Attention and Optimal Transport
AAAI 2024
0
citations