Haoyuan Li
8
Papers
107
Total Citations
Papers (8)
HealthGPT: A Medical Large Vision-Language Model for Unifying Comprehension and Generation via Heterogeneous Knowledge Adaptation
ICML 2025
63
citations
EMOVA: Empowering Language Models to See, Hear and Speak with Vivid Emotions
CVPR 2025
44
citations
MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis
AAAI 2025
0
citations
Detecting and Mitigating Hallucination in Large Vision Language Models via Fine-Grained AI Feedback
AAAI 2025
0
citations
Anomaly Detection of Integrated Circuits Package Substrates Using the Large Vision Model SAIC: Dataset Construction, Methodology, and Application
ICCV 2025
0
citations
DATE: Domain Adaptive Product Seeker for E-Commerce
CVPR 2023
0
citations
Coordinate Transformer: Achieving Single-stage Multi-person Mesh Recovery from Videos
ICCV 2023arXiv
0
citations
Towards Effective Multi-Modal Interchanges in Zero-Resource Sounding Object Localization
NeurIPS 2022
0
citations