Haoyuan Li

6

Papers

141

Total Citations

Papers (6)

HealthGPT: A Medical Large Vision-Language Model for Unifying Comprehension and Generation via Heterogeneous Knowledge Adaptation

EMOVA: Empowering Language Models to See, Hear and Speak with Vivid Emotions

LLaVA-MoD: Making LLaVA Tiny via MoE-Knowledge Distillation

Anomaly Detection of Integrated Circuits Package Substrates Using the Large Vision Model SAIC: Dataset Construction, Methodology, and Application

MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis

Detecting and Mitigating Hallucination in Large Vision Language Models via Fine-Grained AI Feedback