Jun Huang
7
Papers
28
Total Citations
Papers (7)
M2Doc: A Multi-Modal Fusion Approach for Document Layout Analysis
AAAI 2024
14
citations
DocLayLLM: An Efficient Multi-modal Extension of Large Language Models for Text-rich Document Understanding
CVPR 2025arXiv
10
citations
Evaluating Semantic Variation in Text-to-Image Synthesis: A Causal Perspective
ICLR 2025
2
citations
Encapsulated Composition of Text-to-Image and Text-to-Video Models for High-Quality Video Synthesis
CVPR 2025
2
citations
M2SD:Multiple Mixing Self-Distillation for Few-Shot Class-Incremental Learning
AAAI 2024
0
citations
Fingerprinting Denoising Diffusion Probabilistic Models
CVPR 2025
0
citations
Towards Understanding Cross and Self-Attention in Stable Diffusion for Text-Guided Image Editing
CVPR 2024
0
citations