Mingming Gong
14
Papers
89
Total Citations
Papers (14)
Enhancing Visual Document Understanding with Contrastive Learning in Large Visual-Language Models
CVPR 2024
37
citations
LaVin-DiT: Large Vision Diffusion Transformer
CVPR 2025arXiv
19
citations
Grab What You Need: Rethinking Complex Table Structure Recognition with Flexible Components Deliberation
AAAI 2024arXiv
10
citations
Interventional Fairness on Partially Known Causal Graphs: A Constrained Optimization Approach
ICLR 2024
8
citations
UNIC-Adapter: Unified Image-instruction Adapter with Multi-modal Transformer for Image Generation
CVPR 2025arXiv
7
citations
Projection Pursuit Density Ratio Estimation
ICML 2025
3
citations
DIDiffGes: Decoupled Semi-Implicit Diffusion Models for Real-time Gesture Generation from Speech
AAAI 2025
3
citations
Detecting Generated Images by Fitting Natural Image Distributions
NeurIPS 2025arXiv
2
citations
On the Recoverability of Causal Relations from Temporally Aggregated I.I.D. Data
ICML 2024
0
citations
Semantic-guided Cross-Modal Prompt Learning for Skeleton-based Zero-shot Action Recognition
CVPR 2025
0
citations
A Reinforcement-Learning-Based Multiple-Column Selection Strategy for Column Generation
AAAI 2024
0
citations
Part-aware Unified Representation of Language and Skeleton for Zero-shot Action Recognition
CVPR 2024
0
citations
Optimal Kernel Choice for Score Function-based Causal Discovery
ICML 2024
0
citations
SnapGen: Taming High-Resolution Text-to-Image Models for Mobile Devices with Efficient Architectures and Training
CVPR 2025
0
citations