Lianwen Jin
13
Papers
215
Total Citations
Papers (13)
FontDiffuser: One-Shot Font Generation via Denoising Diffusion with Multi-Scale Content Aggregation and Style Contrastive Learning
AAAI 2024arXiv
74
citations
CC-OCR: A Comprehensive and Challenging OCR Benchmark for Evaluating Large Multimodal Models in Literacy
ICCV 2025arXiv
42
citations
DocRes: A Generalist Model Toward Unifying Document Image Restoration Tasks
CVPR 2024
29
citations
ViTEraser: Harnessing the Power of Vision Transformers for Scene Text Removal with SegMIM Pretraining
AAAI 2024arXiv
18
citations
M2Doc: A Multi-Modal Fusion Approach for Document Layout Analysis
AAAI 2024
14
citations
Bridging the Gap Between End-to-End and Two-Step Text Spotting
CVPR 2024
11
citations
DocLayLLM: An Efficient Multi-modal Extension of Large Language Models for Text-rich Document Understanding
CVPR 2025arXiv
10
citations
Revisiting Tampered Scene Text Detection in the Era of Generative AI
AAAI 2025
10
citations
Predicting the Original Appearance of Damaged Historical Documents
AAAI 2025
7
citations
DocNLC: A Document Image Enhancement Framework with Normalized and Latent Contrastive Representation for Multiple Degradations
AAAI 2024
0
citations
Towards Modern Image Manipulation Localization: A Large-Scale Dataset and Novel Methods
CVPR 2024
0
citations
DocKylin: A Large Multimodal Model for Visual Document Understanding with Efficient Visual Slimming
AAAI 2025
0
citations
UPOCR: Towards Unified Pixel-Level OCR Interface
ICML 2024
0
citations