Cong Yao
24
Papers
180
Total Citations
Papers (24)
LayoutLLM: Layout Instruction Tuning with Large Language Models for Document Understanding
CVPR 2024
98
citations
FontDiffuser: One-Shot Font Generation via Denoising Diffusion with Multi-Scale Content Aggregation and Style Contrastive Learning
AAAI 2024 (arXiv)
74
citations
ProcTag: Process Tagging for Assessing the Efficacy of Document Instruction Data
AAAI 2025
6
citations
Platypus: A Generalized Specialist Model for Reading Text in Various Forms
ECCV 2024
2
citations
Robust Scene Text Recognition With Automatic Rectification
CVPR 2016
0
citations
EAST: An Efficient and Accurate Scene Text Detector
CVPR 2017 (arXiv)
0
citations
Multi-Oriented Scene Text Detection via Corner Localization and Region Segmentation
CVPR 2018 (arXiv)
0
citations
On Vocabulary Reliance in Scene Text Recognition
CVPR 2020 (arXiv)
0
citations
MOST: A Multi-Oriented Scene Text Detector With Localization Refinement
CVPR 2021 (arXiv)
0
citations
Vision-Language Pre-Training for Boosting Scene Text Detectors
CVPR 2022 (arXiv)
0
citations
Revisiting Document Image Dewarping by Grid Regularization
CVPR 2022 (arXiv)
0
citations
GeoLayoutLM: Geometric Pre-Training for Visual Information Extraction
CVPR 2023 (arXiv)
0
citations
Conditional Text Image Generation With Diffusion Models
CVPR 2023
0
citations
Modeling Entities As Semantic Points for Visual Information Extraction in the Wild
CVPR 2023 (arXiv)
0
citations
Relaxed Multiple-Instance SVM With Application to Object Discovery
ICCV 2015
0
citations
Symmetry-Constrained Rectification Network for Scene Text Recognition
ICCV 2019
0
citations
LISTER: Neighbor Decoding for Length-Insensitive Scene Text Recognition
ICCV 2023 (arXiv)
0
citations
Vision Grid Transformer for Document Layout Analysis
ICCV 2023 (arXiv)
0
citations
Differentiable Feature Aggregation Search for Knowledge Distillation
ECCV 2020
0
citations
Levenshtein OCR
ECCV 2022
0
citations
Multi-Granularity Prediction for Scene Text Recognition
ECCV 2022
0
citations
OmniParser: A Unified Framework for Text Spotting, Key Information Extraction and Table Recognition
CVPR 2024
0
citations
Symmetry-Based Text Line Detection in Natural Scenes
CVPR 2015
0
citations
Multi-Oriented Text Detection With Fully Convolutional Networks
CVPR 2016
0
citations