Cong Yao

24
Papers
180
Total Citations

Papers (24)

LayoutLLM: Layout Instruction Tuning with Large Language Models for Document Understanding

CVPR 2024
98
citations

FontDiffuser: One-Shot Font Generation via Denoising Diffusion with Multi-Scale Content Aggregation and Style Contrastive Learning

AAAI 2024arXiv
74
citations

ProcTag: Process Tagging for Assessing the Efficacy of Document Instruction Data

AAAI 2025
6
citations

Platypus: A Generalized Specialist Model for Reading Text in Various Forms

ECCV 2024
2
citations

Robust Scene Text Recognition With Automatic Rectification

CVPR 2016
0
citations

EAST: An Efficient and Accurate Scene Text Detector

CVPR 2017arXiv
0
citations

Multi-Oriented Scene Text Detection via Corner Localization and Region Segmentation

CVPR 2018arXiv
0
citations

On Vocabulary Reliance in Scene Text Recognition

CVPR 2020arXiv
0
citations

MOST: A Multi-Oriented Scene Text Detector With Localization Refinement

CVPR 2021arXiv
0
citations

Vision-Language Pre-Training for Boosting Scene Text Detectors

CVPR 2022arXiv
0
citations

Revisiting Document Image Dewarping by Grid Regularization

CVPR 2022arXiv
0
citations

GeoLayoutLM: Geometric Pre-Training for Visual Information Extraction

CVPR 2023arXiv
0
citations

Conditional Text Image Generation With Diffusion Models

CVPR 2023
0
citations

Modeling Entities As Semantic Points for Visual Information Extraction in the Wild

CVPR 2023arXiv
0
citations

Relaxed Multiple-Instance SVM With Application to Object Discovery

ICCV 2015
0
citations

Symmetry-Constrained Rectification Network for Scene Text Recognition

ICCV 2019
0
citations

LISTER: Neighbor Decoding for Length-Insensitive Scene Text Recognition

ICCV 2023arXiv
0
citations

Vision Grid Transformer for Document Layout Analysis

ICCV 2023arXiv
0
citations

Differentiable Feature Aggregation Search for Knowledge Distillation

ECCV 2020
0
citations

Levenshtein OCR

ECCV 2022
0
citations

Multi-Granularity Prediction for Scene Text Recognition

ECCV 2022
0
citations

OmniParser: A Unified Framework for Text Spotting Key Information Extraction and Table Recognition

CVPR 2024
0
citations

Symmetry-Based Text Line Detection in Natural Scenes

CVPR 2015
0
citations

Multi-Oriented Text Detection With Fully Convolutional Networks

CVPR 2016
0
citations