
OCR

Creator
Seonglae Cho
Created
2022 Jun 26 3:14
Editor
Seonglae Cho
Edited
2025 Aug 25 22:3
OCR Tools
zerox (getomni-ai)
surya (datalab-to)
EasyOCR
tesseract (tesseract-ocr)
LlamaOCR
OCRmyPDF (ocrmypdf)
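
As a rough illustration of how one of the tools above is typically driven, here is a minimal sketch that shells out to the tesseract command-line program. It assumes the `tesseract` binary is installed and on the PATH; the helper names `build_tesseract_command` and `run_ocr` are hypothetical and not part of any library.

```python
import shutil
import subprocess
from typing import List, Optional


def build_tesseract_command(image_path: str, lang: str = "eng", psm: int = 3) -> List[str]:
    """Build the argument list for the tesseract CLI (hypothetical helper).

    "stdout" as the output base tells tesseract to print the recognized
    text instead of writing a .txt file; -l selects the language model and
    --psm the page segmentation mode.
    """
    return ["tesseract", image_path, "stdout", "-l", lang, "--psm", str(psm)]


def run_ocr(image_path: str, lang: str = "eng", psm: int = 3) -> Optional[str]:
    """Run tesseract on one image and return the text.

    Returns None when the tesseract binary is not installed (assumption:
    callers prefer a soft failure over an exception in that case).
    """
    if shutil.which("tesseract") is None:
        return None
    result = subprocess.run(
        build_tesseract_command(image_path, lang, psm),
        capture_output=True,
        text=True,
        check=True,  # raise if tesseract exits with an error
    )
    return result.stdout.strip()
```

Calling `run_ocr("scan.png")` (a placeholder file name) would return the recognized text, or `None` if tesseract is missing. Keeping command construction separate from execution makes the arguments easy to inspect without the binary present.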
 
 
AllenAI
olmOCR – Open-Source OCR for Accurate Document Conversion
olmOCR is an open-source tool for converting PDFs to text with high accuracy, preserving reading order and supporting tables, equations, and handwriting.
https://olmocr.allenai.org/blog
stepfun-ai/GOT-OCR2_0 · Hugging Face
https://huggingface.co/stepfun-ai/GOT-OCR2_0

OCR attention head

Unlike general retrieval heads, OCR heads are attention heads specialized for recognizing text in images.
arxiv.org
https://arxiv.org/pdf/2505.15865

Large Dataset

nvidia/Llama-Nemotron-VLM-Dataset-v1 · Datasets at Hugging Face
https://huggingface.co/datasets/nvidia/Llama-Nemotron-VLM-Dataset-v1
 
 

Backlinks

Image 2 LaTex
Document Language

Copyright Seonglae Cho