Here are
10 public repositories
matching this topic...
OCR engine for all the languages
Updated
Aug 19, 2025
Python
Probabilistic Key Value pair extraction using word weights from Invoices - Non Searchable PDF
Updated
Jun 12, 2021
Python
✏️ Integration of Tesseract for Python using a shared library
Updated
Mar 25, 2016
Python
Python parser for hOCR files using lxml
Updated
Aug 23, 2020
Python
Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.
Updated
Aug 23, 2025
Python
Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.
Updated
Oct 3, 2023
Python
graphical HOCR editor to produce minimal diffs for proofreading of tesseract OCR output
Updated
Aug 26, 2025
Python
OCR engine for all the languages
Updated
Jan 6, 2023
Python
Updated
Dec 8, 2019
Python
TIFF Image - Converted into OCR XML using Tesseract
Updated
Mar 9, 2024
Python
Improve this page
Add a description, image, and links to the
hocr
topic page so that developers can more easily learn about it.
Curate this topic
Add this topic to your repo
To associate your repository with the
hocr
topic, visit your repo's landing page and select "manage topics."
Learn more
You can’t perform that action at this time.