@OCR-D

OCR-D

DFG coordination project for the further development of Optical Character Recognition methods

Pinned

  1. core Public

    Collection of OCR-related Python tools and wrappers from @OCR-D

    Python · 132 stars · 32 forks

  2. ocrd_all Public

    Master repository which includes most other OCR-D repositories as submodules

    Makefile · 72 stars · 19 forks

  3. spec Public

    Specification of the @OCR-D technical architecture, interface definitions and data exchange format(s)

    Python · 17 stars · 5 forks

  4. gt-guidelines Public

    OCR-D guidelines for Ground Truth production

    HTML · 6 stars · 5 forks

  5. ocrd-webapi-implementation Public

    Python 4

Repositories

Showing 10 of 94 repositories
  • core Public

    Collection of OCR-related Python tools and wrappers from @OCR-D

    Python · 132 stars · Apache-2.0 · 32 forks · 115 issues (1 issue needs help) · 15 pull requests · Updated Jan 17, 2026
  • ocrd-website Public
    HTML · 24 stars · CC-BY-4.0 · 7 forks · 16 issues · 1 pull request · Updated Jan 14, 2026
  • ocr-d.github.io Public

    Website for OCR-D specs, formats, requirements

    HTML · 5 stars · 2 forks · 0 issues · 0 pull requests · Updated Jan 14, 2026
  • OCR-D-GT-VD-SBB Public

    A ground truth (GT) dataset created within the OCR-D project and consisting of 348 pages extracted from historical documents pertaining to the "Verzeichnis der im deutschen Sprachraum erschienenen Drucke" (VD), all of which have been digitised by Staatsbibliothek zu Berlin – Berlin State Library (SBB).

    Shell · 0 stars · CC-BY-SA-4.0 · 0 forks · 0 issues · 0 pull requests · Updated Nov 26, 2025
  • ocrd_segment Public

    OCR-D-compliant page segmentation

    Python · 68 stars · MIT · 16 forks · 11 issues · 1 pull request · Updated Nov 19, 2025
  • keyboardGT Public

    A collection of keyboards for transcription software (Aletheia, Transkribus, LAREX, QURATOR-neat, eScriptorium)

    XSLT · 1 star · CC-BY-SA-4.0 · 1 fork · 0 issues · 0 pull requests · Updated Nov 5, 2025
  • ocrd_pagetopdf Public

    OCR-D wrapper for prima-pagetopdf

    Python · 9 stars · Apache-2.0 · 7 forks · 3 issues · 0 pull requests · Updated Oct 30, 2025
  • spec Public

    Specification of the @OCR-D technical architecture, interface definitions and data exchange format(s)

    Python · 17 stars · 5 forks · 42 issues · 6 pull requests · Updated Sep 18, 2025
  • ocrd_keraslm Public

    Simple character-based language model using Keras

    Python · 7 stars · Apache-2.0 · 6 forks · 1 issue · 0 pull requests · Updated Aug 12, 2025
  • ocrd_kraken Public

    Wrapper for the kraken OCR engine

    Python · 13 stars · Apache-2.0 · 6 forks · 4 issues · 1 pull request · Updated Jul 12, 2025