Bibliotheca Baltica 2018 Rostock, 2018-10-05
https://ocr-d.github.io/2018-10-05-baltica/
https://ocr-d.github.io/2018-10-05-baltica/slides.pdf
DFG-organized expert Workshop (2014)
Verfahren zur Verbesserung von OCR-Ergebnissen
Result: A concerted effort for improving OCR is seen as required.
Surveyed (open-source) ecosystem
around OCR and OLR
Identified Tasks
Prepared call for proposals for DFG
Integrate with existing digitization workflow software, e.g. Kitodo
Make OCR-D-developed software uniformly deployable
Advise DFG on OCR requirements for "Praxisrichtlinien"
existing tools by OCR-D partners
(tesseract, PoCoTo, LAREX...)
new developments within OCR-D
(font identification, post-correction...)
existing tools outside OCR-D
(ocropus, kraken, ScanTailor, OLENA...)
Python widely used in computer vision and machine learning
(keras, pytorch...)
Wrapping existing tools with minimal friction
(ocropus, kraken ...)
Bindings for low-level APIs
(opencv, tesserocr ...)
Lowest common denominator
Wrap arbitrary command line tools
Process callout possible in every
framework/workflow engine/programming environment
Tool | Developer | Functionality | Wrapper |
---|---|---|---|
anyOCR | DFKI Kaiserslautern | binarization, cropping, deskewing, dewarping | (python) |
OLENA | OCR-D | binarization | shell |
tesseract | UB Mannheim, ASV Leipzig |
binarization | python |
OCRopus | OCR-D | binarization | python |
kraken | OCR-D | binarization | python |
ImageMagick | OCR-D | binarization, conversion | shell |
Tool | Developer | Functionality | Wrapper |
---|---|---|---|
anyOCR | DFKI Kaiserslautern | block+line seg8n, block class7n, document analysis | (python) |
LAREX | Uni Würzburg | block+line seg8n, block class7n | (shell) |
OCRopus | OCR-D | line seg8n | python |
kraken | OCR-D | line seg8n | python |
tesseract | UB Mannheim, ASV Leipzig |
block+line seg8n | python |
dh_segment | OCR-D | block+line seg8n | (shell) |
Tool | Developer | Functionality | Wrapper |
---|---|---|---|
OCRopus | OCR-D | text recognition | python |
kraken | OCR-D | text recognition | python |
tesseract | UB Mannheim, ASV Leipzig |
text recognition | python |
calamari | OCR-D | text recognition | (python) |
ocrad | OCR-D | text recognition | (shell) |
Tool | Developer | Functionality | Wrapper |
---|---|---|---|
corASV | ASV Leipzig | post correction | (python) |
PoCoTo | CIS München | post correction | python |
keraslm | ASV Leipzig | post correction | python |
ocrevalUAtion | OCR-D | evaluation | (shell) |
Cooperation with Kitodo and commercial providers
Frequent reality check with current practices ("Pilotbibliotheken")
Different deployment options
(Instantiation and composition up to users)
Transparency from day one
Unit tests
Unified test assets
Continuous Integration
Semantic versioning
Docker base image
Releases to GitHub, PyPI, DockerHub
Issues
Pull requests
Code review
Support chat