Aug 6, 2021
OCR-D Phase III started
On 30 July, our kick-off workshop took place, heralding phase III of OCR-D.
Jun 11, 2021
OCR-D at the Bibliothekartag 2021
OCR-D will also be present at this year’s Bibliothekartag, which will take place virtually from 16-18 June 2021 and on two days in Bremen. The OCR-D project is participating with two presentations on the current status of the funding initiative and on the collaborative creation of training materials.
Jun 10, 2021
Implementation and module projects granted
In addition to the coordination project, the DFG also approved seven implementation and module projects that will begin their work in the coming months.
Apr 26, 2021
OCR(-D) & Co starting in May
The inaugural session of our new monthly event OCR(-D) & Co will take place on 7 May 2021. In a barcamp-like format, developers, users and all other interested persons will be given the opportunity to talk about OCR(-D).
Jan 19, 2021
Phase III of the OCR-D-coordination project granted
The coordination project’s application for the third phase of the OCR-D funding initiative was approved by the DFG in January 2021. In phase III, we will optimise the results of the previous module project phase and we will initiate the productive use of the OCR-D software in mass digitisation both technically and organisationally.
Oct 2, 2020
OCR-D at the Mini-ELAG
On October 20, 2020, the Mini-ELAG (European Library Automation Group) will take place, where librarians and IT professionals discuss new information technologies and their application in libraries and documentation centers. OCR-D will be represented at the virtual conference with a lecture by Clemens Neudecker (SBB) on “OCR-D: An open ecosystem for improving OCR on historical documents”.
Sep 22, 2020
OCR-D at the virtual workshop FAIR & Co
From October 7 to 8, the eHumanities working group of the Union of German Academies of Sciences and Humanities, in cooperation with the Göttingen Academy of Sciences and Humanities, is organizing the workshop FAIR & Co: Visibility and Availability of Digital Academy Research in a Networked Scientific Landscape. The OCR-D project will be represented with a lecture on the topic Digital Transformation: OCR-D, Offer and Vision by Matthias Boenig (BBAW). Using the example of the German Text Archive, it will be shown how the application spectrum of this reference corpus can be extended by the area of machine learning to improve character and structure recognition. For the whole program of the workshop see the website of this workshop.
Aug 1, 2020
Workshop for the implementation plans
Following the successful first (virtual) meeting of those interested in the DFG call for the implementation of the OCR-D software, a further workshop will be held on 7 August from 9:00 to 13:00 to prepare the OCR-D grant proposals.
Jun 4, 2020
Kick-off pilot phase
We are very happy about the great interest in the DFG call for proposals for the implementation of the OCR-D software. As the OCR-D coordination project, we will support the planned projects from the pilot phase onwards and promote the exchange of information among interested parties, as desired by the DFG. To kick off the pilot phase, we are organising a large video conference on 19 June from 9:00 to 13:00, at which all interested parties can get to know each other and the pilot tests can be coordinated. Interested parties who have not submitted a letter of intent themselves and who are still looking for a suitable partner for the third phase of OCR-D are also welcome to join the video conference. If you are interested, please register for the video conference by 12 June at engl@hab.de. We look forward to working with you and to a successful pilot phase!
Feb 25, 2020
Call for OCR-D Implementation online!
The call for the implementation of the OCR-D software for the full text digitisation of historical prints is now available on the website of the German Research Foundation (DFG).
Feb 19, 2020
OCR-D at the DHd in Paderborn
The OCR-D funding initiative will be represented with several presentations at the DHd 2020, which will take place in Paderborn from 2 to 6 March. In V15, on March 5, the OCR-D coordination project will discuss the results and perspectives of the funding initiative on the basis of four theses on full text transformation. In the same section, the colleagues of the module project for the development of a model repository and automatic font recognition for OCR-D will report on their findings on the use of Fraktur in German-language books. For further information on the presentations, see the DHd program.
Feb 3, 2020
Full texts - the future of old prints: OCR-D-Workshop in Bonn
The OCR-D workshop “Full texts - the future of old prints” will take place in Bonn on 12 February. The findings and desiderata of the DFG project will be presented and discussed there with a broad audience of developers, OCR experts, users and funding agencies.
Nov 20, 2019
Cooperation with Kitodo signed
Kitodo and OCR-D have signed a Letter of Intent to cooperate on the coordinated and sustainable development and provision of OCR software solutions for mass full text digitization. Kitodo is an open source software suite for the digitisation of cultural property and widely used in the library sector.
Aug 23, 2019
OCR-D at the ICDAR 2019 in Sydney
The OCR-D paper “okralact - a multi-engine Open Source OCR training system” (Konstantin Baierer, Rui Dong, Clemens Neudecker) was accepted for the 5th International Workshop on Historical Document Imaging and Processing, HIP 2019 (https://www.primaresearch.org/hip2019/), held in the context of the ICDAR Conference 2019 in Sydney (https://icdar2019.org/).
Aug 9, 2019
OCR-D in EuropeanaTech Insights
On the occasion of DATeCH 2019, the online journal “EuropeanaTech Insights” dedicated its latest issue to the topic of optical character recognition.
May 13, 2019
Best Paper Award for OCR-D at the DATeCH 2019
At the international conference “Digital Access to Textual Cultural Heritage 2019” (DATeCH2019) held in Brussels, the paper on OCR-D entitled “OCR-D: An end-to-end open-source OCR framework for historical documents” was awarded the Best Paper Award. The authors are Clemens Neudecker, Konstantin Baierer, Maria Federbusch, Kay-Michael Würzner, Matthias Boenig, Elisa Herrmann and Volker Hartmann.
Aug 31, 2018
OCR-D extended
The German Research Foundation (DFG) has approved an extension of the OCR-D project for another 18 months. The new funding phase will start in October 2018 and will therefore end in March 2020. This good news allows us to continue supporting the module projects and to consolidate the results. It also allows the coordination committee’s own work packages to be continued and deepened.
Mar 28, 2018
OCR-D Kick-Off Meeting
From March 5th to 6th, the big kick-off meeting of the module projects took place at the Herzog August Bibliothek in Wolfenbüttel, officially heralding the second phase of OCR-D.
Mar 6, 2017
Call for OCR-D module project proposals
The call for module projects within the framework of OCR-D can now be found online on the website of the German Research Foundation (DFG).
Dec 6, 2016
State Library in Berlin is New Project Partner of OCR-D
The coordinating body of OCR-D consists of the Herzog August Bibliothek Wolfenbüttel, the Berlin-Brandenburg Academy of Sciences and Humanities Berlin, in particular the German Text Archive, and now the Staatsbibliothek zu Berlin. In future, the SBB will take over the work of the Bayerische Staatsbibliothek, which withdrew from the project on 31 August 2016.
Jun 1, 2016
The OCR-D project
OCR-D is a coordination project that is aimed at the further development of Optical Character Recognition (OCR) processes for historical prints.