ocrd.processor.helpers module¶
Helper methods for running and documenting processors
- ocrd.processor.helpers.run_cli(executable, mets_url=None, resolver=None, workspace=None, page_id=None, overwrite=None, debug=None, log_level=None, log_filename=None, input_file_grp=None, output_file_grp=None, parameter=None, working_dir=None, mets_server_url=None)[source]¶
Open a workspace and run a processor on the command line.
If
workspaceis not none, reuse that. Otherwise, instantiate anWorkspaceformets_url(andworking_dir) by usingocrd.Resolver.workspace_from_url()(i.e. open or clone local workspace).Run the processor CLI
executableon the workspace, passing: - the workspace, -page_id-input_file_grp-output_file_grp-parameter(after applying anyparameter_overridesettings)(Will create output files and update the in the filesystem).
- Parameters:
executable (string) – Executable name of the module processor.
- ocrd.processor.helpers.run_processor(processorClass, mets_url=None, resolver=None, workspace=None, page_id=None, log_level=None, input_file_grp=None, output_file_grp=None, parameter=None, working_dir=None, mets_server_url=None, instance_caching=False)[source]¶
Instantiate a Pythonic processor, open a workspace, run the processor and save the workspace.
If
workspaceis not none, reuse that. Otherwise, instantiate anWorkspaceformets_url(andworking_dir) by usingocrd.Resolver.workspace_from_url()(i.e. open or clone local workspace).Instantiate a Python object for
processorClass, passing: - the workspace, -page_id-input_file_grp-output_file_grp-parameter(after applying anyparameter_overridesettings)Warning: Avoid setting the instance_caching flag to True. It may have unexpected side effects. This flag is used for an experimental feature we would like to adopt in future.
Run the processor on the workspace (creating output files in the filesystem).
Finally, write back the workspace (updating the METS in the filesystem).
- Parameters:
processorClass (object) – Python class of the module processor.