What I really want is something with a similar set of convenient APIs and CLIs like ocrmypdf [1] that supports some of the more recent ML based systems. Ocrmypdf has really good ergonomics for me in terms of ease of scripting.
Something like DocTR [2] with the same api would be fantastic.
Something like DocTR [2] with the same api would be fantastic.
[1] https://ocrmypdf.readthedocs.io/en/latest/
[2] https://mindee.github.io/doctr/