Package Details: ocrmypdf 10.2.0-1

Package Base: ocrmypdf
Description: A tool to add an OCR text layer to scanned PDF files, allowing them to be searched
Upstream URL:
Licenses: GPL3
Submitter: dreuter
Maintainer: fbrennan (pigmonkey)
Last Packager: pigmonkey
Votes: 43
Popularity: 2.84
First Submitted: 2014-01-27 11:36
Last Updated: 2020-07-03 02:48

Latest Comments


xuanruiqi commented on 2020-07-03 02:28

Now that python-pillow in community has been updated to 7.2.0, the block on updating this package should no longer exist.

pigmonkey commented on 2020-06-14 18:58

I pinged the python-pillow packager. The package had simply fallen through the cracks and he will be updating it today, but 7.0 introduced some API breakage so the upgraded package will probably hang out in the testing repo for a bit.

fbrennan commented on 2020-06-13 01:03

It might make sense to put in an orphan request for python-pillow-git, then update that, then temporarily require it, @pigmonkey, given how long the community package has been out of date. Of course it's up to you, as it might be too much work.

jbarlow commented on 2020-06-13 00:41

Upstream here. I noticed python-pillow in AUR is quite old so this could be a blocker for some time.

ocrmypdf does work with pillow 6.2.1, with all tests passing. You could override the requirement and permit the earlier pillow. (I'd rather not change this upstream, so that upstream reflects the configuration that is being tested.)
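One way to picture such an override is a small text rewrite applied to the file that declares the requirement before building. The sketch below is only an illustration: the helper name `relax_requirement` is made up, and it assumes the requirement appears as a quoted string such as 'pillow >= 7.0.0', which may not match how upstream actually writes it.

```python
import re


def relax_requirement(text: str, package: str, new_spec: str) -> str:
    """Replace the version specifier that follows `package` inside quotes.

    Assumption: the requirement is written as a quoted string such as
    'pillow >= 7.0.0'. This is a guess at the format, not a copy of
    ocrmypdf's real packaging metadata.
    """
    pattern = re.compile(
        r"(['\"])" + re.escape(package) + r"\s*[<>=!~][^'\"]*\1",
        flags=re.IGNORECASE,
    )
    # Keep the original quote character and swap in the new specifier.
    return pattern.sub(
        lambda m: f"{m.group(1)}{package} {new_spec}{m.group(1)}", text
    )


# Hypothetical input line; 'example-lib' is a placeholder name.
sample = "install_requires=['example-lib >= 1.0', 'pillow >= 7.0.0']"
print(relax_requirement(sample, "pillow", ">= 6.2.1"))
```

Only the pillow entry is rewritten; other requirements are left untouched. A package build could apply the same substitution with sed instead.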

On another note, I strongly doubt that pillow-simd would yield any measurable change in performance so it would not be worth the effort to integrate this.

pigmonkey commented on 2020-06-12 21:45

This package is stuck on 9.8.2 until the community python-pillow package is upgraded to >=7.0.0.

pigmonkey commented on 2020-05-28 22:17

Thanks for identifying the issue. It looks like v9.8.1 fixes this and is in the process of being pushed to PyPI.

brianmercer commented on 2020-05-28 21:18

A temporary workaround is to roll back python-pdfminer to the prior version:

pacman -U /var/cache/pacman/pkg/python-pdfminer-20200402-1-any.pkg.tar.zst

and optionally add

IgnorePkg = python-pdfminer

to the [options] section of /etc/pacman.conf to keep it from upgrading for now.

chrisberkhout commented on 2020-05-28 21:05

Last line of the error message is

pkg_resources.DistributionNotFound: The 'pdfminer.six<=20200402,>=20181108' distribution was not found and is required by ocrmypdf

That comes from the python-pdfminer package, which is already in the dependencies. The problem is that the current version is python-pdfminer-20200517-1 and ocrmypdf apparently needs an earlier version.
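To make the conflict concrete, here is a minimal sketch of the version check. It assumes date-style versions can be compared as plain integers, which is a simplification of the PEP 440 rules that pkg_resources actually applies; the helper name `satisfies` is made up for this example.

```python
import operator

# Comparison operators supported by this simplified checker.
_OPS = {
    "<=": operator.le,
    ">=": operator.ge,
    "==": operator.eq,
    "<": operator.lt,
    ">": operator.gt,
}


def satisfies(version: str, spec: str) -> bool:
    """Return True if a date-style version meets every clause in spec.

    Simplification: versions such as 20200402 are compared as integers.
    Real tools (pkg_resources, packaging) implement full PEP 440 rules.
    """
    for clause in spec.split(","):
        clause = clause.strip()
        # Try two-character operators before one-character ones.
        for symbol in ("<=", ">=", "==", "<", ">"):
            if clause.startswith(symbol):
                bound = int(clause[len(symbol):].strip())
                if not _OPS[symbol](int(version), bound):
                    return False
                break
        else:
            raise ValueError(f"unsupported clause: {clause!r}")
    return True


spec = "<=20200402,>=20181108"  # taken from the error message
print(satisfies("20200402", spec))  # True: the older python-pdfminer build
print(satisfies("20200517", spec))  # False: the current python-pdfminer build
```

The newer build fails the upper bound, which is why the error names pdfminer.six even though the package is installed.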

It seems this has happened before:

I added a new issue:

oriba commented on 2020-05-28 20:26

ocrmypdf, built from this package, does not work anymore. It worked a few days ago. (Side note: I also had issues with matplotlib; some ugly things seem to be happening in the Python world these days.)

I got a quite long message, and one of the things mentioned was "pdfminer.six" together with ContextualVersionConflict.

Looking at the package-dependencies, pdfminer.six is not in there. So it should be added. Also certain versions seem to be needed. Let me know if you want the complete error message, then I may paste it somewhere.
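A quick way to see what the Python environment itself reports is to query the installed distribution metadata. This is a rough sketch using the standard library's importlib.metadata (Python 3.8 or later); the helper names are made up, and it only reads metadata rather than reproducing the check that raises ContextualVersionConflict.

```python
from importlib import metadata
from typing import List, Optional


def installed_version(dist_name: str) -> Optional[str]:
    """Return the installed version of a distribution, or None if absent."""
    try:
        return metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        return None


def declared_requirements(dist_name: str) -> List[str]:
    """Return the requirement strings a distribution declares, if any."""
    try:
        return metadata.requires(dist_name) or []
    except metadata.PackageNotFoundError:
        return []


for name in ("pdfminer.six", "ocrmypdf"):
    print(name, installed_version(name) or "not installed")

# Show only the requirement lines that mention pdfminer.
for req in declared_requirements("ocrmypdf"):
    if "pdfminer" in req.lower():
        print("ocrmypdf requires:", req)
```

Comparing the reported pdfminer.six version with the requirement line shows whether the installed build falls outside the range ocrmypdf declares.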

rharish commented on 2020-05-01 12:51

Is there a way to avoid using the egg files, or to remove the dependency checks altogether? Installing from the AUR should already ensure that the package's dependencies are met, so I don't think the checks are needed.

I already tried installing it through pip in a virtualenv, along with Pillow-SIMD, and it ignores the checks and directly works with Pillow-SIMD. So those checks can be skipped IMHO.