Package Details: ocrmypdf 16.7.0-1

Git Clone URL: https://aur.archlinux.org/ocrmypdf.git (read-only, click to copy)
Package Base: ocrmypdf
Description: A tool to add an OCR text layer to scanned PDF files, allowing them to be searched
Upstream URL: https://github.com/ocrmypdf/OCRmyPDF
Licenses: MPL2
Submitter: dreuter
Maintainer: fbrennan (pigmonkey)
Last Packager: pigmonkey
Votes: 124
Popularity: 2.78
First Submitted: 2014-01-27 11:36 (UTC)
Last Updated: 2024-12-10 05:10 (UTC)

Pinned Comments

fbrennan commented on 2023-05-12 22:54 (UTC)

The flag was invalid and has been removed with no action taken as no new version was released. There's nothing to do for this package; no new release has been made. Rebuild, as @eclairevoyant has said.

Latest Comments

« First ‹ Previous 1 .. 6 7 8 9 10 11 12 13 14 15 16 .. 22 Next › Last »

chrisberkhout commented on 2020-05-28 21:05 (UTC)

Last line of the error message is

pkg_resources.DistributionNotFound: The 'pdfminer.six<=20200402,>=20181108' distribution was not found and is required by ocrmypdf

That's from the python-pdfminer package, which is in the dependencies, it's just that the current version is python-pdfminer-20200517-1 and ocrmypdf apparently needs an earlier version.

It seems this has happened before: https://github.com/jbarlow83/OCRmyPDF/issues/457

I added a new issue: https://github.com/jbarlow83/OCRmyPDF/issues/566

oriba commented on 2020-05-28 20:26 (UTC)

ocrmypdf, built from this package, does not work anymore. Some days ago it worked. (Sidenote: I also had issues with matplotib, some ugly things may happen these days in the python field).

I got the a quite long message, and one of the things mentioned was "pdfminer.six" together with ContextualVersionConflict.

Looking at the package-dependencies, pdfminer.six is not in there. So it should be added. Also certain versions seem to be needed. Let me know if you want the complete error message, then I may paste it somewhere.

rharish commented on 2020-05-01 12:51 (UTC)

Does there exist a way to avoid using the egg files? Or somehow removing the dependency checks altogether? Installing from the AUR should ensure that the package has its dependencies met, so I don't think that the checks are needed.

I already tried installing it through pip in a virtualenv, along with Pillow-SIMD, and it ignores the checks and directly works with Pillow-SIMD. So those checks can be skipped IMHO.

pigmonkey commented on 2020-04-29 17:55 (UTC)

I'm not sure how to go about supporting Pillow-SMD in the package.

The PKGBUILD installs the package via setuptools, which results in an egg. The egg includes a requirement of Pillow>=6.2.0. You can see this at /usr/lib/python3.8/site-packages/ocrmypdf-9.7.2-py3.8.egg-info/requires.txt (or a similar path, depending on your version). That results in the error you're seeing.

I think this would be avoided if the package were installed via pip, but the wiki discourages that. And I think even then you'd end up just moving the problem to a different level: the python-reportlab package is also installed via setuptools so is going to have an egg that looks for that same Pillow package. You'd get the same error, but it would be thrown by reportlab rather than ocrmypdf.

rharish commented on 2020-04-29 05:41 (UTC) (edited on 2020-04-29 05:41 (UTC) by rharish)

This does not work with Pillow-SIMD. This is the issue that I created upstream. Here are the logs when I run ocrmypdf --help:

Traceback (most recent call last):
File "/usr/bin/ocrmypdf", line 6, in <module>
    from pkg_resources import load_entry_point
File "/usr/lib/python3.8/site-packages/pkg_resources/__init__.py", line 3259, in <module>
    def _initialize_master_working_set():
File "/usr/lib/python3.8/site-packages/pkg_resources/__init__.py", line 3242, in _call_aside
    f(*args, **kwargs)
File "/usr/lib/python3.8/site-packages/pkg_resources/__init__.py", line 3271, in _initialize_master_working_set
    working_set = WorkingSet._build_master()
File "/usr/lib/python3.8/site-packages/pkg_resources/__init__.py", line 584, in _build_master
    ws.require(__requires__)
File "/usr/lib/python3.8/site-packages/pkg_resources/__init__.py", line 901, in require
    needed = self.resolve(parse_requirements(requirements))
File "/usr/lib/python3.8/site-packages/pkg_resources/__init__.py", line 787, in resolve
    raise DistributionNotFound(req, requirers)
pkg_resources.DistributionNotFound: The 'Pillow>=6.2.0' distribution was not found and is required by ocrmypdf

pigmonkey commented on 2020-04-22 17:58 (UTC)

Thanks. It looks like the confusingly named python-pdfminer package in community does indeed provide the needed python-pdfminder.six library rather than the abandoned python-pdfminer library.

That was the last AUR dependency, so maybe there's a chance of this getting adopted into community now.

petRUShka commented on 2020-04-22 10:04 (UTC) (edited on 2020-04-22 10:07 (UTC) by petRUShka)

Dependency aur/python-pdfminer.six possible should be replaced with community/python-pdfminer.

fbrennan commented on 2020-04-10 23:20 (UTC)

No, the computer was broken in transit. I still have it, just due to the pandemic the parts to fix it are arriving slowly. And it's made for 240V, and I now live in a 120V country.

bsdice commented on 2020-04-10 22:41 (UTC)

@fbrennan Does that mean some Philippine police jockey can now upload trojaned PKGBUILDs in your name? If true we should summon the help of an AUR admin to delete your key.