Thanks for identifying the issue. It looks like v9.8.1 fixes this and is in the process of being pushed to pypi.
Search Criteria
Package Details: ocrmypdf 16.10.1-1
Package Actions
Git Clone URL: | https://aur.archlinux.org/ocrmypdf.git (read-only, click to copy) |
---|---|
Package Base: | ocrmypdf |
Description: | A tool to add an OCR text layer to scanned PDF files, allowing them to be searched |
Upstream URL: | https://github.com/ocrmypdf/OCRmyPDF |
Licenses: | MPL2 |
Submitter: | dreuter |
Maintainer: | fbrennan (pigmonkey) |
Last Packager: | pigmonkey |
Votes: | 129 |
Popularity: | 2.02 |
First Submitted: | 2014-01-27 11:36 (UTC) |
Last Updated: | 2025-04-25 03:37 (UTC) |
Dependencies (21)
- ghostscript
- img2pdf (img2pdf-gitAUR)
- pngquant
- python (python37AUR, python311AUR, python310AUR)
- python-deprecation
- python-importlib_resources
- python-packaging
- python-pdfminer
- python-pikepdf
- python-pillow (python-pillow-simd-gitAUR)
- python-pluggy
- python-reportlab
- python-rich
- python-tqdm
- tesseract (tesseract-gitAUR)
- unpaper (unpaper-gitAUR)
- python-build (make)
- python-hatch-vcs (make)
- python-installer (make)
- python-wheel (make)
- Show 1 more dependencies...
Required by (5)
- docspell-joex (optional)
- dpsprep-git (optional)
- phoronix-test-suite-git (optional)
- python-ocrmypdf-papermerge
- riven-original-soundtrack (make)
Sources (1)
Latest Comments
« First ‹ Previous 1 .. 6 7 8 9 10 11 12 13 14 15 16 .. 22 Next › Last »
pigmonkey commented on 2020-05-28 22:17 (UTC)
brianmercer commented on 2020-05-28 21:18 (UTC)
Temporary workaround is to roll back python-pdfminer to the prior version:
pacman -U /var/cache/pacman/pkg/python-pdfminer-20200402-1-any.pkg.tar.zst
and optionally add
IgnorePkg = python-pdfminer
to the /etc/pacman.conf file to keep it from upgrading for now.
chrisberkhout commented on 2020-05-28 21:05 (UTC)
Last line of the error message is
pkg_resources.DistributionNotFound: The 'pdfminer.six<=20200402,>=20181108' distribution was not found and is required by ocrmypdf
That's from the python-pdfminer
package, which is in the dependencies, it's just that the current version is python-pdfminer-20200517-1
and ocrmypdf
apparently needs an earlier version.
It seems this has happened before: https://github.com/jbarlow83/OCRmyPDF/issues/457
I added a new issue: https://github.com/jbarlow83/OCRmyPDF/issues/566
oriba commented on 2020-05-28 20:26 (UTC)
ocrmypdf, built from this package, does not work anymore. Some days ago it worked. (Sidenote: I also had issues with matplotib, some ugly things may happen these days in the python field).
I got the a quite long message, and one of the things mentioned was "pdfminer.six" together with ContextualVersionConflict.
Looking at the package-dependencies, pdfminer.six is not in there. So it should be added. Also certain versions seem to be needed. Let me know if you want the complete error message, then I may paste it somewhere.
rharish commented on 2020-05-01 12:51 (UTC)
Does there exist a way to avoid using the egg files? Or somehow removing the dependency checks altogether? Installing from the AUR should ensure that the package has its dependencies met, so I don't think that the checks are needed.
I already tried installing it through pip in a virtualenv, along with Pillow-SIMD, and it ignores the checks and directly works with Pillow-SIMD. So those checks can be skipped IMHO.
pigmonkey commented on 2020-04-29 17:55 (UTC)
I'm not sure how to go about supporting Pillow-SMD in the package.
The PKGBUILD installs the package via setuptools, which results in an egg. The egg includes a requirement of Pillow>=6.2.0
. You can see this at /usr/lib/python3.8/site-packages/ocrmypdf-9.7.2-py3.8.egg-info/requires.txt
(or a similar path, depending on your version). That results in the error you're seeing.
I think this would be avoided if the package were installed via pip, but the wiki discourages that. And I think even then you'd end up just moving the problem to a different level: the python-reportlab
package is also installed via setuptools so is going to have an egg that looks for that same Pillow package. You'd get the same error, but it would be thrown by reportlab rather than ocrmypdf.
rharish commented on 2020-04-29 05:41 (UTC) (edited on 2020-04-29 05:41 (UTC) by rharish)
This does not work with Pillow-SIMD. This is the issue that I created upstream. Here are the logs when I run ocrmypdf --help
:
Traceback (most recent call last):
File "/usr/bin/ocrmypdf", line 6, in <module>
from pkg_resources import load_entry_point
File "/usr/lib/python3.8/site-packages/pkg_resources/__init__.py", line 3259, in <module>
def _initialize_master_working_set():
File "/usr/lib/python3.8/site-packages/pkg_resources/__init__.py", line 3242, in _call_aside
f(*args, **kwargs)
File "/usr/lib/python3.8/site-packages/pkg_resources/__init__.py", line 3271, in _initialize_master_working_set
working_set = WorkingSet._build_master()
File "/usr/lib/python3.8/site-packages/pkg_resources/__init__.py", line 584, in _build_master
ws.require(__requires__)
File "/usr/lib/python3.8/site-packages/pkg_resources/__init__.py", line 901, in require
needed = self.resolve(parse_requirements(requirements))
File "/usr/lib/python3.8/site-packages/pkg_resources/__init__.py", line 787, in resolve
raise DistributionNotFound(req, requirers)
pkg_resources.DistributionNotFound: The 'Pillow>=6.2.0' distribution was not found and is required by ocrmypdf
pigmonkey commented on 2020-04-22 17:58 (UTC)
Thanks. It looks like the confusingly named python-pdfminer package in community does indeed provide the needed python-pdfminder.six library rather than the abandoned python-pdfminer library.
That was the last AUR dependency, so maybe there's a chance of this getting adopted into community now.
petRUShka commented on 2020-04-22 10:04 (UTC) (edited on 2020-04-22 10:07 (UTC) by petRUShka)
Dependency aur/python-pdfminer.six
possible should be replaced with community/python-pdfminer
.
Pinned Comments
fbrennan commented on 2023-05-12 22:54 (UTC)
The flag was invalid and has been removed with no action taken as no new version was released. There's nothing to do for this package; no new release has been made. Rebuild, as @eclairevoyant has said.