Package Details: ocrmypdf 16.2.0-1

Git Clone URL: https://aur.archlinux.org/ocrmypdf.git (read-only, click to copy)
Package Base: ocrmypdf
Description: A tool to add an OCR text layer to scanned PDF files, allowing them to be searched
Upstream URL: https://github.com/ocrmypdf/OCRmyPDF
Licenses: MPL2
Submitter: dreuter
Maintainer: fbrennan (pigmonkey)
Last Packager: pigmonkey
Votes: 110
Popularity: 1.51
First Submitted: 2014-01-27 11:36 (UTC)
Last Updated: 2024-04-19 19:30 (UTC)

Pinned Comments

fbrennan commented on 2023-05-12 22:54 (UTC)

The flag was invalid and has been removed with no action taken as no new version was released. There's nothing to do for this package; no new release has been made. Rebuild, as @eclairevoyant has said.

Latest Comments

« First ‹ Previous 1 .. 9 10 11 12 13 14 15 16 17 18 19 .. 22 Next › Last »

bsdice commented on 2019-10-25 13:23 (UTC)

@brianmercer et al. Package needs to be updated to 9.0.3 which fixes https://github.com/jbarlow83/OCRmyPDF/commit/17ac9d7a9a296ae3d50146fbefad5281e2851b0f

The backstory is that ghostscript tightened security after taviso took a stab at it security-wise back in summer of 2018. You can fix it yourself in the meantime, by:

1) Downloading raw PKGBUILD file into a temp directory

2) Edit these lines to say

pkgver=9.0.3

source=("https://files.pythonhosted.org/packages/6b/8c/d8a9132e050ac25ea5da63fabc1a1fc0246beee72701b372c35221a40237/ocrmypdf-9.0.3.tar.gz")

sha256sums=('3d9b92f6a01d0711e4156c6b36638d9d946d010e2925ec473ec7f666096cceeb')

3) makepkg -Ccfi

brianmercer commented on 2019-10-21 18:37 (UTC)

I started getting a set of errors with the 9.0.1 version. I edited the PKGBUILD to upgrade to version 9.0.3 of ocrmypdf and they went away.

These are the errors:

ERROR - GPL Ghostscript 9.50: Setting Overprint Mode to 1 not permitted in PDF/A-2, overprint mode not set

Error: /invalidfileaccess in --file-- Operand stack: --nostringval-- --nostringval-- (/usr/lib/python3.7/site-packages/ocrmypdf/data/sRGB.icc) (r) Execution stack: %interp_exit .runexec2 --nostringval-- --nostringval-- --nostringval-- 2 %stopped_push --nostringval-- --nostringval-- --nostringval-- false 1 %stopped_push 1990 1 3 %oparray_pop 1989 1 3 %oparray_pop 1977 1 3 %oparray_pop 1833 1 3 %oparray_pop --nostringval-- %errorexec_pop .runexec2 --nostringval-- --nostringval-- --nostringval-- 2 %stopped_push --nostringval-- Dictionary stack: --dict:737/1123(ro)(G)-- --dict:1/20(G)-- --dict:76/200(L)-- Current allocation mode is local Last OS error: Permission denied Current file position is 580 GPL Ghostscript 9.50: Unrecoverable error, exit code 1 ERROR - SubprocessOutputError: Ghostscript PDF/A rendering failed

Fifis commented on 2019-07-31 13:03 (UTC)

For the latest ocrmypdf 8.3.2, I had to update pikepdf to 1.5.0.post0. Had a bit of trouble overwriting old package versions, e. g. sudo pacman -S --overwrite="*" python-ply python-pycparser img2pdf python-cffi python-defusedxml python-lxml python-reportlab to get ocrmypdf 8.3.2 to work.

john-soda commented on 2019-02-10 23:36 (UTC)

@fbrennan I really don't know what the problem is, that it can't reach setuptools_scm_git_archive. I downloaded the package manually and edited the PKGBUILD that it points to my local downloaded version. Now it works! Thanks for your help.

fbrennan commented on 2019-02-09 05:38 (UTC)

@john-soda I can install the latest version just fine for me, it seems to me you have a DNS resolution problem for the pypi domain.

john-soda commented on 2019-02-02 11:04 (UTC)

When I want to install ocrmypdf I get always the Error:

Could not find suitable distribution for Requirement.parse('setuptools_scm_git_archive')

Here the full log https://pastebin.com/xsqzeqr0

How can I install the newest version?

fbrennan commented on 2019-01-17 08:18 (UTC)

My apologies to all stakeholders waiting on me. I came down with a serious illness. Rest assured this is not forgotten or abandoned. I will update it in due time. Thanks

jbarlow commented on 2019-01-12 08:49 (UTC) (edited on 2019-01-12 08:52 (UTC) by jbarlow)

@fbrennan

v8 makes pdfminer.six "technically optional". setup.py still lists it as required, but downstream maintainers at their option may delete pdfminer.six from setup.py in their scripts, at the cost of the --redo-ocr feature. I will support this arrangement until the packaging situation for pdfminer.six improves. (I am doing it this way because "pip install ocrmypdf" works fine with pdfminer.six.)

v8 also drops python-xmp-toolkit because of the difficulties some downstream consumers had with it.

Thanks again for maintaining ocrmypdf for the ArchLinux community.

-Upstream

fbrennan commented on 2018-11-27 06:19 (UTC)

I thought of that @bsdice but it breaks the AUR Rules of Submission. https://wiki.archlinux.org/index.php/Arch_User_Repository#Rules_of_submission

The more and more AUR dependencies that get added the more difficult this gets to maintain and the more people we need to rely on. Fortunately I maintain python-pikepdf, python-ruffus and python-xmp-toolkit, the major AUR deps of this package before the recent update. I don't think it's kosher for me to make a metapackage which would install every dependency and have a replaces/conflicts/provides either.

For now I recommend users assure that python-sortedcontainers is installed before attempting to build ocrmypdf & deps. I'm sure the maintainer of python-pdfminer.six will add it to the manifest as soon as they can.

bsdice commented on 2018-11-27 05:40 (UTC)

@fbrennan: A workaround could have been to create a package called "python-pdfminer-six" and use the following statements:

  • replaces=('python-pdfminer.six')
  • conflicts=('python-pdfminer.six')
  • provides=('python-pdfminer.six')

Unfortunately python-pdfminer.six in AUR is missing a dependancy. Workaround is to still use http://termbin.com/k46k.