Package Details: ocrmypdf 16.2.0-1

Git Clone URL: https://aur.archlinux.org/ocrmypdf.git (read-only, click to copy)
Package Base: ocrmypdf
Description: A tool to add an OCR text layer to scanned PDF files, allowing them to be searched
Upstream URL: https://github.com/ocrmypdf/OCRmyPDF
Licenses: MPL2
Submitter: dreuter
Maintainer: fbrennan (pigmonkey)
Last Packager: pigmonkey
Votes: 110
Popularity: 1.51
First Submitted: 2014-01-27 11:36 (UTC)
Last Updated: 2024-04-19 19:30 (UTC)

Pinned Comments

fbrennan commented on 2023-05-12 22:54 (UTC)

The flag was invalid and has been removed with no action taken as no new version was released. There's nothing to do for this package; no new release has been made. Rebuild, as @eclairevoyant has said.

Latest Comments

« First ‹ Previous 1 .. 10 11 12 13 14 15 16 17 18 19 20 .. 22 Next › Last »

Harvey commented on 2018-11-26 15:44 (UTC)

python-pdfminer.six has been updated to version 20181108-1 ;)

fbrennan commented on 2018-11-26 11:50 (UTC) (edited on 2018-11-26 11:52 (UTC) by fbrennan)

Unfortunately my friends, we've hit a snag. Someone else is using the name python-pdfminer.six :-(

https://aur.archlinux.org/pkgbase/python-pdfminer.six/#news

I put a working PKGBUILD there. But unfortunately I cannot upload my new ocrmypdf, which works fine, until this user makes a decision. That is because ocrmypdf requires a higher version than theirs.

https://github.com/jbarlow83/OCRmyPDF/blob/0f5c484b626632aa68259eda16ff2c1b87a42104/requirements/main.txt#L7

I sincerely apologize for the long wait. If you are good with makepkg and pacman, you can use these two PKGBUILDS:

If not, you will just have to wait for python-pdfminer.six to be updated, by either ishitatsuyuki or me if he orphans.

fbrennan commented on 2018-11-23 23:08 (UTC)

I hope to release the upgrade to 7.3.1 today (GMT+8).

I apologize for the wait after the orphan notification.

Harvey commented on 2018-11-22 11:58 (UTC)

Version 7.3.1 looks very promising, depending on the release notes https://github.com/jbarlow83/OCRmyPDF/blob/master/docs/release_notes.rst Is there any chance for an update? I see there is a new dependency to pdfminer.six 20181108...

marlemion commented on 2018-11-06 09:18 (UTC) (edited on 2018-11-06 09:32 (UTC) by marlemion)

@bsdice: Thanks, but that did not help. Same error. On another machine, ocrmypdf is working. So it must be some issue on that machine...

Btw. ocrmypdf was working for ages on that machine, but I had to hold back leptonica for other reasons, so it was stuck to a certain version for some time....

Found the Problem: I had installed python2-jmespath-0.9.3-2. This package installs /usr/bin/jp2.py. For some reason, python looked at this jp2.py instead of /usr/lib/python3.x/site-packages/jp2.py. After removing python2-jmespath-0.9.3-2, it works. However, such a behaviour is irritating.

bsdice commented on 2018-11-06 09:16 (UTC)

@marlemion: Replace aur/img2pdf-git 0.2.1.r8.geedf73e-1 with normal img2pdf 0.3.1-1 and see what happens. pacman -Rd img2pdf-git ; pacman -S --asdeps img2pdf ; or something like that.

marlemion commented on 2018-11-06 08:57 (UTC)

I would like to update to the most recent version of ocrmypdf. Builds fine, but throws this error:

Traceback (most recent call last): File "/usr/bin/ocrmypdf", line 11, in <module> load_entry_point('ocrmypdf==7.2.1', 'console_scripts', 'ocrmypdf')() File "/usr/lib/python3.7/site-packages/pkg_resources/init.py", line 484, in load_entry_point return get_distribution(dist).load_entry_point(group, name) File "/usr/lib/python3.7/site-packages/pkg_resources/init.py", line 2725, in load_entry_point return ep.load() File "/usr/lib/python3.7/site-packages/pkg_resources/init.py", line 2343, in load return self.resolve() File "/usr/lib/python3.7/site-packages/pkg_resources/init.py", line 2349, in resolve module = import(self.module_name, fromlist=['name'], level=0) File "/usr/lib/python3.7/site-packages/ocrmypdf/main.py", line 36, in <module> from ._pipeline import build_pipeline File "/usr/lib/python3.7/site-packages/ocrmypdf/_pipeline.py", line 26, in <module> import img2pdf File "/usr/lib/python3.7/site-packages/img2pdf.py", line 28, in <module> from jp2 import parsejp2 ImportError: cannot import name 'parsejp2' from 'jp2' (/usr/bin/jp2.py)

img2pdf-git has been rebuilt. No effect.

fbrennan commented on 2018-10-02 03:52 (UTC)

I think lossy mode should still be selectable because it's only dangerous in certain situations and leads to really small files otherwise. It just shouldn't be default.