Package Details: ocrmypdf 16.6.1-1

Git Clone URL: https://aur.archlinux.org/ocrmypdf.git (read-only, click to copy)
Package Base: ocrmypdf
Description: A tool to add an OCR text layer to scanned PDF files, allowing them to be searched
Upstream URL: https://github.com/ocrmypdf/OCRmyPDF
Licenses: MPL2
Submitter: dreuter
Maintainer: fbrennan (pigmonkey)
Last Packager: pigmonkey
Votes: 122
Popularity: 2.39
First Submitted: 2014-01-27 11:36 (UTC)
Last Updated: 2024-11-11 23:40 (UTC)

Pinned Comments

fbrennan commented on 2023-05-12 22:54 (UTC)

The flag was invalid and has been removed with no action taken as no new version was released. There's nothing to do for this package; no new release has been made. Rebuild, as @eclairevoyant has said.

Latest Comments

« First ‹ Previous 1 2 3 4 5 6 7 8 9 10 11 12 .. 22 Next › Last »

jbarlow commented on 2021-12-24 00:06 (UTC)

@nottoday Python-pikepdf is likely out of date.

nottoday commented on 2021-12-23 16:49 (UTC) (edited on 2021-12-24 13:35 (UTC) by nottoday)

I have a problem that has started since 13.0.0.

The following command

$ ocrmypdf ./test.scan.pdf ./test.pdf

gives the following error output.

    1 [tesseract] lots of diacritics - possibly poor OCR
An exception occurred while executing the pipeline
Traceback (most recent call last):
  File "/usr/lib/python3.9/site-packages/ocrmypdf/_sync.py", line 385, in run_pipeline
    exec_concurrent(context, executor)
  File "/usr/lib/python3.9/site-packages/ocrmypdf/_sync.py", line 274, in exec_concurrent
    executor(
  File "/usr/lib/python3.9/site-packages/ocrmypdf/_concurrent.py", line 82, in __call__
    self._execute(
  File "/usr/lib/python3.9/site-packages/ocrmypdf/builtin_plugins/concurrency.py", line 136, in _execute
    task_finished(result, pbar)
  File "/usr/lib/python3.9/site-packages/ocrmypdf/_sync.py", line 264, in update_page
    ocrgraft.graft_page(
  File "/usr/lib/python3.9/site-packages/ocrmypdf/_graft.py", line 142, in graft_page
    self._graft_text_layer(
  File "/usr/lib/python3.9/site-packages/ocrmypdf/_graft.py", line 304, in _graft_text_layer
    base_page.contents_add(new_text_layer, prepend=True)
AttributeError: contents_add

I'm on Manjaro in case that makes a difference.

Thanks in advance.

malacology commented on 2021-12-14 00:59 (UTC)

okay, thanks it is solved

pigmonkey commented on 2021-12-14 00:50 (UTC)

Python AUR packages need to be rebuilt after Python upgrades.

The version bump I just pushed for 13.1.1 will cause this package to get rebuilt, however you will need to manually rebuild any AUR Python dependencies which have not incremented their pkgrel for the new Python (python-coloredlogs, python-humanfriendly). There's nothing we can do about those from this package.

malacology commented on 2021-12-13 22:44 (UTC)

ocrmypdf                                                                                                                ░▒▓ ✔  22:43:39  ▓▒░
Traceback (most recent call last):
  File "/usr/bin/ocrmypdf", line 33, in <module>
    sys.exit(load_entry_point('ocrmypdf==13.1.0', 'console_scripts', 'ocrmypdf')())
  File "/usr/bin/ocrmypdf", line 22, in importlib_load_entry_point
    for entry_point in distribution(dist_name).entry_points
  File "/usr/lib/python3.10/importlib/metadata/__init__.py", line 919, in distribution
    return Distribution.from_name(distribution_name)
  File "/usr/lib/python3.10/importlib/metadata/__init__.py", line 518, in from_name
    raise PackageNotFoundError(name)
importlib.metadata.PackageNotFoundError: No package metadata was found for ocrmypdf

After upgrade to python 3.10

https://github.com/ocrmypdf/OCRmyPDF/issues/872#issuecomment-992025153

jvn01 commented on 2021-10-02 13:53 (UTC)

Give me error " python-distlib-0.3.2-1-any.pkg.tar.zst failed to download"

bot198042362134 commented on 2021-09-23 07:48 (UTC)

There are two missing dependencies: tesseract-data-eng and python-sortedcontainers

To solve this issue simply do:

pacman -S tesseract-data-eng python-sortedcontainers

lightsaber commented on 2021-08-24 18:31 (UTC)

Got this traceback:

Traceback (most recent call last):
  File "/usr/bin/ocrmypdf", line 33, in <module>
    sys.exit(load_entry_point('ocrmypdf==12.3.2', 'console_scripts', 'ocrmypdf')())
  File "/usr/bin/ocrmypdf", line 25, in importlib_load_entry_point
    return next(matches).load()
  File "/usr/lib/python3.9/importlib/metadata.py", line 77, in load
    module = import_module(match.group('module'))
  File "/usr/lib/python3.9/importlib/__init__.py", line 127, in import_module
    return _bootstrap._gcd_import(name[level:], package, level)
  File "<frozen importlib._bootstrap>", line 1030, in _gcd_import
  File "<frozen importlib._bootstrap>", line 1007, in _find_and_load
  File "<frozen importlib._bootstrap>", line 972, in _find_and_load_unlocked
  File "<frozen importlib._bootstrap>", line 228, in _call_with_frames_removed
  File "<frozen importlib._bootstrap>", line 1030, in _gcd_import
  File "<frozen importlib._bootstrap>", line 1007, in _find_and_load
  File "<frozen importlib._bootstrap>", line 986, in _find_and_load_unlocked
  File "<frozen importlib._bootstrap>", line 680, in _load_unlocked
  File "<frozen importlib._bootstrap_external>", line 850, in exec_module
  File "<frozen importlib._bootstrap>", line 228, in _call_with_frames_removed
  File "/usr/lib/python3.9/site-packages/ocrmypdf/__init__.py", line 10, in <module>
    from ocrmypdf import helpers, hocrtransform, leptonica, pdfa, pdfinfo
  File "/usr/lib/python3.9/site-packages/ocrmypdf/helpers.py", line 22, in <module>
    import pikepdf
  File "/usr/lib/python3.9/site-packages/pikepdf/__init__.py", line 19, in <module>
    from ._version import __version__
  File "/usr/lib/python3.9/site-packages/pikepdf/_version.py", line 7, in <module>
    from pkg_resources import DistributionNotFound
  File "/usr/lib/python3.9/site-packages/pkg_resources/__init__.py", line 3243, in <module>
    def _initialize_master_working_set():
  File "/usr/lib/python3.9/site-packages/pkg_resources/__init__.py", line 3226, in _call_aside
    f(*args, **kwargs)
  File "/usr/lib/python3.9/site-packages/pkg_resources/__init__.py", line 3255, in _initialize_master_working_set
    working_set = WorkingSet._build_master()
  File "/usr/lib/python3.9/site-packages/pkg_resources/__init__.py", line 568, in _build_master
    ws.require(__requires__)
  File "/usr/lib/python3.9/site-packages/pkg_resources/__init__.py", line 886, in require
    needed = self.resolve(parse_requirements(requirements))
  File "/usr/lib/python3.9/site-packages/pkg_resources/__init__.py", line 772, in resolve
    raise DistributionNotFound(req, requirers)
pkg_resources.DistributionNotFound: The 'coloredlogs>=14.0' distribution was not found and is required by ocrmypdf