Wonderful software and software choices. Thank you!
Package Details: ocrmypdf 17.4.2-1
| Git Clone URL: | https://aur.archlinux.org/ocrmypdf.git |
|---|---|
| Package Base: | ocrmypdf |
| Description: | A tool to add an OCR text layer to scanned PDF files, allowing them to be searched |
| Upstream URL: | https://github.com/ocrmypdf/OCRmyPDF |
| Licenses: | MPL2 |
| Submitter: | dreuter |
| Maintainer: | fbrennan (pigmonkey) |
| Last Packager: | pigmonkey |
| Votes: | 152 |
| Popularity: | 5.95 |
| First Submitted: | 2014-01-27 11:36 (UTC) |
| Last Updated: | 2026-04-21 19:18 (UTC) |
Dependencies (24)
- ghostscript (ghostscript-git [AUR])
- img2pdf (img2pdf-git [AUR])
- pngquant
- python
- python-deprecation
- python-fpdf2 [AUR]
- python-importlib_resources
- python-packaging
- python-pdfminer
- python-pikepdf
- python-pillow (python-pillow-simd-git [AUR], hosaka-studio [AUR])
- python-pluggy
- python-pydantic
- python-reportlab
- python-rich
- python-tqdm
- python-uharfbuzz
- tesseract (tesseract-git [AUR])
- unpaper (unpaper-git [AUR])
- python-build (make)
- (4 more dependencies not shown)
Required by (6)
- docspell-joex (optional)
- dpsprep (optional)
- dpsprep-git (optional)
- phoronix-test-suite-git (optional)
- riven-original-soundtrack (make)
- stirling-pdf-bin
Sources (1)
maison commented on 2026-05-15 16:20 (UTC)
EgidioCaprino commented on 2026-04-30 12:26 (UTC)
Thank you @dulzamacis 🙏
dulzamacis commented on 2026-04-29 06:35 (UTC) (edited on 2026-04-29 07:49 (UTC) by dulzamacis)
ocrmypdf has been broken since the system updated python-uharfbuzz to version 0.54.0-1. I had to downgrade to the previous version, 0.53.7-1, and everything is working again. This is the error:
ImportError: /usr/lib/python3.14/site-packages/uharfbuzz/_harfbuzz.abi3.so: undefined symbol: hb_raster_image_get_extents
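A minimal sketch for checking whether an installation is affected by the breakage described above: it attempts the import and reports the loader error instead of crashing. The helper name `try_import` is hypothetical; only the module name `uharfbuzz` comes from the traceback.

```python
import importlib


def try_import(module_name):
    """Return (True, "") if the module imports, else (False, error text)."""
    try:
        importlib.import_module(module_name)
    except ImportError as exc:
        # Covers both a missing module and a compiled extension that
        # fails to load (for example, an undefined symbol in a shared object).
        return False, str(exc)
    return True, ""


if __name__ == "__main__":
    ok, error = try_import("uharfbuzz")
    print("uharfbuzz imports cleanly" if ok else f"uharfbuzz is broken: {error}")
```

Running this before and after a downgrade should show whether the undefined-symbol error has gone away.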
jav209 commented on 2026-04-24 13:08 (UTC)
I'm loving this package. Thanks to the maintainers. Hope this will make it into a repository soon.
martin-de commented on 2026-02-02 12:00 (UTC) (edited on 2026-02-02 12:01 (UTC) by martin-de)
Since the update to Python 3.14 and the transition to OCRmyPDF v17.x, the current depends array is incomplete, leading to ModuleNotFoundError at runtime. Please add the following mandatory dependencies:
- python-pydantic (required for OcrOptions validation)
- python-uharfbuzz (required for text shaping)
- python-fpdf2 (required for the new default PDF renderer)
Without these, the application fails to start even if the build process completes successfully.
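To see which of these are missing on a given system, something along the following lines could help. This is only a sketch: it assumes the import names `pydantic`, `uharfbuzz`, and `fpdf` for the three packages (fpdf2 installs under the module name `fpdf`), and the function name `missing_modules` is made up for illustration.

```python
from importlib.util import find_spec

# Assumed mapping from Arch package name to Python import name.
REQUIRED = {
    "python-pydantic": "pydantic",
    "python-uharfbuzz": "uharfbuzz",
    "python-fpdf2": "fpdf",
}


def missing_modules(required):
    """Return the package names whose import name cannot be located."""
    return [pkg for pkg, mod in required.items() if find_spec(mod) is None]


if __name__ == "__main__":
    missing = missing_modules(REQUIRED)
    print("missing:", ", ".join(missing) if missing else "none")
```

Note that `find_spec` only locates a module without importing it, so it detects absent packages but not a package that is installed yet fails to load.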
jbarlow commented on 2026-01-31 08:07 (UTC)
Upstream here - v17 introduces some new features and new recommended dependencies:
https://github.com/ocrmypdf/OCRmyPDF/blob/main/docs/maintainers.md#dependency-matrix-for-packagers
mostafatouny commented on 2025-04-18 19:48 (UTC)
If anyone runs into this error: ImportError(_msg) from _e ImportError: pikepdf's extension library failed to import
Try pip check or pip install pikepdf, and install any missing dependencies.
Then reinstall: pip install ocrmypdf --force-reinstall
ioan commented on 2024-06-26 18:24 (UTC)
The optional dependency 'jbig2' was not found, so some image optimizations could not be attempted.
jbig2enc
wat
Pinned Comments
fbrennan commented on 2023-05-12 22:54 (UTC)
The flag was invalid and has been removed with no action taken as no new version was released. There's nothing to do for this package; no new release has been made. Rebuild, as @eclairevoyant has said.