Package Details: tesseract-git 1:5.0.1.r9.g31a96843-1

Git Clone URL: https://aur.archlinux.org/tesseract-git.git (read-only)
Package Base: tesseract-git
Description: Open Source OCR Engine
Upstream URL: https://tesseract-ocr.github.io/
Licenses: Apache
Conflicts: tesseract
Provides: tesseract
Submitter: stativ
Maintainer: FabioLolix
Last Packager: FabioLolix
Votes: 32
Popularity: 0.000000
First Submitted: 2015-07-02 20:30 (UTC)
Last Updated: 2022-01-23 21:46 (UTC)

Required by (221)

Sources (2)

Latest Comments

Sematre commented on 2022-01-12 17:29 (UTC)

As there now seems to be a new maintainer, I will just post my updated PKGBUILD file here:

# Maintainer: /
# Contributor: Sematre <sematre at gmx dot de>
# Contributor: Lukas Jirkovsky <l.jirkovsky@gmail.com>
pkgname=tesseract-git
pkgver=5.0.0.r27.gbf7c134b
pkgrel=1

pkgdesc="Open Source text recognition (OCR) Engine"
arch=('x86_64')  # builds compiled binaries, so not 'any'
url="https://github.com/tesseract-ocr/tesseract"
license=('Apache')

depends=('libpng' 'libtiff' 'libjpeg-turbo' 'zlib' 'giflib' 'gcc-libs' 'curl' 'libarchive' 'leptonica')
optdepends=('pango: text2image' 'icu: text2image' 'tesseract-data-git: Trained language data')
# autoconf-archive is needed by ./autogen.sh on a fresh system
makedepends=('git' 'icu' 'cairo' 'pango' 'asciidoc' 'autoconf-archive')

provides=("${pkgname%-git}")
conflicts=("${pkgname%-git}")

install=tesseract.install
source=("${pkgname%-git}::git+${url}.git")
sha256sums=('SKIP')

pkgver() {
  cd "${pkgname%-git}"
  git describe --long --tags | sed 's/\([^-]*-g\)/r\1/;s/-/./g'
}

build() {
  cd "${srcdir}/${pkgname%-git}"

  ./autogen.sh
  ./configure --prefix=/usr

  make
  make training
}

package() {
  cd "${srcdir}/${pkgname%-git}"

  make DESTDIR="${pkgdir}" install
  make DESTDIR="${pkgdir}" training-install
}
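
To make the pkgver() step above easier to follow, here is a minimal sketch that runs the same sed expression on a sample `git describe --long --tags` string instead of a real checkout. The helper name describe_to_pkgver is made up for illustration and is not part of the PKGBUILD.

```shell
#!/bin/sh
# Hypothetical helper (not in the PKGBUILD): applies the sed expression
# from pkgver() to a describe-style string passed as $1.
#   1st substitution: prefix the commit count with "r" (27-g -> r27-g)
#   2nd substitution: turn every "-" into ".", since pkgver may not contain "-"
describe_to_pkgver() {
  printf '%s\n' "$1" | sed 's/\([^-]*-g\)/r\1/;s/-/./g'
}

# Sample input in the tag-count-ghash form that `git describe --long` emits.
describe_to_pkgver '5.0.0-27-gbf7c134b'
# prints: 5.0.0.r27.gbf7c134b
```

The result has the same shape as the pkgver= line in the PKGBUILD: the tag, then r plus the number of commits since that tag, then g plus the abbreviated commit hash.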

Sematre commented on 2021-12-26 18:09 (UTC) (edited on 2021-12-26 18:12 (UTC) by Sematre)

I just sent an e-mail to the maintainer and suggested some changes to the PKGBUILD file. Let's hope he/she sees it.

LiteracyFanatic commented on 2021-09-13 11:43 (UTC)

Speaking of pkgver(), this package fails to build now because the default branch was renamed from master to main (sigh). Changing master to HEAD in pkgver() fixes the issue.
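
A rough sketch of why HEAD survives a branch rename while a hard-coded branch name does not. The package's pkgver() at the time is not shown in this thread, so the commit-count form below is an assumption, and count_commits and head_pkgver are hypothetical helper names.

```shell
#!/bin/sh
# Assumption: the old pkgver() counted commits on a named branch.
# count_commits (hypothetical) prints the number of commits reachable from
# ref $2 in the repository at $1, and fails if that ref does not exist.
count_commits() {
  git -C "$1" rev-list --count "$2" 2>/dev/null
}

# head_pkgver (hypothetical) builds an r<count>.<shorthash> string from
# HEAD, so it does not care whether the branch is called master or main.
head_pkgver() {
  printf 'r%s.%s\n' \
    "$(git -C "$1" rev-list --count HEAD)" \
    "$(git -C "$1" rev-parse --short HEAD)"
}
```

In a clone whose only branch is main, `count_commits <dir> master` fails because the ref is gone, while `count_commits <dir> HEAD` and `head_pkgver <dir>` keep working.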

FabioLolix commented on 2021-09-10 20:55 (UTC)

This PKGBUILD, like all your other VCS PKGBUILDs, doesn't have a proper pkgver(); see https://wiki.archlinux.org/title/VCS_package_guidelines#Git

stativ commented on 2017-03-29 16:39 (UTC)

iamdrrot: thank you, I added the dependency.

iamdrrot commented on 2017-03-29 00:33 (UTC)

Ran into an issue building on a fresh system: autoconf-archive is a build dependency.

stativ commented on 2015-07-02 20:32 (UTC)

tesseract-svn merged into tesseract-git

hak8or commented on 2015-05-31 01:00 (UTC)

For anyone here having trouble compiling this, specifically when using it with the Tesseract-OCR ruby gem: it's because there were changes in the svn repo which break things. https://code.google.com/p/python-tesseract/issues/detail?id=63 "Don't use the svn version right now. zdenko makes some changes in the svn version which is a bit tricky to wrap." So I assume we can only wait.

misc commented on 2012-06-30 15:01 (UTC)

build and package need to be updated according to https://code.google.com/p/tesseract-ocr/wiki/TesseractSvnInstallation . Relevant changes below.

build:
  ./autogen.sh
  ./configure --prefix=/usr
  make

package:
  {*.traineddata,*cube.*}

Note that 3.02 seems to explode the size: it grew threefold!

stativ commented on 2011-11-17 12:18 (UTC)

Thanks, fixed. I've also added installation of trained data, which was missing in the previous PKGBUILD.