From patchwork Mon Feb 8 06:42:52 2021 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 8bit X-Patchwork-Submitter: Andy Tai X-Patchwork-Id: 26950 Return-Path: X-Original-To: patchwork@mira.cbaines.net Delivered-To: patchwork@mira.cbaines.net Received: by mira.cbaines.net (Postfix, from userid 113) id A8A6B27BC21; Mon, 8 Feb 2021 11:58:48 +0000 (GMT) X-Spam-Checker-Version: SpamAssassin 3.4.2 (2018-09-13) on mira.cbaines.net X-Spam-Level: X-Spam-Status: No, score=-2.8 required=5.0 tests=BAYES_00,DKIM_SIGNED, MAILING_LIST_MULTI,RCVD_IN_MSPIKE_H4,RCVD_IN_MSPIKE_WL,SPF_HELO_PASS, T_DKIM_INVALID,URIBL_BLOCKED autolearn=unavailable autolearn_force=no version=3.4.2 Received: from lists.gnu.org (lists.gnu.org [209.51.188.17]) by mira.cbaines.net (Postfix) with ESMTPS id 1397C27BC1E for ; Mon, 8 Feb 2021 11:58:48 +0000 (GMT) Received: from localhost ([::1]:50970 helo=lists1p.gnu.org) by lists.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1l95Bn-0000L3-2y for patchwork@mira.cbaines.net; Mon, 08 Feb 2021 06:58:47 -0500 Received: from eggs.gnu.org ([2001:470:142:3::10]:40862) by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1l90ID-0003td-Jh for guix-patches@gnu.org; Mon, 08 Feb 2021 01:45:12 -0500 Received: from debbugs.gnu.org ([209.51.188.43]:37969) by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.90_1) (envelope-from ) id 1l90I9-0001NO-Ow for guix-patches@gnu.org; Mon, 08 Feb 2021 01:45:04 -0500 Received: from Debian-debbugs by debbugs.gnu.org with local (Exim 4.84_2) (envelope-from ) id 1l90I9-0007Gy-MJ for guix-patches@gnu.org; Mon, 08 Feb 2021 01:45:01 -0500 X-Loop: help-debbugs@gnu.org Subject: [bug#46376] [PATCH] gnu: tesseract-ocr: update to 4.1.1 Resent-From: Andy Tai Original-Sender: "Debbugs-submit" Resent-CC: guix-patches@gnu.org Resent-Date: Mon, 08 Feb 2021 06:45:01 +0000 Resent-Message-ID: Resent-Sender: help-debbugs@gnu.org X-GNU-PR-Message: report 46376 X-GNU-PR-Package: guix-patches X-GNU-PR-Keywords: patch To: 46376@debbugs.gnu.org X-Debbugs-Original-To: guix-patches@gnu.org Received: via spool by submit@debbugs.gnu.org id=B.161276666727875 (code B ref -1); Mon, 08 Feb 2021 06:45:01 +0000 Received: (at submit) by debbugs.gnu.org; 8 Feb 2021 06:44:27 +0000 Received: from localhost ([127.0.0.1]:49515 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1l90Ha-0007FW-QH for submit@debbugs.gnu.org; Mon, 08 Feb 2021 01:44:27 -0500 Received: from lists.gnu.org ([209.51.188.17]:51268) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1l90HZ-0007FP-K8 for submit@debbugs.gnu.org; Mon, 08 Feb 2021 01:44:26 -0500 Received: from eggs.gnu.org ([2001:470:142:3::10]:40432) by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1l90HS-0003ml-Qj for guix-patches@gnu.org; Mon, 08 Feb 2021 01:44:21 -0500 Received: from mail-il1-x130.google.com ([2607:f8b0:4864:20::130]:40785) by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.90_1) (envelope-from ) id 1l90H8-00010R-UP for guix-patches@gnu.org; Mon, 08 Feb 2021 01:44:07 -0500 Received: by mail-il1-x130.google.com with SMTP id e7so11744812ile.7 for ; Sun, 07 Feb 2021 22:43:29 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=atai-org.20150623.gappssmtp.com; s=20150623; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :content-transfer-encoding; bh=YBaQg5Rb1oa5RZoxpKsVKJl7LhpZAZ3ImD0H4m7Dqsg=; b=nTj7FMqZ7TDfSGZC3eCxPyKNiagj2mnxepwLfcD6GyCEeMpyY1GEw0d9i1OKgyZAra ex2G9Y7jLvH4ap2dKQZjzcyAmYTsN4DvDU+0bmdAMjlhi7YHF0UffsjCctkIUGXB7r47 unh69z8iffeo8M4I1Ff6aOEtu5T63NGAa3LYs92fBOK0KviC1duliuXixySWScaxdbHD lvQwuRytY28WAl+mFQJv2jzGswM+1Wag8LjhBMaF/uFo5/cJSwGJ4RBo90c0abRopu5z j2ooTB2dwjW3Wb+MENJfpY2L/3gWxmbun286Gk1UtidtfmMT8c/ey3AWNi/3/+6NZ2OK ThMg== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:content-transfer-encoding; bh=YBaQg5Rb1oa5RZoxpKsVKJl7LhpZAZ3ImD0H4m7Dqsg=; b=HWZTVx8wIEaDz/rPERIr8oXYMqwhK4owt6Fr5OvavEIGGXNoGt6ro4GOvQnO3xJIUg 9eUKAIxsewuHmoQB1onBDKT3D6pMefICogjLp9mwLPRgoEK8GlZ07Wf3wHDCTwftx0M+ c6g3mjPovoQIzireoWeBJPfmP6E3wbuTbdzu7ibfnWFPmzvYqUSc4QjcqAP67HUPKHp1 e9jNDqJK7TlWHapXzF+RVzkYhgQUtb5LpnAdk/9XT7/WTdgVw/vBm5em0/hkIp4ir1Np vTklqou5c+gj631wVl9N+JeAxkwwQ8NJ4nnWHZHMFh6mXpHEIz1WhkA5/if6aMLsjhEt GD8Q== X-Gm-Message-State: AOAM533uKI4UnzukK0bMhHtz+7sjPMarTxEicBENnak3Ujh376Qk8m7I 9cAP9uRB8lQpUOgzDt82gpTsUVFMdOg06m/JUPzpIBVTrCI= X-Google-Smtp-Source: ABdhPJxIG3Ce2YCwpkgFj9wBUBjq2MarIhgsPqDyCQAMY1O522Ps86iN0+pBr5d8SEyuuUkAN98P3E+bebMEMFRWP/I= X-Received: by 2002:a92:d7d0:: with SMTP id g16mr14215447ilq.105.1612766608381; Sun, 07 Feb 2021 22:43:28 -0800 (PST) MIME-Version: 1.0 References: In-Reply-To: From: Andy Tai Date: Sun, 7 Feb 2021 22:42:52 -0800 Message-ID: Received-SPF: pass client-ip=2607:f8b0:4864:20::130; envelope-from=lichengtai@gmail.com; helo=mail-il1-x130.google.com X-Spam_score_int: -13 X-Spam_score: -1.4 X-Spam_bar: - X-Spam_report: (-1.4 / 5.0 requ) BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, FREEMAIL_FORGED_FROMDOMAIN=0.248, FREEMAIL_FROM=0.001, HEADER_FROM_DIFFERENT_DOMAINS=0.248, RCVD_IN_DNSWL_NONE=-0.0001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001 autolearn=no autolearn_force=no X-Spam_action: no action X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list X-BeenThere: guix-patches@gnu.org List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: guix-patches-bounces+patchwork=mira.cbaines.net@gnu.org Sender: "Guix-patches" X-getmail-retrieved-from-mailbox: Patches * gnu/packages/ocr.scm (tesseract-ocr): update to 4.1.1 --- gnu/packages/ocr.scm | 38 +++++++++++++++++++++++++++++++------- 1 file changed, 31 insertions(+), 7 deletions(-) (let* ((commit "581faa8f6f15e4a7b21964be3a5ec36265c80e5b") -- 2.30.0 diff --git a/gnu/packages/ocr.scm b/gnu/packages/ocr.scm index dc4930918a..7fc3929684 100644 --- a/gnu/packages/ocr.scm +++ b/gnu/packages/ocr.scm @@ -3,6 +3,7 @@ ;;; Copyright © 2016, 2020 Efraim Flashner ;;; Copyright © 2019 Tobias Geerinckx-Rice ;;; Copyright © 2019 Alex Vong +;;; Copyright © 2021 Andy Tai ;;; ;;; This file is part of GNU Guix. ;;; @@ -26,7 +27,12 @@ #:use-module (guix git-download) #:use-module (guix build-system gnu) #:use-module (guix build-system python) + #:use-module (gnu packages autotools) + #:use-module (gnu packages check) #:use-module (gnu packages compression) + #:use-module (gnu packages gtk) + #:use-module (gnu packages icu4c) + #:use-module (gnu packages pkg-config) #:use-module (gnu packages python) #:use-module (gnu packages image)) @@ -52,25 +58,43 @@ it produces text in 8-bit or UTF-8 formats.") (license license:gpl3+))) (define-public tesseract-ocr + ;; some useful commits beyond last official stable release in release branch + (let ((commit "97079fa353557af6df86fd20b5d2e0dff5d8d5df")) (package (name "tesseract-ocr") - (version "3.04.01") + (version "4.1.1") (source (origin (method git-fetch) (uri (git-reference (url "https://github.com/tesseract-ocr/tesseract") - (commit version))) - (file-name (git-file-name name version)) + (commit commit) + ;; Fetch git submodules otherwise typeshed is not fetched. + ;; Typeshed is a collection of Python sources type annotation + ;; (data) files. + (recursive? #t))) + (file-name (string-append name "-" version "-" commit "-checkout")) (sha256 - (base32 "0h1x4z1h86n2gwknd0wck6gykkp99bmm02lg4a47a698g4az6ybv")))) + (base32 "0axwla82fpzp86lc553wp3hk0fz5dylw4as0jbf4hkqcyajlbzp4")))) (build-system gnu-build-system) (inputs - `(("leptonica" ,leptonica))) + `( ("cairo" ,cairo) + ("icu" ,icu4c) + ("leptonica" ,leptonica) + ("pango" ,pango))) + (native-inputs + `(("autoconf" ,autoconf) + ("autoconf-archive" ,autoconf-archive) + ("automake" ,automake) + ("googletest" ,googletest) + ("libtool" ,libtool) + ("pkg-config" ,pkg-config))) (arguments '(#:configure-flags (let ((leptonica (assoc-ref %build-inputs "leptonica"))) - (list (string-append "LIBLEPT_HEADERSDIR=" leptonica "/include"))))) + (list (string-append "LIBLEPT_HEADERSDIR=" leptonica "/include"))) + ;; some test, applybox_test fails to build + #:tests? #f)) (home-page "https://github.com/tesseract-ocr/tesseract") (synopsis "Optical character recognition engine") (description @@ -79,7 +103,7 @@ high accuracy. It supports many languages, output text formatting, hOCR positional information and page layout analysis. Several image formats are supported through the Leptonica library. It can also detect whether text is monospaced or proportional.") - (license license:asl2.0))) + (license license:asl2.0)))) (define-public zinnia