From patchwork Mon Jul 4 19:42:02 2022 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: Julien Lepiller X-Patchwork-Id: 40513 Return-Path: X-Original-To: patchwork@mira.cbaines.net Delivered-To: patchwork@mira.cbaines.net Received: by mira.cbaines.net (Postfix, from userid 113) id 1CF3727BBEA; Mon, 4 Jul 2022 20:43:16 +0100 (BST) X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on mira.cbaines.net X-Spam-Level: X-Spam-Status: No, score=-2.7 required=5.0 tests=BAYES_00,DKIM_INVALID, DKIM_SIGNED,MAILING_LIST_MULTI,SPF_HELO_PASS,URIBL_BLOCKED autolearn=unavailable autolearn_force=no version=3.4.6 Received: from lists.gnu.org (lists.gnu.org [209.51.188.17]) by mira.cbaines.net (Postfix) with ESMTPS id C268027BBE9 for ; Mon, 4 Jul 2022 20:43:15 +0100 (BST) Received: from localhost ([::1]:42494 helo=lists1p.gnu.org) by lists.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1o8RyU-0000Uu-TH for patchwork@mira.cbaines.net; Mon, 04 Jul 2022 15:43:14 -0400 Received: from eggs.gnu.org ([2001:470:142:3::10]:44130) by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1o8RyI-0000O6-RA for guix-patches@gnu.org; Mon, 04 Jul 2022 15:43:02 -0400 Received: from debbugs.gnu.org ([209.51.188.43]:54794) by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.90_1) (envelope-from ) id 1o8RyI-0007PE-IZ for guix-patches@gnu.org; Mon, 04 Jul 2022 15:43:02 -0400 Received: from Debian-debbugs by debbugs.gnu.org with local (Exim 4.84_2) (envelope-from ) id 1o8RyI-0007s6-Gp for guix-patches@gnu.org; Mon, 04 Jul 2022 15:43:02 -0400 X-Loop: help-debbugs@gnu.org Subject: [bug#56386] [PATCH 3/3] gnu: Add mecab-unidic. Resent-From: Julien Lepiller Original-Sender: "Debbugs-submit" Resent-CC: guix-patches@gnu.org Resent-Date: Mon, 04 Jul 2022 19:43:02 +0000 Resent-Message-ID: Resent-Sender: help-debbugs@gnu.org X-GNU-PR-Message: followup 56386 X-GNU-PR-Package: guix-patches X-GNU-PR-Keywords: patch To: 56386@debbugs.gnu.org Received: via spool by 56386-submit@debbugs.gnu.org id=B56386.165696375130204 (code B ref 56386); Mon, 04 Jul 2022 19:43:02 +0000 Received: (at 56386) by debbugs.gnu.org; 4 Jul 2022 19:42:31 +0000 Received: from localhost ([127.0.0.1]:48689 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1o8Rxm-0007r0-Lp for submit@debbugs.gnu.org; Mon, 04 Jul 2022 15:42:30 -0400 Received: from lepiller.eu ([89.234.186.109]:42874) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1o8Rxe-0007qQ-QS for 56386@debbugs.gnu.org; Mon, 04 Jul 2022 15:42:23 -0400 Received: from lepiller.eu (localhost [127.0.0.1]) by lepiller.eu (OpenSMTPD) with ESMTP id e4284eb2 for <56386@debbugs.gnu.org>; Mon, 4 Jul 2022 19:42:15 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed; d=lepiller.eu; h=from:to :subject:date:message-id:in-reply-to:references:mime-version :content-transfer-encoding; s=dkim; bh=ozRQurRJvGKaK9p9OKjiDK2TW oWOj/ixDSOtLZyJqWs=; b=Y4KD1mYnG0jMKuHqGoCmIOkhGBnCM0K+GQZE5K4ua 4kQiq59eWi7y/gjRZU2YKUagThexZgNvtrZceMm1nvFUMXG3DDldcyR1KpjqRnRk iIWfG2no15Sq0kz9NtBJ/wgAGvOeqiWMaCEVnLYtjGREEu8tcgEFBCzzSsTO6TRr 7eoTjzEBYwXKpsr/raCmft12QOxOB5XbQjcIFO11eQOBwrqjiA5av5U7J9Tb0xkZ baG1f3oAcUcNWyny50Ijp13NYjZFDYazdqvpPj6uP8j8I9IprepXKY78CQOyQKdP /lgRjCN2fnt8ElbRbZ/kY/DXo91wjDBImd1jjtFG0eQBA== Received: by lepiller.eu (OpenSMTPD) with ESMTPSA id 6009996d (TLSv1.3:AEAD-AES256-GCM-SHA384:256:NO) for <56386@debbugs.gnu.org>; Mon, 4 Jul 2022 19:42:14 +0000 (UTC) From: Julien Lepiller Date: Mon, 4 Jul 2022 21:42:02 +0200 Message-Id: <20220704194202.30958-3-julien@lepiller.eu> X-Mailer: git-send-email 2.36.1 In-Reply-To: <20220704194202.30958-1-julien@lepiller.eu> References: <20220704194202.30958-1-julien@lepiller.eu> MIME-Version: 1.0 X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list X-BeenThere: guix-patches@gnu.org List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: guix-patches-bounces+patchwork=mira.cbaines.net@gnu.org Sender: "Guix-patches" X-getmail-retrieved-from-mailbox: Patches * gnu/packages/language.scm (mecab-unidic): New variable. --- gnu/packages/language.scm | 26 ++++++++++++++++++++++++++ 1 file changed, 26 insertions(+) diff --git a/gnu/packages/language.scm b/gnu/packages/language.scm index 63654c544b..f97b982cb9 100644 --- a/gnu/packages/language.scm +++ b/gnu/packages/language.scm @@ -27,6 +27,7 @@ (define-module (gnu packages language) #:use-module (gnu packages autotools) #:use-module (gnu packages audio) #:use-module (gnu packages base) + #:use-module (gnu packages compression) #:use-module (gnu packages docbook) #:use-module (gnu packages emacs) #:use-module (gnu packages freedesktop) @@ -57,6 +58,7 @@ (define-module (gnu packages language) #:use-module (gnu packages xorg) #:use-module (guix packages) #:use-module (guix build-system cmake) + #:use-module (guix build-system copy) #:use-module (guix build-system glib-or-gtk) #:use-module (guix build-system gnu) #:use-module (guix build-system perl) @@ -997,3 +999,27 @@ (define-public mecab-ipadic (description "This package contains dictionnary data derived from ipadic for use with MeCab.") (license (license:non-copyleft "mecab-ipadic/COPYING")))) + +(define-public mecab-unidic + (package + (name "mecab-unidic") + (version "3.1.0") + (source (origin + (method url-fetch) + (uri (string-append "https://clrd.ninjal.ac.jp/unidic_archive/cwj/" + version "/unidic-cwj-" version ".zip")) + (sha256 + (base32 + "1z132p2q3bgchiw529j2d7dari21kn0fhkgrj3vcl0ncg2m521il")))) + (build-system copy-build-system) + (arguments + `(#:install-plan + '(("." "lib/mecab/dic" + #:include-regexp ("\\.bin$" "\\.def$" "\\.dic$" "dicrc"))))) + (native-inputs (list unzip)) + (home-page "https://clrd.ninjal.ac.jp/unidic/en/") + (synopsis "Dictionary data for MeCab") + (description "UniDic for morphological analysis is a dictionary for +analysis with the morphological analyser MeCab, where the short units exported +from the database are used as entries (heading terms).") + (license (list license:gpl2+ license:lgpl2.1 license:bsd-3))))