From patchwork Fri Apr 26 11:42:00 2024 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: Julian Flake X-Patchwork-Id: 63406 Return-Path: X-Original-To: patchwork@mira.cbaines.net Delivered-To: patchwork@mira.cbaines.net Received: by mira.cbaines.net (Postfix, from userid 113) id A4E3D27BBEA; Fri, 26 Apr 2024 12:48:09 +0100 (BST) X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on mira.cbaines.net X-Spam-Level: X-Spam-Status: No, score=-2.7 required=5.0 tests=BAYES_00,DKIM_INVALID, DKIM_SIGNED,MAILING_LIST_MULTI,SPF_HELO_PASS,URIBL_BLOCKED autolearn=ham autolearn_force=no version=3.4.6 Received: from lists.gnu.org (lists.gnu.org [209.51.188.17]) by mira.cbaines.net (Postfix) with ESMTPS id DBAC227BBE2 for ; Fri, 26 Apr 2024 12:48:08 +0100 (BST) Received: from localhost ([::1] helo=lists1p.gnu.org) by lists.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1s0K3I-0004y1-Lr; Fri, 26 Apr 2024 07:47:40 -0400 Received: from eggs.gnu.org ([2001:470:142:3::10]) by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1s0K3G-0004wn-BP for guix-patches@gnu.org; Fri, 26 Apr 2024 07:47:38 -0400 Received: from debbugs.gnu.org ([2001:470:142:5::43]) by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.90_1) (envelope-from ) id 1s0K3G-00048e-2f for guix-patches@gnu.org; Fri, 26 Apr 2024 07:47:38 -0400 Received: from Debian-debbugs by debbugs.gnu.org with local (Exim 4.84_2) (envelope-from ) id 1s0K3X-0008VO-G5 for guix-patches@gnu.org; Fri, 26 Apr 2024 07:47:55 -0400 X-Loop: help-debbugs@gnu.org Subject: [bug#70222] [PATCH v2 06/14] gnu: Add perl-web-scraper. Resent-From: Julian Flake Original-Sender: "Debbugs-submit" Resent-CC: guix-patches@gnu.org Resent-Date: Fri, 26 Apr 2024 11:47:54 +0000 Resent-Message-ID: Resent-Sender: help-debbugs@gnu.org X-GNU-PR-Message: followup 70222 X-GNU-PR-Package: guix-patches X-GNU-PR-Keywords: patch To: 70222@debbugs.gnu.org Cc: steve@futurile.net X-Debbugs-Original-Cc: Steve George , guix-patches@gnu.org Received: via spool by 70222-submit@debbugs.gnu.org id=B70222.171413205332378 (code B ref 70222); Fri, 26 Apr 2024 11:47:54 +0000 Received: (at 70222) by debbugs.gnu.org; 26 Apr 2024 11:47:33 +0000 Received: from localhost ([127.0.0.1]:34176 helo=debbugs.gnu.org) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1s0K36-0008PN-2k for submit@debbugs.gnu.org; Fri, 26 Apr 2024 07:47:31 -0400 Received: from nsmtp.uni-koblenz.de ([141.26.64.14]:43905) by debbugs.gnu.org with esmtp (Exim 4.84_2) (envelope-from ) id 1s0JzI-0007YK-0n for 70222@debbugs.gnu.org; Fri, 26 Apr 2024 07:43:35 -0400 Received: from localhost (localhost [127.0.0.1]) by nsmtp.uni-koblenz.de (Postfix) with ESMTP id 5342A1A0038; Fri, 26 Apr 2024 13:42:40 +0200 (CEST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=uni-koblenz.de; h=content-transfer-encoding:mime-version:references:in-reply-to :x-mailer:message-id:date:date:subject:subject:from:from :received:received:received; s=mail; t=1714131760; x=1715946161; bh=srjvuxSqSr7yLeHnnc0Fwd0NGUEK46f2KnsO9Rzmzd0=; b=L0EgFOPSkbcB sdXg+Ie++v+JIMZ4fDPgZnFlQXRwN1ifEsfIzfHyY8pw1IWIjnfY6QbxDb1AdOeY 2iX6ODIRfP7vyDfyZNEuJWf7IpJ5sEH5z8gDwarTQefZChef1EbDMWP7vmi2byIg c+TRwgFg448GeI63Wjd5PARz+OT4nro= X-Virus-Scanned: Debian amavisd-new at uni-koblenz.de Received: from nsmtp.uni-koblenz.de ([127.0.0.1]) by localhost (nsmtp.uni-koblenz.de [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id 5X9DGgGcHl6f; Fri, 26 Apr 2024 13:42:40 +0200 (CEST) Received: from mailproxy.uni-koblenz.de (mailproxy.uni-koblenz.de [141.26.64.230]) by nsmtp.uni-koblenz.de (Postfix) with ESMTPS; Fri, 26 Apr 2024 13:42:40 +0200 (CEST) Received: from nutbook.fritz.box (unknown [IPv6:2001:9e8:5ee9:af00:82e3:bead:ab1a:79e1]) by mailproxy.uni-koblenz.de (Postfix) with ESMTPSA id 23F5910052F; Fri, 26 Apr 2024 13:42:40 +0200 (CEST) From: Julian Flake Date: Fri, 26 Apr 2024 13:42:00 +0200 Message-ID: <4de823a7220e9efde67cc1db70d1bbf8e7549758.1714130775.git.flake@uni-koblenz.de> X-Mailer: git-send-email 2.41.0 In-Reply-To: References: MIME-Version: 1.0 X-BeenThere: debbugs-submit@debbugs.gnu.org X-Mailman-Version: 2.1.18 Precedence: list X-BeenThere: guix-patches@gnu.org List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: guix-patches-bounces+patchwork=mira.cbaines.net@gnu.org Sender: guix-patches-bounces+patchwork=mira.cbaines.net@gnu.org X-getmail-retrieved-from-mailbox: Patches From: Steve George * gnu/packages/web.scm (perl-web-scraper): New variable. Change-Id: I3d0a476ca6c09bc20391303f99b260410f16de65 --- gnu/packages/web.scm | 32 ++++++++++++++++++++++++++++++++ 1 file changed, 32 insertions(+) diff --git a/gnu/packages/web.scm b/gnu/packages/web.scm index 28ca51416a..de8847fac0 100644 --- a/gnu/packages/web.scm +++ b/gnu/packages/web.scm @@ -4778,6 +4778,38 @@ (define-public perl-uri-ws methods for WebSocket URIs as it does for HTTP URIs.") (license license:perl-license))) +(define-public perl-web-scraper + (package + (name "perl-web-scraper") + (version "0.38") + (source + (origin + (method url-fetch) + (uri (string-append "mirror://cpan/authors/id/M/MI/MIYAGAWA/Web-Scraper-" + version ".tar.gz")) + (sha256 + (base32 "1gs3fmbc83j34c0sig1hkpnm26ngnyi5kgq5dl8vxvkzimgnwnzr")))) + (build-system perl-build-system) + (native-inputs (list perl-module-build-tiny + perl-test-base + perl-test-requires)) + (propagated-inputs (list perl-html-parser + perl-html-selector-xpath + perl-html-tagset + perl-html-tree + perl-html-treebuilder-xpath + perl-libwww + perl-universal-require + perl-uri + perl-xml-xpathengine + perl-yaml)) + (home-page "https://metacpan.org/release/Web-Scraper") + (synopsis + "Web Scraping toolkit using HTML and CSS Selectors or XPath expressions") + (description "Perl module @code{Web::Scraper} is a toolkit for +traversing and scraping sites, inspired by Ruby's Scapi.") + (license license:perl-license))) + (define-public perl-uri-template (package (name "perl-uri-template")