From patchwork Fri Apr 5 20:10:12 2024
Content-Type: text/plain; charset="utf-8"
MIME-Version: 1.0
Content-Transfer-Encoding: 7bit
X-Patchwork-Submitter: Steve George
X-Patchwork-Id: 62742
Subject: [bug#70222] [PATCH 05/13] gnu: Add perl-web-scraper.
From: Steve George
To: 70222@debbugs.gnu.org
Cc: Steve George
Date: Fri, 5 Apr 2024 21:10:12 +0100
X-Mailer: git-send-email 2.41.0

* gnu/packages/web.scm (perl-web-scraper): New variable.

Change-Id: I3d0a476ca6c09bc20391303f99b260410f16de65
---
 gnu/packages/web.scm | 32 ++++++++++++++++++++++++++++++++
 1 file changed, 32 insertions(+)

diff --git a/gnu/packages/web.scm b/gnu/packages/web.scm
index ed7ccd6640d..9ad23a26dda 100644
--- a/gnu/packages/web.scm
+++ b/gnu/packages/web.scm
@@ -4775,6 +4775,38 @@ (define-public perl-uri-ws
 methods for WebSocket URIs as it does for HTTP URIs.")
     (license license:perl-license)))
 
+(define-public perl-web-scraper
+  (package
+    (name "perl-web-scraper")
+    (version "0.38")
+    (source
+     (origin
+       (method url-fetch)
+       (uri (string-append "mirror://cpan/authors/id/M/MI/MIYAGAWA/Web-Scraper-"
+                           version ".tar.gz"))
+       (sha256
+        (base32 "1gs3fmbc83j34c0sig1hkpnm26ngnyi5kgq5dl8vxvkzimgnwnzr"))))
+    (build-system perl-build-system)
+    (native-inputs (list perl-module-build-tiny
+                         perl-test-base
+                         perl-test-requires))
+    (propagated-inputs (list perl-html-parser
+                             perl-html-selector-xpath
+                             perl-html-tagset
+                             perl-html-tree
+                             perl-html-treebuilder-xpath
+                             perl-libwww
+                             perl-universal-require
+                             perl-uri
+                             perl-xml-xpathengine
+                             perl-yaml))
+    (home-page "https://metacpan.org/release/Web-Scraper")
+    (synopsis
+     "Web scraping toolkit using HTML and CSS selectors or XPath expressions")
+    (description "Perl module @code{Web::Scraper} is a toolkit for
+traversing and scraping sites, inspired by Ruby's scrAPI.")
+    (license license:perl-license)))
+
 (define-public perl-uri-template
   (package
     (name "perl-uri-template")