From patchwork Wed Apr 20 17:28:04 2022
Content-Type: text/plain; charset="utf-8"
MIME-Version: 1.0
Content-Transfer-Encoding: 7bit
X-Patchwork-Submitter: Felix Gruber
X-Patchwork-Id: 38709
Return-Path:
X-Original-To: patchwork@mira.cbaines.net
Delivered-To: patchwork@mira.cbaines.net
Received: by mira.cbaines.net (Postfix, from userid 113)
 id 2FB9427BBEA; Wed, 20 Apr 2022 18:29:55 +0100 (BST)
X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on mira.cbaines.net
X-Spam-Level:
X-Spam-Status: No, score=-2.7 required=5.0 tests=BAYES_00,DKIM_INVALID,
 DKIM_SIGNED,MAILING_LIST_MULTI,SPF_HELO_PASS,URIBL_BLOCKED
 autolearn=unavailable autolearn_force=no version=3.4.6
Received: from lists.gnu.org (lists.gnu.org [209.51.188.17])
 by mira.cbaines.net (Postfix) with ESMTPS id E112F27BBE9
 for ; Wed, 20 Apr 2022 18:29:54 +0100 (BST)
Received: from localhost ([::1]:37208 helo=lists1p.gnu.org)
 by lists.gnu.org with esmtp (Exim 4.90_1)
 (envelope-from )
 id 1nhE9K-0000YA-2v
 for patchwork@mira.cbaines.net; Wed, 20 Apr 2022 13:29:54 -0400
Received: from eggs.gnu.org ([2001:470:142:3::10]:43674)
 by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256)
 (Exim 4.90_1)
 (envelope-from )
 id 1nhE8Y-0007Us-7b
 for guix-patches@gnu.org; Wed, 20 Apr 2022 13:29:06 -0400
Received: from debbugs.gnu.org ([209.51.188.43]:53470)
 by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_128_GCM_SHA256:128)
 (Exim 4.90_1)
 (envelope-from )
 id 1nhE8X-0003DN-Tr
 for guix-patches@gnu.org; Wed, 20 Apr 2022 13:29:05 -0400
Received: from Debian-debbugs by debbugs.gnu.org with local (Exim 4.84_2)
 (envelope-from )
 id 1nhE8X-0006K7-R4
 for guix-patches@gnu.org; Wed, 20 Apr 2022 13:29:05 -0400
X-Loop: help-debbugs@gnu.org
Subject: [bug#55044] [PATCH 8/8] gnu: Add python-scrapy.
Resent-From: Felix Gruber
Original-Sender: "Debbugs-submit"
Resent-CC: guix-patches@gnu.org
Resent-Date: Wed, 20 Apr 2022 17:29:05 +0000
Resent-Message-ID:
Resent-Sender: help-debbugs@gnu.org
X-GNU-PR-Message: followup 55044
X-GNU-PR-Package: guix-patches
X-GNU-PR-Keywords: patch
To: 55044@debbugs.gnu.org
Cc: Felix Gruber
Received: via spool by 55044-submit@debbugs.gnu.org id=B55044.165047572224203
 (code B ref 55044); Wed, 20 Apr 2022 17:29:05 +0000
Received: (at 55044) by debbugs.gnu.org; 20 Apr 2022 17:28:42 +0000
Received: from localhost ([127.0.0.1]:47355 helo=debbugs.gnu.org)
 by debbugs.gnu.org with esmtp (Exim 4.84_2)
 (envelope-from )
 id 1nhE89-0006IJ-OY
 for submit@debbugs.gnu.org; Wed, 20 Apr 2022 13:28:42 -0400
Received: from mout01.posteo.de ([185.67.36.65]:41735)
 by debbugs.gnu.org with esmtp (Exim 4.84_2)
 (envelope-from )
 id 1nhE87-0006HY-W5
 for 55044@debbugs.gnu.org; Wed, 20 Apr 2022 13:28:40 -0400
Received: from submission (posteo.de [185.67.36.169])
 by mout01.posteo.de (Postfix) with ESMTPS id 6B30A240027
 for <55044@debbugs.gnu.org>; Wed, 20 Apr 2022 19:28:34 +0200 (CEST)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=posteo.net; s=2017;
 t=1650475714; bh=zjhysv4v/owNK54ZfGUBLigHSWAnWhyJZF6zanXhN78=;
 h=From:To:Cc:Subject:Date:From;
 b=GPA3lKHGz4/vphNIbjyhIpTNlLCsIbDWLcG+O7ENdPr+Bm//szgc/gpGlsFkFY40A
 eB8I3x5WKPJYGgvo5Uts1WiuFbPfxOTVVWm7v4ZV7Z5bgDG6HPihEcmJw7vQFd+vUe
 Smn/LsX8OFov7Wv6EpNPtaavjvw+RNfuNPR4WeLuyrKLj6sh0s7nodJnNeOZLbKnGm
 tlV45d87eif86TOsybE5rrSxGm70gn26vagZ6l9J11gFk/prZNm8ioXL44DqD8MOve
 SS6kOPJIgcHrV+hud7Hcp+n5S1yXLtm4pFQQ6TYe3PwYRwAZ7RXCcM1J/IfzLmhSRx
 3HPZ+5QvSwKWA==
Received: from customer (localhost [127.0.0.1])
 by submission (posteo.de) with ESMTPSA id 4Kk71Y6sg3z6tnh;
 Wed, 20 Apr 2022 19:28:33 +0200 (CEST)
From: Felix Gruber
Date: Wed, 20 Apr 2022 17:28:04 +0000
Message-Id: <20220420172804.8849-8-felgru@posteo.net>
In-Reply-To: <20220420172518.8609-1-felgru@posteo.net>
References: <20220420172518.8609-1-felgru@posteo.net>
MIME-Version: 1.0
X-BeenThere: debbugs-submit@debbugs.gnu.org
X-Mailman-Version: 2.1.18
Precedence: list
X-BeenThere: guix-patches@gnu.org
List-Id:
List-Unsubscribe: ,
List-Archive:
List-Post:
List-Help:
List-Subscribe: ,
Errors-To: guix-patches-bounces+patchwork=mira.cbaines.net@gnu.org
Sender: "Guix-patches"
X-getmail-retrieved-from-mailbox: Patches

* gnu/packages/python-web.scm (python-scrapy): New variable.
---
 gnu/packages/python-web.scm | 60 +++++++++++++++++++++++++++++++++++++
 1 file changed, 60 insertions(+)

diff --git a/gnu/packages/python-web.scm b/gnu/packages/python-web.scm
index da3f9cf980..f4ff4f494c 100644
--- a/gnu/packages/python-web.scm
+++ b/gnu/packages/python-web.scm
@@ -6519,3 +6519,63 @@ by asyncio.")
 HTML and XML using XPath and CSS selectors, optionally combined with
 regular expressions.")
     (license license:bsd-3)))
+
+(define-public python-scrapy
+  (package
+    (name "python-scrapy")
+    (version "2.6.1")
+    (source
+     (origin
+       (method url-fetch)
+       (uri (pypi-uri "Scrapy" version))
+       (sha256
+        (base32 "09rqalbwcz9ix8h0992mzjs50sssxsmmh8w9abkrqchgknjmbzan"))))
+    (build-system python-build-system)
+    (arguments
+     `(#:phases
+       (modify-phases %standard-phases
+         (replace 'check
+           (lambda* (#:key tests? #:allow-other-keys)
+             (when tests?
+               (invoke "pytest"
+                       ;; requires network access
+                       "--ignore" "tests/test_command_check.py"
+                       "-k"
+                       (string-append
+                        ;; Failing for unknown reasons
+                        "not test_server_set_cookie_domain_suffix_public_private"
+                        " and not test_user_set_cookie_domain_suffix_public_private"
+                        " and not test_pformat")
+                       "tests")))))))
+    (propagated-inputs
+     (list python-botocore ; Optional: For S3FeedStorage class.
+           python-cryptography
+           python-cssselect
+           python-itemadapter
+           python-itemloaders
+           python-lxml
+           python-parsel
+           python-protego
+           python-pydispatcher
+           python-pyopenssl
+           python-queuelib
+           python-service-identity
+           python-setuptools
+           python-tldextract
+           python-twisted
+           python-w3lib
+           python-zope-interface))
+    (native-inputs
+     (list python-pytest
+           python-pyftpdlib
+           python-sybil
+           python-testfixtures
+           python-uvloop
+           ))
+    (home-page "https://scrapy.org")
+    (synopsis "A high-level Web Crawling and Web Scraping framework")
+    (description "Scrapy is a fast high-level web crawling and web
+scraping framework, used to crawl websites and extract structured data
+from their pages. It can be used for a wide range of purposes, from data
+mining to monitoring and automated testing.")
+    (license license:bsd-3)))