From patchwork Sun Jun 20 01:07:22 2021
Content-Type: text/plain; charset="utf-8"
MIME-Version: 1.0
Content-Transfer-Encoding: 7bit
X-Patchwork-Submitter: slg <0x2d@disroot.org>
X-Patchwork-Id: 30522
Return-Path:
X-Original-To: patchwork@mira.cbaines.net
Delivered-To: patchwork@mira.cbaines.net
Received: by mira.cbaines.net (Postfix, from userid 113)
 id 9A31A27BC78; Sun, 20 Jun 2021 02:10:28 +0100 (BST)
X-Spam-Checker-Version: SpamAssassin 3.4.2 (2018-09-13) on mira.cbaines.net
X-Spam-Level:
X-Spam-Status: No, score=-2.8 required=5.0 tests=BAYES_00,DKIM_SIGNED,
 MAILING_LIST_MULTI,RCVD_IN_MSPIKE_H2,SPF_HELO_PASS,T_DKIM_INVALID,
 URIBL_BLOCKED autolearn=unavailable autolearn_force=no version=3.4.2
Received: from lists.gnu.org (lists.gnu.org [209.51.188.17])
 by mira.cbaines.net (Postfix) with ESMTPS id 0612127BC94
 for ; Sun, 20 Jun 2021 02:10:24 +0100 (BST)
Received: from localhost ([::1]:51766 helo=lists1p.gnu.org)
 by lists.gnu.org with esmtp (Exim 4.90_1)
 (envelope-from )
 id 1lulyh-0000im-3H
 for patchwork@mira.cbaines.net; Sat, 19 Jun 2021 21:10:23 -0400
Received: from eggs.gnu.org ([2001:470:142:3::10]:52074)
 by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256)
 (Exim 4.90_1)
 (envelope-from )
 id 1lulyO-0008Uh-23
 for guix-patches@gnu.org; Sat, 19 Jun 2021 21:10:04 -0400
Received: from debbugs.gnu.org ([209.51.188.43]:49490)
 by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_128_GCM_SHA256:128)
 (Exim 4.90_1)
 (envelope-from )
 id 1lulyN-000115-Pq
 for guix-patches@gnu.org; Sat, 19 Jun 2021 21:10:03 -0400
Received: from Debian-debbugs by debbugs.gnu.org with local (Exim 4.84_2)
 (envelope-from )
 id 1lulyN-0004Ie-KS
 for guix-patches@gnu.org; Sat, 19 Jun 2021 21:10:03 -0400
X-Loop: help-debbugs@gnu.org
Subject: [bug#49123] [PATCH 04/24] gnu: Add python-html-sanitizer.
Resent-From: slg <0x2d@disroot.org>
Original-Sender: "Debbugs-submit"
Resent-CC: guix-patches@gnu.org
Resent-Date: Sun, 20 Jun 2021 01:10:03 +0000
Resent-Message-ID:
Resent-Sender: help-debbugs@gnu.org
X-GNU-PR-Message: followup 49123
X-GNU-PR-Package: guix-patches
X-GNU-PR-Keywords: patch
To: 49123@debbugs.gnu.org
Cc: slg <0x2d@disroot.org>
Received: via spool by 49123-submit@debbugs.gnu.org id=B49123.162415138916402
 (code B ref 49123); Sun, 20 Jun 2021 01:10:03 +0000
Received: (at 49123) by debbugs.gnu.org; 20 Jun 2021 01:09:49 +0000
Received: from localhost ([127.0.0.1]:32782 helo=debbugs.gnu.org)
 by debbugs.gnu.org with esmtp (Exim 4.84_2)
 (envelope-from )
 id 1luly9-0004GU-Lx
 for submit@debbugs.gnu.org; Sat, 19 Jun 2021 21:09:49 -0400
Received: from knopi.disroot.org ([178.21.23.139]:43986)
 by debbugs.gnu.org with esmtp (Exim 4.84_2)
 (envelope-from <0x2d@disroot.org>)
 id 1luly6-0004G8-Us
 for 49123@debbugs.gnu.org; Sat, 19 Jun 2021 21:09:47 -0400
Received: from localhost (localhost [127.0.0.1])
 by disroot.org (Postfix) with ESMTP id 5C3625C886;
 Sun, 20 Jun 2021 03:09:46 +0200 (CEST)
X-Virus-Scanned: Debian amavisd-new at disroot.org
Received: from knopi.disroot.org ([127.0.0.1])
 by localhost (disroot.org [127.0.0.1]) (amavisd-new, port 10024)
 with ESMTP id MHn-Z_b9-pq3; Sun, 20 Jun 2021 03:09:45 +0200 (CEST)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=disroot.org; s=mail;
 t=1624151384; bh=yPbrCwKQJztzrATwAHzdVotg9UZdM2GUZE8t1LIiIrs=;
 h=From:To:Cc:Subject:Date:In-Reply-To:References;
 b=gnlWRTTyYtlvgf3tzzJo3iGSqx55mc37j/gc3/acxVoOKEPjVQzEWzPrprTvxiJOZ
 7n5Ikm1kNnYieYmIqOEQhQsRX6GS47kztSz2y6pkDvoTn1vclA7BROJNT364OXPxdm
 j95caz1qs6O7tspKqr0UCfp7QUYdiCjdCAf/hq2sNr00OtzVILz153/nRHwEJkf3Bd
 9SMRtS0173PdRi83OXE1sfqb+wD+0+j+1u817PVZ0j6kuIJNRy0aanaVl2l5Zxareo
 Ao0WMD6/leS5fB1aXkWVlwGAJVYlx0CY3UCngGVw4GAreLSnuoRt1no+andtbnt9Yr
 w/i2FDjiokk2Q==
Date: Sat, 19 Jun 2021 22:07:22 -0300
Message-Id: <20210620010742.4259-4-0x2d@disroot.org>
In-Reply-To:
 <20210620010742.4259-1-0x2d@disroot.org>
References: <20210620010742.4259-1-0x2d@disroot.org>
Mime-Version: 1.0
X-BeenThere: debbugs-submit@debbugs.gnu.org
X-Mailman-Version: 2.1.18
Precedence: list
X-BeenThere: guix-patches@gnu.org
List-Id:
List-Unsubscribe: ,
List-Archive:
List-Post:
List-Help:
List-Subscribe: ,
Errors-To: guix-patches-bounces+patchwork=mira.cbaines.net@gnu.org
Sender: "Guix-patches"
Reply-to: slg <0x2d@disroot.org>
X-ACL-Warn: , slg via Guix-patches
X-Patchwork-Original-From: slg via Guix-patches via
From: slg <0x2d@disroot.org>
X-getmail-retrieved-from-mailbox: Patches

* gnu/packages/python-xyz.scm (python-html-sanitizer): New variable.
---
 gnu/packages/python-xyz.scm | 23 +++++++++++++++++++++++
 1 file changed, 23 insertions(+)

diff --git a/gnu/packages/python-xyz.scm b/gnu/packages/python-xyz.scm
index bb263a6f20..54c7d91116 100644
--- a/gnu/packages/python-xyz.scm
+++ b/gnu/packages/python-xyz.scm
@@ -25990,3 +25990,26 @@ is the cythonized version of @code{fractions.Fraction}.")
      "@code{pathvalidate} is a Python library to sanitize/validate strings
 representing paths or filenames.")
     (license license:expat)))
+
+(define-public python-html-sanitizer
+  (package
+    (name "python-html-sanitizer")
+    (version "1.9.1")
+    (source
+     (origin
+       (method url-fetch)
+       (uri (pypi-uri "html-sanitizer" version))
+       (sha256
+        (base32 "1mb9d80mar5y4ariqbmrxn5llcffh5qp3s3ywd05bxgg7jzaxh0k"))))
+    (build-system python-build-system)
+    (propagated-inputs
+     `(("python-lxml" ,python-lxml)
+       ("python-beautifulsoup4" ,python-beautifulsoup4)))
+    (home-page "https://github.com/matthiask/html-sanitizer")
+    (synopsis "Python library to sanitize HTML")
+    (description "This is an allowlist-based and very opinionated HTML sanitizer
+that can be used both for untrusted and trusted sources.  It attempts to clean
+up the mess made by various rich text editors and/or copy-pasting to make
+styling of webpages simpler and more consistent.  It builds on the excellent
+HTML cleaner in lxml to make the result both valid and safe.")
+    (license license:bsd-3)))