From patchwork Fri Apr 28 05:50:09 2023
Content-Type: text/plain; charset="utf-8"
MIME-Version: 1.0
Content-Transfer-Encoding: 7bit
X-Patchwork-Submitter: John Kehayias
X-Patchwork-Id: 49621
Subject: [bug#63139] [PATCH python 03/20] gnu: Add python-rapidfuzz.
To: 63139@debbugs.gnu.org
Cc: John Kehayias
Date: Fri, 28 Apr 2023 05:50:09 +0000
Message-ID: <885e568746f3611bc4f4f4f2f2b2ecb63482cb14.1682660104.git.john.kehayias@protonmail.com>
Reply-to: John Kehayias
From: John Kehayias

* gnu/packages/python-xyz.scm (python-rapidfuzz): New variable.
---
 gnu/packages/python-xyz.scm | 33 +++++++++++++++++++++++++++++++++
 1 file changed, 33 insertions(+)

-- 
2.39.2

diff --git a/gnu/packages/python-xyz.scm b/gnu/packages/python-xyz.scm
index 83beb1c8ef..4d1f291e68 100644
--- a/gnu/packages/python-xyz.scm
+++ b/gnu/packages/python-xyz.scm
@@ -17793,6 +17793,39 @@ (define-public python-pylev
 @url{http://en.wikipedia.org/wiki/Levenshtein_distance}.")
     (license license:bsd-3)))
 
+(define-public python-rapidfuzz
+  (package
+    (name "python-rapidfuzz")
+    (version "2.15.1")
+    (source (origin
+              (method url-fetch)
+              (uri (pypi-uri "rapidfuzz" version))
+              (sha256
+               (base32
+                "1xh0mkbhgnrwgwhrlnmypwwig3ww23fdffh0245akbiprb13f8fn"))))
+    (build-system python-build-system)
+    (native-inputs (list python-hypothesis python-numpy python-pytest
+                         python-scikit-build))
+    (home-page "https://github.com/maxbachmann/RapidFuzz")
+    (synopsis "Rapid fuzzy string matching using various string metrics")
+    (description
+     "RapidFuzz is a fast string matching library for Python and
+C++, which is using the string similarity calculations from FuzzyWuzzy.
+However there are a couple of aspects that set RapidFuzz apart from
+FuzzyWuzzy:
+@itemize
+@item It is MIT licensed so it can be used whichever License you might want to
+choose for your project, while you're forced to adopt the GPL license when
+using FuzzyWuzzy.
+@item It provides many @code{string_metrics} like @code{hamming} or
+@code{jaro_winkler}, which are not included in FuzzyWuzzy.
+@item It is mostly written in C++ and on top of this comes with a lot of
+algorithmic improvements to make string matching even faster, while still
+providing the same results. For detailed benchmarks check the documentation.
+@item Fixes multiple bugs in the partial_ratio implementation.
+@end itemize")
+    (license license:expat)))
+
 (define-public python-cleo
   (package
     (name "python-cleo")