nlp-dedup

Remove duplicates and near-duplicates from text corpora, no matter the scale.

Installation

In a virtualenv (see these instructions if you need to create one):

pip3 install nlp-dedup
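
To confirm that the install landed in the active virtualenv, a short check along these lines may help. It is a minimal sketch that relies only on the standard-library `importlib.metadata` module (Python 3.8 or later) and makes no assumptions about the nlp-dedup API itself. The helper name `installed_version` is hypothetical and introduced here for illustration only.

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Optional


def installed_version(dist_name: str) -> Optional[str]:
    """Return the installed version of a distribution, or None if it is absent."""
    try:
        return version(dist_name)
    except PackageNotFoundError:
        # The distribution is not installed in the current environment.
        return None


if __name__ == "__main__":
    # "nlp-dedup" is the distribution name used in the pip3 command above.
    found = installed_version("nlp-dedup")
    if found is None:
        print("nlp-dedup is not installed in this environment")
    else:
        print(f"nlp-dedup {found} is installed")
```

Run the script with the virtualenv's own interpreter, so that the lookup inspects the same environment pip3 installed into.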

Releases

Version  Released    Buster (Python 3.7)  Bullseye (Python 3.9)  Bookworm (Python 3.11)  Files
0.1.2    2023-10-07
0.1.1    2022-12-20
0.1.0    2022-12-20

Page last updated 2023-10-28 01:05:54 UTC