nlp-dedup

Remove duplicates and near-duplicates from text corpora, no matter the scale.

Installation

In a virtualenv (see these instructions if you need to create one):

pip3 install nlp-dedup

Releases

Version Released Bullseye
Python 3.9
Bookworm
Python 3.11
Files
0.1.2 2023-10-07    
0.1.1 2022-12-20  
0.1.0 2022-12-20  

Issues with this package?

Page last updated 2025-06-27 19:57:08 UTC