lm-datasets

A collection of datasets for language model training, including scripts for downloading, preprocessing, and sampling.

Installation

In a virtualenv (see these instructions if you need to create one):

pip3 install lm-datasets
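
If no virtualenv exists yet, the steps below are one possible way to create and activate one before running the install command above. This is a minimal sketch using Python's standard venv module; the directory name lm-env is an arbitrary placeholder and not something this package requires.

```shell
# Create an isolated environment in ./lm-env (hypothetical directory name).
python3 -m venv lm-env

# Activate it for the current shell session.
. lm-env/bin/activate

# Print the environment root that python3 now resolves to.
python3 -c 'import sys; print(sys.prefix)'
```

With the environment active, pip3 installs packages into lm-env rather than into the system Python.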

Releases

Version  Released    Buster (Python 3.7)  Bullseye (Python 3.9)  Bookworm (Python 3.11)  Files
0.0.2    2023-12-15
0.0.1    2023-09-18


Page last updated 2023-12-15 15:45:16 UTC