html-chunking

A Python package for token-aware HTML chunking that preserves structure and attributes, with optional cleaning and attribute length control.

Installation

In a virtualenv (see these instructions if you need to create one):

pip3 install html-chunking

Releases

Version Released Bullseye
Python 3.9
Bookworm
Python 3.11
Files
0.0.4 2024-09-12  
0.0.3 2024-09-12  
0.0.2 2024-09-12  
0.0.1 2024-09-12  

Issues with this package?

Page last updated 2025-07-18 00:29:21 UTC