mineru-html

MinerU-HTML is a main-content extraction tool based on small language models.

Installation

In a virtualenv (create one first if you do not already have one):

pip3 install mineru-html
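
To confirm that the install succeeded in the active environment, a minimal sketch along these lines may help. It relies only on the standard library's importlib.metadata and the distribution name mineru-html shown above; the helper name installed_version is hypothetical and not part of the package.

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Optional


def installed_version(dist_name: str = "mineru-html") -> Optional[str]:
    """Return the installed version of dist_name, or None if it is absent.

    installed_version is an illustrative helper, not part of mineru-html.
    """
    try:
        # Looks up the distribution metadata recorded by pip at install time.
        return version(dist_name)
    except PackageNotFoundError:
        return None


if __name__ == "__main__":
    found = installed_version()
    if found is None:
        print("mineru-html is not installed in this environment")
    else:
        print(f"mineru-html {found} is installed")
```

Run it with the virtualenv's interpreter, since a system-wide Python will not see packages installed inside the virtualenv.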

Dependencies

  • None

Releases

Version  Released    Bullseye (Python 3.9)  Bookworm (Python 3.11)  Trixie (Python 3.13)  Files
1.1.2    2026-03-26
1.1.1    2026-03-24
1.1.0    2026-03-19
1.0.0    2026-03-19


Page last updated 2026-04-10 22:40:18 UTC