start-ocr

Applies pdfplumber, OpenCV, and pytesseract to extract content and metadata from formal PDF files.
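To illustrate how these three libraries can fit together, here is a minimal sketch that calls them directly. It is not start-ocr's own API. The function names `needs_ocr` and `extract_page_text`, the 20-character threshold, and the "text layer first, OCR as fallback" strategy are all assumptions made for the example.

```python
from __future__ import annotations


def needs_ocr(text_layer: str | None, min_chars: int = 20) -> bool:
    """Return True when a page's embedded text layer looks too sparse to trust."""
    # min_chars is an arbitrary illustrative threshold, not a start-ocr setting.
    return len((text_layer or "").strip()) < min_chars


def extract_page_text(pdf_path: str, page_index: int = 0, resolution: int = 300) -> str:
    """Read one page's text, falling back to OCR for scanned pages."""
    # Imported lazily so needs_ocr() stays usable without the OCR stack installed.
    import cv2
    import numpy as np
    import pdfplumber
    import pytesseract

    with pdfplumber.open(pdf_path) as pdf:
        page = pdf.pages[page_index]
        text = page.extract_text()
        if not needs_ocr(text):
            return text
        # Render the page to a PIL image so it can be passed to OpenCV as an array.
        pil_image = page.to_image(resolution=resolution).original.convert("RGB")

    gray = cv2.cvtColor(np.array(pil_image), cv2.COLOR_RGB2GRAY)
    # Otsu thresholding picks a global cutoff and binarizes the scan.
    _, binary = cv2.threshold(gray, 0, 255, cv2.THRESH_BINARY + cv2.THRESH_OTSU)
    return pytesseract.image_to_string(binary)
```

Calling `extract_page_text("decision.pdf")` (a hypothetical file name) needs the Tesseract binary on the system in addition to the Python packages. Trying the text layer first avoids OCR errors on PDFs that already carry digital text.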

Installation

Install inside a virtualenv (create one first if needed):

pip3 install start-ocr
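One way to confirm that the install succeeded is to ask Python's standard library which version of the distribution it can see. The helper name `installed_version` is an assumption made for this sketch; only `importlib.metadata` from the standard library is used.

```python
from __future__ import annotations

from importlib.metadata import PackageNotFoundError, version


def installed_version(dist_name: str) -> str | None:
    """Return the installed version of a distribution, or None if it is absent."""
    try:
        return version(dist_name)
    except PackageNotFoundError:
        return None


if __name__ == "__main__":
    # Prints the version string when start-ocr is installed in this environment,
    # otherwise prints None.
    print(installed_version("start-ocr"))
```

Run it with the virtualenv's interpreter so the lookup reflects the environment where the package was installed.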

Releases

Version  Released    Bullseye (Python 3.9)  Bookworm (Python 3.11)  Files
0.0.6    2024-02-08
0.0.5    2024-02-01
0.0.4    2023-09-24
0.0.3    2023-06-02
0.0.2    2023-06-02
0.0.1    2023-06-01


Page last updated 2025-07-17 20:31:37 UTC