khmerdocparser

A smart Python tool to extract Khmer text from PDF and image files, using OCR for scanned documents and direct extraction for native PDFs.

Installation

In a virtualenv (see these instructions if you need to create one):

pip3 install khmerdocparser

Dependencies

  • None

Releases

Version Released Bullseye
Python 3.9
Bookworm
Python 3.11
Files
0.3.0 2025-08-11
0.2.1 2025-08-11
0.2.0 2025-08-11
0.1.0 2025-08-11

Issues with this package?

Page last updated 2025-08-16 16:21:15 UTC