vajra

A high-throughput and low-latency LLM inference system

Installation

In a virtualenv (see these instructions if you need to create one):

pip3 install vajra

Releases

There are no releases of this package yet

Issues with this package?

Page last updated 2025-07-12 09:27:36 UTC