vllm-test-tpu

A high-throughput and memory-efficient inference and serving engine for LLMs

Installation

In a virtualenv (create one first if you do not already have one):

pip3 install vllm-test-tpu
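
For readers who would rather script the setup, the following is a minimal sketch that creates a virtualenv with Python's standard-library venv module and builds the matching pip command. The directory name vllm-env and the helper names (venv_python, install_command, create_env, setup) are hypothetical and chosen only for illustration; only the package name comes from this page.

```python
import subprocess
import sys
import venv
from pathlib import Path

PACKAGE = "vllm-test-tpu"  # package name as listed on this page


def venv_python(env_dir: Path) -> Path:
    """Return the interpreter path inside a virtualenv (layout differs on Windows)."""
    if sys.platform == "win32":
        return env_dir / "Scripts" / "python.exe"
    return env_dir / "bin" / "python"


def install_command(env_dir: Path, package: str = PACKAGE) -> list[str]:
    """Build the pip command that installs `package` into the virtualenv."""
    return [str(venv_python(env_dir)), "-m", "pip", "install", package]


def create_env(env_dir: Path, with_pip: bool = True) -> Path:
    """Create the virtualenv and return the path to its interpreter."""
    venv.create(env_dir, with_pip=with_pip)
    return venv_python(env_dir)


def setup(env_dir: Path = Path("vllm-env")) -> None:
    """Create the environment, then install the package (needs network access)."""
    create_env(env_dir)
    subprocess.run(install_command(env_dir), check=True)
```

Calling setup() performs the same installation as the pip3 command above, but inside a freshly created environment. Invoking pip through the environment's own interpreter avoids having to activate the virtualenv first.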

Dependencies

  • None

Releases

Version   Released     Bullseye (Python 3.9)   Bookworm (Python 3.11)   Trixie (Python 3.13)   Files
0.9.0.1   2025-05-20

Page last updated 2025-09-22 00:12:11 UTC