flexgen

Running large language models like OPT-175B/GPT-3 on a single GPU. Focusing on high-throughput large-batch generation.

Installation

In a virtualenv (see these instructions if you need to create one):

pip3 install flexgen

Releases

Version Released Buster
Python 3.7
Bullseye
Python 3.9
Bookworm
Python 3.11
Files
0.1.8 2023-10-09      
0.1.7 2023-03-01    
0.1.6 2023-02-27    
0.1.5 2023-02-26    
0.1.4 2023-02-26    
0.1.3 2023-02-26    
0.1.2 2023-02-26    
0.1.1 2023-02-26    

Issues with this package?

Page last updated 2023-10-27 23:44:44 UTC