ademamix

AdEMAMix is a PyTorch optimizer that combines two EMAs to better utilize past gradients, offering improved convergence and model retention over AdamW.

Installation

In a virtualenv (see these instructions if you need to create one):

pip3 install ademamix

Dependencies

  • None

Releases

There are no releases of this package yet

Issues with this package?

Page last updated 2026-05-13 02:17:19 UTC