■
DiffusionGemma
· block diffusion (GGUF, llama.cpp)
block 0 · step 0/0
Pause
Restart
speed
0.5x
1x
2x
Watching the 256-token canvas resolve out of noise: the model's argmax over the whole block, repainted each denoising step, until it settles and the block commits. Optimized path: on-device sampling reductions + device-resident self-conditioning.