DiffusionGemma · block diffusion (GGUF, llama.cpp)
block 0 · step 0/0
speed
Watching the 256-token canvas resolve out of noise: the model's argmax over the whole block, repainted each denoising step, until it settles and the block commits. Optimized path: on-device sampling reductions + device-resident self-conditioning.