DiffusionGemma 26B: How Parallel Text Diffusion Accelerates AI
Google DeepMind’s DiffusionGemma 26B uses parallel text diffusion to generate text faster and remember more context. This open‑source model can write up to 1,100 tokens per second and supports a 256 K token window.