Diffusion-based text generation


    Diffusion-based text generation is a method that uses diffusion models, originally developed for image generation, to generate text by iteratively refining random noise into coherent sequences of words or tokens. Instead of predicting the next word in a sequence (as in traditional autoregressive models like GPT), diffusion models gradually “denoise” a random text representation over several steps to produce meaningful language.

    This approach is inspired by score-based generative modeling and denoising diffusion probabilistic models (DDPMs) and involves learning how to reverse a noising process applied to text embeddings or token representations.
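    To make the noising side of this concrete, here is a minimal sketch of the closed-form DDPM forward process applied to toy token embeddings. All names and values (the embedding shapes, the linear beta schedule, the step count) are illustrative assumptions, not a real model.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy embeddings for a 4-token sentence (hypothetical values).
x0 = rng.normal(size=(4, 8))  # (sequence_length, embedding_dim)

# Simple linear variance schedule over T noising steps.
T = 10
betas = np.linspace(1e-4, 0.2, T)
alphas_bar = np.cumprod(1.0 - betas)  # cumulative signal retention

def noise_at_step(x0, t):
    """Sample x_t ~ q(x_t | x_0) in closed form, as in DDPMs:
    x_t = sqrt(alpha_bar_t) * x_0 + sqrt(1 - alpha_bar_t) * noise."""
    eps = rng.normal(size=x0.shape)
    return np.sqrt(alphas_bar[t]) * x0 + np.sqrt(1.0 - alphas_bar[t]) * eps

# As t grows, x_t correlates less and less with the clean embeddings;
# training teaches a network to run this process in reverse.
for t in (0, T // 2, T - 1):
    xt = noise_at_step(x0, t)
    corr = np.corrcoef(x0.ravel(), xt.ravel())[0, 1]
    print(f"t={t}: correlation with x0 = {corr:.2f}")
```

    A trained diffusion language model learns the reverse of this mapping, predicting the clean embeddings (or the added noise) from a noisy `x_t`.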

    Key Characteristics:

    • Non-autoregressive: It generates entire sequences rather than word-by-word.

    • Iterative refinement: Text is generated through multiple steps of noise removal.

    • Emerging area: still less mature than autoregressive transformer methods, but promising for output quality, diversity, and parallel decoding.
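    The non-autoregressive, iterative-refinement behavior described above can be sketched with a toy mask-based denoising loop. The vocabulary, target sentence, and `toy_denoiser` are all hypothetical stand-ins; a real diffusion language model would predict tokens from context with a trained network rather than copy from a known target.

```python
import random

random.seed(0)

MASK = "<mask>"
# Hypothetical target sentence, used only so the toy "model" has answers.
target = ["the", "cat", "sat", "on", "the", "mat"]

def toy_denoiser(seq):
    """Stand-in for a learned denoiser: proposes a token for every
    masked position at once (a real model would predict from context)."""
    return [target[i] if tok == MASK else tok for i, tok in enumerate(seq)]

# Start from pure "noise": a fully masked sequence.
seq = [MASK] * len(target)

# Iterative refinement: each step commits a random subset of the
# proposed tokens, so the sequence emerges in parallel over a few
# steps rather than strictly left-to-right.
steps = 3
for step in range(steps):
    proposal = toy_denoiser(seq)
    masked = [i for i, tok in enumerate(seq) if tok == MASK]
    keep = set(random.sample(masked, max(1, len(masked) // (steps - step))))
    seq = [proposal[i] if i in keep else seq[i] for i in range(len(seq))]
    print(f"step {step}: {' '.join(seq)}")

# Final pass fills any positions still masked.
seq = toy_denoiser(seq)
print("final:", " ".join(seq))
```

    This mirrors discrete (masked) diffusion variants, where "noise" means masking tokens and denoising means un-masking a few positions per step; continuous variants instead denoise embedding vectors as in the DDPM formulation.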

    Example Use Case:

    • OpenAI’s Sora applies diffusion to video rather than text, but research such as Stanford’s Diffusion-LM and Google DeepMind’s Gemini Diffusion explores text and code generation with diffusion techniques to improve controllability and parallel generation speed.