Back to Glossary

Text-to-Image

Image/Audio/Video

Generating images from text descriptions.


Text-to-image generation uses generative models to create new, realistic images from text prompts. Modern systems such as DALL·E, Midjourney, or Stable Diffusion rely on diffusion or transformer architectures.

  • Example: “A fox wearing sunglasses in the style of Van Gogh.”
  • Note: Prompts require clarity and respect for copyright boundaries.