
Imagen: Text-to-Image Diffusion Models
We present Imagen, a text-to-image diffusion model with an unprecedented degree of photorealism and a deep level of language understanding. Imagen builds on the power of …
Imagen Editor & EditBench
A key challenge is to generate edits that are faithful to input text prompts, while consistent with input images. We present Imagen Editor, a cascaded diffusion model built by fine-tuning …
Imagen Video
Generative modeling has made tremendous progress, especially in recent text-to-image models. Imagen Video is another step forward in generative modeling capabilities, advancing text-to …
… each sub-model relatively simple. Imagen (Saharia et al., 2022b) also showed that by conditioning on text embeddings from a large frozen language model in conjunction with cascaded diffusion …
To probe image quality, the rater is asked to select between the model generation and reference image using the question: “Which image is more photorealistic (looks more real)?”
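As a rough illustration of how responses to such a two-way choice could be tallied, the sketch below computes the fraction of raters who picked the model generation over the reference image. The excerpt does not say how responses are aggregated, so the function name `preference_rate` and the labels "model" and "reference" are assumptions made for this example.

```python
from collections import Counter


def preference_rate(choices):
    """Return the fraction of responses that picked the model generation.

    `choices` is a list of strings, each either "model" or "reference".
    These labels are hypothetical and chosen only for this sketch.
    """
    counts = Counter(choices)
    total = counts["model"] + counts["reference"]
    if total == 0:
        raise ValueError("no valid responses to aggregate")
    return counts["model"] / total


# Made-up example responses from five raters for one image pair.
responses = ["model", "reference", "reference", "model", "reference"]
rate = preference_rate(responses)
print(f"Model preferred in {rate:.0%} of responses")
```

Under this tally, a rate near 50% would mean raters could not reliably tell the generated image from the reference.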