GLIDE

tags
Diffusion models, NLP, Computer vision
paper
(Nichol et al. 2022)

Architecture

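GLIDE is a text-conditional diffusion model. A Transformer encodes the caption, and its token embeddings condition a U-Net diffusion model that generates a 64×64 image. A second text-conditional diffusion model then upsamples that image to 256×256.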
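The two-stage flow (a base diffusion model, then an upsampler) can be illustrated with a toy sketch. This is not the GLIDE implementation: `toy_denoiser`, `sample_base`, `upsample`, and `generate` are hypothetical names, the "denoiser" is a deterministic stand-in for a trained U-Net, and the upsampler here is plain nearest-neighbour repetition rather than a diffusion model. Only the 64×64 and 256×256 resolutions come from the paper.

```python
import random

BASE_RES = 64    # base model resolution reported in the paper
FINAL_RES = 256  # resolution after the upsampling stage


def toy_denoiser(image, text_embedding, step):
    """Hypothetical stand-in for a trained U-Net.

    Pulls every pixel toward the mean of the text embedding; at step 0
    the pixel lands on that mean (up to floating-point rounding).
    """
    target = sum(text_embedding) / len(text_embedding)
    return [[px + (target - px) / (step + 1) for px in row] for row in image]


def sample_base(text_embedding, steps=10, seed=0):
    """Start from Gaussian noise and denoise iteratively at 64x64."""
    rng = random.Random(seed)
    image = [[rng.gauss(0, 1) for _ in range(BASE_RES)] for _ in range(BASE_RES)]
    for step in reversed(range(steps)):
        image = toy_denoiser(image, text_embedding, step)
    return image


def upsample(image, text_embedding, factor=FINAL_RES // BASE_RES):
    """Toy upsampler: nearest-neighbour repetition.

    Assumption for illustration only: the real second stage is itself a
    text-conditional diffusion model; this toy ignores text_embedding.
    """
    out = []
    for row in image:
        wide = [px for px in row for _ in range(factor)]
        out.extend(list(wide) for _ in range(factor))
    return out


def generate(text_embedding):
    """Run the base stage, then the upsampling stage."""
    low_res = sample_base(text_embedding)
    return upsample(low_res, text_embedding)
```

Calling `generate([0.2, 0.4])` returns a 256×256 nested list. The point of the sketch is the data flow (text embedding in, low-resolution sample, then upsampling), not image quality.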

Parameter count

3.5B for the base 64×64 model; the upsampling model adds a further 1.5B.

Bibliography

  1. Nichol et al. 2022. "GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models". arXiv.
