DALL-E

tags: Transformers, GPT
paper: (Ramesh et al. 2021)

Architecture

It is a decoder architecture with a Variational autoencoders and a variant of GPT-3 to convert text to images.

Parameter count

12B

Bibliography

Aditya Ramesh, Mikhail Pavlov, Gabriel Goh, Scott Gray, Chelsea Voss, Alec Radford, Mark Chen, Ilya Sutskever. February 26, 2021. "Zero-Shot Text-to-Image Generation". February 26, 2021DOI.

Links to this note

Last changed 2022.07.22 | authored by Hugo Cisneros

Comments

Loading comments...

Back to Notes