Global context ViT

tags: Transformers, Computer vision, ViT
paper: (Hatamizadeh et al. 2022)

Architecture

This is a hierarchical version of ViT with both local and global attention.

Parameter count

90M

Bibliography

Ali Hatamizadeh, Hongxu Yin, Jan Kautz, Pavlo Molchanov. June 20, 2022. "Global Context Vision Transformers". arXiv. DOI.

Last changed 2022.07.26 | authored by Hugo Cisneros

Comments

Loading comments...

Back to Notes