Global context ViT

tags
Transformers, Computer vision, ViT
paper
(Hatamizadeh et al. 2022)

Architecture

This is a hierarchical version of ViT with both local and global attention.

Parameter count

90M

Bibliography

  1. . . "Global Context Vision Transformers". arXiv. DOI.
Last changed | authored by

Comments


← Back to Notes