Links to this note
-
Adversarial examples
-
CLIP
-
Flamingo
-
Geospatial AI
-
GLIDE
-
Global context ViT
-
Image classification
-
Image segmentation
-
Imagen
-
Knowledge Base Index
-
Notes on: End-to-End Object Detection with Transformers by Nicolas Carion, Francisco Massa, Gabriel Synnaeve, Nicolas Usunier, Alexander Kirillov, Sergey Zagoruyko (2020)
-
Notes on: Perception Encoder: The best visual embeddings are not at the output of the network by Daniel Bolya, Po-Yao Huang, Peize Sun, Jang Hyun Cho, Andrea Madotto, Chen Wei, Tengyu Ma, Jiale Zhi, Jathushan Rajasegaran, Hanoona Rasheed, Junke Wang, Marco Monteiro, Hu Xu, Shiyu Dong, Nikhila Ravi, Daniel Li, Piotr Dollár, Christoph Feichtenhofer (2025)
-
Notes on: SAM 3: Segment Anything with Concepts by Nicolas Carion, Laura Gustafson, Yuan-Ting Hu, Shoubhik Debnath, Ronghang Hu, Didac Suris, Chaitanya Ryali, Kalyan Vasudev Alwala, Haitham Khedr, Andrew Huang, Jie Lei, Tengyu Ma, Baishan Guo, Arpit Kalla, Markus Marks, Joseph Greer, Meng Wang, Peize Sun, Roman Rädle, Triantafyllos Afouras, Effrosyni Mavroudi, Katherine Xu, Tsung-Han Wu, Yu Zhou, Liliane Momeni, Rishi Hazra, Shuangrui Ding, Sagar Vaze, Francois Porcher, Feng Li, Siyuan Li, Aishwarya Kamath, Ho Kei Cheng, Piotr Dollár, Nikhila Ravi, Kate Saenko, Pengchuan Zhang, Christoph Feichtenhofer (2025)
-
Object recognition
-
Open-vocabulary detection
-
Raven's progressive matrices
-
Residual neural networks
-
Self-supervised learning
-
Style transfer
-
Swin Transformer
-
Transformers
-
Vision transformer
Last changed
| authored by
Hugo Cisneros
Comments
Back to Notes
Loading comments...