Image segmentation

tags: Computer vision, Image processing, Object recognition, Foundation models

Predicting per-pixel labels (instance, semantic, panoptic) for an image, including the open-vocabulary regime where target classes are specified by text or visual exemplars at inference time.

Links to this note

Knowledge Base Index
Notes on: SAM 3: Segment Anything with Concepts by Nicolas Carion, Laura Gustafson, Yuan-Ting Hu, Shoubhik Debnath, Ronghang Hu, Didac Suris, Chaitanya Ryali, Kalyan Vasudev Alwala, Haitham Khedr, Andrew Huang, Jie Lei, Tengyu Ma, Baishan Guo, Arpit Kalla, Markus Marks, Joseph Greer, Meng Wang, Peize Sun, Roman Rädle, Triantafyllos Afouras, Effrosyni Mavroudi, Katherine Xu, Tsung-Han Wu, Yu Zhou, Liliane Momeni, Rishi Hazra, Shuangrui Ding, Sagar Vaze, Francois Porcher, Feng Li, Siyuan Li, Aishwarya Kamath, Ho Kei Cheng, Piotr Dollár, Nikhila Ravi, Kate Saenko, Pengchuan Zhang, Christoph Feichtenhofer (2025)

Last changed 2026.04.19 | authored by Hugo Cisneros

Comments

Loading comments...

Back to Notes

Image segmentation

Links to this note

Comments

Leave a comment