ML//Multimodal//SAM

2026-02-26

Segment Anything Model (SAM; Meta AI, 2023): promptable foundation model for image segmentation, trained on the SA-1B dataset (about 11M images, 1.1B masks).

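Prompt with points, boxes, or a coarse mask → get segmentation masks for the indicated object, with no per-class fine-tuning. An ambiguous prompt (e.g. a single click) can return several candidate masks, each with a predicted quality score. Text prompts were only explored as a proof of concept in the paper; the released model does not take text directly.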
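A minimal sketch of point and box prompting with the `segment_anything` package. `SamPrompt` and `segment` are hypothetical helpers written for this note, not part of the library; the checkpoint path and model type are assumptions you supply yourself. Only `sam_model_registry`, `SamPredictor`, `set_image`, and `predict` come from the package.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Tuple


@dataclass
class SamPrompt:
    """Hypothetical helper (not part of segment_anything) that collects prompts."""

    points: List[Tuple[float, float]] = field(default_factory=list)  # (x, y) in pixels
    labels: List[int] = field(default_factory=list)  # 1 = foreground, 0 = background
    box: Optional[Tuple[float, float, float, float]] = None  # (x0, y0, x1, y1)

    def add_point(self, x: float, y: float, foreground: bool = True) -> "SamPrompt":
        self.points.append((float(x), float(y)))
        self.labels.append(1 if foreground else 0)
        return self

    def set_box(self, x0: float, y0: float, x1: float, y1: float) -> "SamPrompt":
        if x1 <= x0 or y1 <= y0:
            raise ValueError("box must satisfy x0 < x1 and y0 < y1")
        self.box = (float(x0), float(y0), float(x1), float(y1))
        return self


def segment(image_rgb, checkpoint_path: str, prompt: SamPrompt, model_type: str = "vit_b"):
    """Run SAM on one HxWx3 uint8 RGB array and return (masks, scores)."""
    # Imported lazily so the prompt helper above works without torch installed.
    import numpy as np
    from segment_anything import SamPredictor, sam_model_registry

    sam = sam_model_registry[model_type](checkpoint=checkpoint_path)
    predictor = SamPredictor(sam)
    predictor.set_image(image_rgb)  # the image embedding is computed once here
    masks, scores, _low_res_logits = predictor.predict(
        point_coords=np.array(prompt.points, dtype=np.float32) if prompt.points else None,
        point_labels=np.array(prompt.labels, dtype=np.int64) if prompt.points else None,
        box=np.array(prompt.box, dtype=np.float32) if prompt.box else None,
        multimask_output=True,  # ask for several candidate masks
    )
    return masks, scores
```

Usage would look like `segment(image, "path/to/checkpoint.pth", SamPrompt().add_point(120, 80))`, then picking the mask with the highest score. Because `set_image` caches the embedding, a real interactive tool would keep the predictor around and call `predict` repeatedly rather than rebuilding it per prompt as this sketch does.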

SAM 2 (Meta, 2024) extends this to video: prompt an object on one frame, then track and segment it across the other frames. It also handles still images.

GroundingDINO + SAM (Grounded-SAM): GroundingDINO turns a text prompt into bounding boxes, and SAM turns each box into a mask. Together they give open-vocabulary, text-driven segmentation.
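The two-stage hand-off can be sketched as plain glue code. Everything here is hypothetical: `text_to_masks`, `fake_detect`, and `fake_segment` are illustrative stand-ins, not the GroundingDINO or SAM APIs, and the `min_score` threshold is an arbitrary assumption. In a real pipeline `detect` would wrap a text-conditioned detector and `segment` would wrap a box-prompted SAM call.

```python
from typing import Callable, Dict, List, Tuple

Box = Tuple[int, int, int, int]      # (x0, y0, x1, y1) in pixels, x1/y1 exclusive
Detection = Tuple[str, Box, float]   # (label, box, confidence)
Mask = List[List[bool]]              # H x W boolean grid


def text_to_masks(
    image,
    text_prompt: str,
    detect: Callable[[object, str], List[Detection]],
    segment: Callable[[object, Box], Mask],
    min_score: float = 0.3,  # assumed threshold, tune per detector
) -> List[Dict]:
    """Stage 1: text -> boxes. Stage 2: each confident box -> mask."""
    results = []
    for label, box, score in detect(image, text_prompt):
        if score < min_score:
            continue  # drop low-confidence detections before segmenting
        results.append(
            {"label": label, "box": box, "score": score, "mask": segment(image, box)}
        )
    return results


def fake_detect(image, text_prompt: str) -> List[Detection]:
    """Stand-in detector: returns fixed boxes regardless of the input."""
    return [("cat", (1, 1, 3, 3), 0.9), ("dog", (0, 0, 2, 2), 0.1)]


def fake_segment(image, box: Box) -> Mask:
    """Stand-in segmenter: fills the box with True (a real model refines the shape)."""
    x0, y0, x1, y1 = box
    height, width = len(image), len(image[0])
    return [[x0 <= x < x1 and y0 <= y < y1 for x in range(width)] for y in range(height)]


image = [[0] * 4 for _ in range(4)]  # dummy 4x4 "image"
out = text_to_masks(image, "a cat", fake_detect, fake_segment)
```

Passing the detector and segmenter in as callables keeps the glue independent of either model, so one stage can be swapped without touching the other.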