ML//Multimodal//SAM
- Segment Anything Model (Meta, 2023): foundation model for image segmentation.
Segment Anything Model (Meta, 2023): foundation model for image segmentation.
Prompt with points, boxes, or text → get pixel-precise masks for any object.
SAM 2 extends to video — track and segment objects across frames.
GroundingDINO + SAM: text detection + pixel segmentation for open-vocabulary understanding.