Combining Segment Anything (SAM) with Grounding DINO for zero-shot object detection and CLIPSeg for zero-shot segmentation
This is a proof of concept for zero-shot panoptic segmentation using the Segment Anything Model (SAM).
SAM cannot achieve panoptic segmentation out of the box due to two limitations:

- SAM is promptable: it needs input prompts such as points or boxes to generate masks, so it cannot segment a full image into labeled regions on its own.
- SAM's masks are class-agnostic: it predicts no semantic labels, while panoptic segmentation requires a class for every pixel, covering both "thing" and "stuff" categories.
To solve these challenges, we use the following additional models:

- Grounding DINO, a zero-shot object detector that turns text prompts into bounding boxes for "thing" categories; these boxes serve as prompts for SAM.
- CLIPSeg, a zero-shot segmentation model that produces coarse masks for "stuff" categories from text prompts.
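To give an idea of the final combination step, here is a minimal sketch in plain NumPy. It assumes you already have SAM-style boolean instance masks (with class ids from the detector) and a CLIPSeg-style per-pixel stuff segmentation; the `merge_panoptic` helper and its `class_id * offset + instance_id` encoding are illustrative choices, not part of any of these models' APIs:

```python
import numpy as np

def merge_panoptic(stuff_seg, instance_masks, instance_labels, offset=1000):
    """Merge stuff and thing predictions into one panoptic label map.

    stuff_seg: (H, W) int array of stuff class ids (e.g. from CLIPSeg).
    instance_masks: list of (H, W) boolean masks (e.g. from SAM).
    instance_labels: one class id per mask (e.g. from Grounding DINO).
    Each pixel is encoded as class_id * offset + instance_id.
    """
    panoptic = stuff_seg.astype(np.int64) * offset  # stuff gets instance id 0
    for inst_id, (mask, cls) in enumerate(zip(instance_masks, instance_labels), start=1):
        panoptic[mask] = cls * offset + inst_id  # thing masks overwrite stuff
    return panoptic

# Toy example: a 4x4 image that is all "sky" (class 0),
# with one detected "cat" (class 5) in the bottom-right corner.
stuff = np.zeros((4, 4), dtype=np.int64)
cat_mask = np.zeros((4, 4), dtype=bool)
cat_mask[2:, 2:] = True
panoptic = merge_panoptic(stuff, [cat_mask], [5])
```

Later instances overwrite earlier ones where masks overlap; in practice you may want to order masks by detector confidence before merging.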
You can try out the pipeline by running the notebook in Colab or via the Gradio demo on Hugging Face Spaces.
The notebook also shows how the predictions from this pipeline can be uploaded to Segments.ai as pre-labels, where you can adjust them to obtain perfect labels for fine-tuning your segmentation model.
Our Frankenstein-ish pipeline looks as follows: