add track pipeline vis

2024-08-06 03:03:23 +08:00
parent 0563b4e368
commit bb56590c47
2 changed files with 4 additions and 0 deletions
--- a/README.md
+++ b/README.md
@@ -108,6 +108,10 @@ python grounded_sam2_tracking_demo.py

 We have observed that the video predictor in SAM 2 currently **did not support box prompts** (please feel free to point out any updates or functionalities we may have overlooked during development). However, Grounding DINO provides box prompts, which need to be converted into point prompts for use in video tracking. A straightforward approach is to directly sample the center point of the box as a point prompt. Nevertheless, this method may encounter certain issues in practical testing scenarios. To **get a more stable segmentation results**, we reuse the SAM 2 image predictor to get the prediction mask for each object first, then we **uniformly sample points from the prediction mask** to prompt SAM 2 video predictor.

+A simple pipeline visualization is as follows:
+
+![Grounded SAM 2 Tracking Pipeline](./assets/grounded_sam2_tracking_pipeline_vis.png)
+
 ### Grounded-SAM-2 Video Object Tracking Demo (with Grounding DINO 1.5 & 1.6)

 We've also support video object tracking demo based on our stronger `Grounding DINO 1.5` model and `SAM 2`, you can try the following demo after applying the API keys for running `Grounding DINO 1.5`: