| # Tutorial for WebUI 1.5 Version |
| ## We have added two new features |
| - We have added text prompts to allow for interactive selection of objects that will be tracked in the video. |
| - We can now interactively add multiple objects for tracking in the video. |
|
|
|
|
| ## Text-Prompts |
| ### 1. Clone Grounding-DINO to `./src` |
| ``` |
| pip install -e git+https://github.com/IDEA-Research/GroundingDINO.git@main#egg=GroundingDINO |
| ``` |
|
|
| ### 2. Switch to Text-Tab by clicking `Text` Tab |
|
|
| <p align="center"> |
| <img src="./img/switch2textT.jpg" height="400"> |
| </p> |
|
|
| ### 3. Upload video or use example dicectly |
|
|
| ### 4. Enter text to select the objects you are interested in |
| - The `.` is used to split text, just like in the original Grounding-Dino setting. |
|
|
| <p align="center"> |
| <img src="./img/enter_text.jpg" height="400", width="400"> |
| </p> |
|
|
| ### 5. Get mask of selected object by clicking `Detect` button |
| - SAMTrack initialization may take some time. |
|
|
| <p align="center"> |
| <img src="./img/detect_result.jpg" height="400", width="400"> |
| </p> |
|
|
| ### 6. Track in video |
|
|
| ## Multi-Objects select |
| ### 1. Once we interactively add an object mask, we can click the `Add new object button` to prepare to add a new object. |
|
|
| <p align="center"> |
| <img src="./img/new_object.jpg" height="400", width="400"> |
| </p> |
|
|
| ### 2. Add a new object by clicking object |
|
|
| <p align="center"> |
| <img src="./img/second_object.jpg" height="400", width="400"> |
| </p> |
|
|
| ### 3. You can add as many objects as you want by clicking `Add new object` button. |