- Install dependencies in
requirements.txt
- Create
image_path.txt
and write the image path in the file - Create a black image (e.g. using Microsoft Paint) and save as
black.png
in the root folder
- NVIDIA TITAN V 12GB (6.7GB VRAM was observed to be used by
blip2_clipseg.py
)
- Read user input for instruction
- Verify that the object is in scene
- Draw heatmap with CLIPSeg and save as
mask.png
- Overlay heatmap over source image and save as
annotated.png
Use your own prompt. Recommended to use Question: <insert question> Answer:
format
Input keyword, CLIPSeg draws heatmap