This repository contains the codebase of prompt learning techniques integrated with CAT-Seg (CVPR'24) to adapt the Vision-Language Model CLIP to the downstream task of semantic segmentation in an open-vocabulary setting.
The following prompt learning techniques are included in this repository:
- **Context Optimization (CoOp, IJCV'22)**
  - Models a prompt's context with a set of learnable vectors, which are optimized by minimizing the downstream loss.
  - Instead of the vanilla template "a photo of a [CLASS]", learnable context vectors are used as the prompt, e.g. "X X X X [CLASS]".
  - The integration of this technique into CAT-Seg can be found in `class CLIP` of `./catseg/third_party/model_vpt.py` on branch `main`.
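A minimal PyTorch sketch of the CoOp idea, learnable context vectors prepended to the class-name embedding, is shown below. The class name `CoOpPromptLearner` and all dimensions are illustrative assumptions, not values taken from this repository.

```python
import torch
import torch.nn as nn

class CoOpPromptLearner(nn.Module):
    """Sketch of CoOp: a shared set of learnable context vectors
    replaces the hand-crafted prompt template. Sizes are illustrative."""

    def __init__(self, n_classes: int, n_ctx: int = 4, ctx_dim: int = 512):
        super().__init__()
        # "X X X X" -> n_ctx learnable context vectors shared by all classes
        self.ctx = nn.Parameter(torch.empty(n_ctx, ctx_dim))
        nn.init.normal_(self.ctx, std=0.02)
        # Frozen class-name token embeddings would come from CLIP's
        # token embedding layer; random tensors stand in here.
        self.register_buffer("cls_emb", torch.randn(n_classes, 1, ctx_dim))

    def forward(self) -> torch.Tensor:
        # Prepend the shared context to every class embedding:
        # output shape (n_classes, n_ctx + 1, ctx_dim)
        ctx = self.ctx.unsqueeze(0).expand(self.cls_emb.size(0), -1, -1)
        return torch.cat([ctx, self.cls_emb], dim=1)

prompts = CoOpPromptLearner(n_classes=10)()
print(prompts.shape)  # torch.Size([10, 5, 512])
```

In the real method, these prompt tensors are fed through CLIP's frozen text encoder, and only `self.ctx` receives gradients.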
- **Conditional Context Optimization (CoCoOp, CVPR'22)**
  - Follows a similar approach to CoOp, but here the context vectors are conditioned on the image features via a lightweight meta-network.
  - This augments the learnable prompts with the image context as a prior.
  - The integration of this technique into CAT-Seg can be found in `class CLIP` of `./catseg/third_party/model_vpt.py` on branch `CoCoOp`.
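A minimal sketch of the CoCoOp conditioning step: a small meta-network maps each image feature to a bias that shifts the shared context vectors, making the prompt instance-conditional. The class name, the meta-network layout, and all dimensions are illustrative assumptions, not this repository's actual configuration.

```python
import torch
import torch.nn as nn

class CoCoOpPromptLearner(nn.Module):
    """Sketch of CoCoOp: shared learnable context plus an
    image-conditioned shift produced by a bottleneck MLP (meta-net)."""

    def __init__(self, n_classes=10, n_ctx=4, ctx_dim=512, img_dim=512):
        super().__init__()
        self.ctx = nn.Parameter(torch.empty(n_ctx, ctx_dim))
        nn.init.normal_(self.ctx, std=0.02)
        # Meta-net: image feature -> per-image context shift
        self.meta_net = nn.Sequential(
            nn.Linear(img_dim, img_dim // 16),
            nn.ReLU(inplace=True),
            nn.Linear(img_dim // 16, ctx_dim),
        )
        # Stand-in for frozen class-name embeddings from CLIP
        self.register_buffer("cls_emb", torch.randn(n_classes, 1, ctx_dim))

    def forward(self, img_feat: torch.Tensor) -> torch.Tensor:
        # img_feat: (batch, img_dim) -> bias: (batch, 1, ctx_dim)
        bias = self.meta_net(img_feat).unsqueeze(1)
        ctx = self.ctx.unsqueeze(0) + bias              # (batch, n_ctx, d)
        n_cls = self.cls_emb.size(0)
        # One prompt per (image, class): (batch, n_cls, n_ctx + 1, d)
        ctx = ctx.unsqueeze(1).expand(-1, n_cls, -1, -1)
        cls = self.cls_emb.unsqueeze(0).expand(ctx.size(0), -1, -1, -1)
        return torch.cat([ctx, cls], dim=2)

prompts = CoCoOpPromptLearner()(torch.randn(2, 512))
print(prompts.shape)  # torch.Size([2, 10, 5, 512])
```

Because the prompts now depend on each image, the text encoder must be run once per image rather than once per dataset, which is the main computational cost of CoCoOp over CoOp.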
- **Textual-based Class-aware Prompt tuning for Visual-Language Model (TCP, CVPR'24)**
  - This technique injects textual knowledge about each class into the learnable prompts.
  - This enhances generalizability across unseen classes by combining prior textual knowledge with the fine-tuned learnable prompts.
  - The integration of this technique into CAT-Seg can be found in `class CLIP` of `./catseg/third_party/model_vpt.py` on branch `TCP`.
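A heavily simplified sketch of the TCP idea: frozen class-level text features are projected into class-aware prompt tokens and combined with shared learnable context. The class name `TCPPromptLearner`, the single linear projection, and all dimensions are illustrative assumptions and do not reproduce the paper's exact architecture.

```python
import torch
import torch.nn as nn

class TCPPromptLearner(nn.Module):
    """Sketch of TCP's core idea: a learnable projection maps each
    class's frozen text feature into a class-aware prompt token,
    concatenated with shared learnable context. Sizes illustrative."""

    def __init__(self, n_classes=10, n_ctx=4, ctx_dim=512, txt_dim=512):
        super().__init__()
        self.ctx = nn.Parameter(torch.empty(n_ctx, ctx_dim))
        nn.init.normal_(self.ctx, std=0.02)
        # Projection injecting textual knowledge into the prompt
        self.knowledge_proj = nn.Linear(txt_dim, ctx_dim)
        # Stand-in for frozen class-name embeddings from CLIP
        self.register_buffer("cls_emb", torch.randn(n_classes, 1, ctx_dim))

    def forward(self, class_text_feat: torch.Tensor) -> torch.Tensor:
        # class_text_feat: (n_classes, txt_dim) frozen CLIP text features
        n_cls = class_text_feat.size(0)
        class_token = self.knowledge_proj(class_text_feat).unsqueeze(1)
        ctx = self.ctx.unsqueeze(0).expand(n_cls, -1, -1)
        # Shared context + class-aware token + class-name embedding:
        # output shape (n_classes, n_ctx + 2, ctx_dim)
        return torch.cat([ctx, class_token, self.cls_emb], dim=1)

prompts = TCPPromptLearner()(torch.randn(10, 512))
print(prompts.shape)  # torch.Size([10, 6, 512])
```

Because the class-aware tokens are derived from frozen text features, the prior textual knowledge is preserved even as the shared context and projection are fine-tuned.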