Code for "Decomposing and Interpreting Image Representations via Text in ViTs Beyond CLIP" (NeurIPS 2024)

This repository contains the code to reproduce our work "Decomposing and Interpreting Image Representations via Text in ViTs Beyond CLIP" which was accepted in NeurIPS 2024.

Code Setup

conda create --name vit_decompose python=3.11
conda activate vit_decompose
pip install -r requirements.txt

Dataset Setup

Specify the ImageNet/Waterbirds path in each script that you run.

You can download Imagenet-1000 from here

For the Waterbirds experiments, you can find our cleaned Waterbirds datasets in dataset_archives/

CLIP aligner weights can be downloaded from this link

Experiments

Component ablation : python component_ablation.py
Image retrieval from image (wrt specified property) : python image_retrieval_from_image.py
Image retrieval from text : python image_retrieval_from_text.py; python image_retrieval_from_text.py
Image Segmentation: python zs_segmentation_decompose.py --model_key DINO --num_samples 5000
Zero-shot spurous correlation mitigation: python zs_spur_correlation.py

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
dataset_archives		dataset_archives
datasets		datasets
docs		docs
helpers		helpers
probe_imgs		probe_imgs
timm		timm
README.md		README.md
__init__.py		__init__.py
component_ablation.py		component_ablation.py
image_retrieval_from_image.py		image_retrieval_from_image.py
image_retrieval_from_text.py		image_retrieval_from_text.py
image_retrieval_from_text_quant.py		image_retrieval_from_text_quant.py
imagenet_classes.txt		imagenet_classes.txt
requirements.txt		requirements.txt
templates.txt		templates.txt
visualize_tokens.py		visualize_tokens.py
zs_segmentation_chefer.py		zs_segmentation_chefer.py
zs_segmentation_decompose.py		zs_segmentation_decompose.py
zs_segmentation_gradcam.py		zs_segmentation_gradcam.py
zs_spur_correlation.py		zs_spur_correlation.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Code for "Decomposing and Interpreting Image Representations via Text in ViTs Beyond CLIP" (NeurIPS 2024)

Code Setup

Dataset Setup

Experiments

About

Releases

Packages

Languages

SriramB-98/vit-decompose

Folders and files

Latest commit

History

Repository files navigation

Code for "Decomposing and Interpreting Image Representations via Text in ViTs Beyond CLIP" (NeurIPS 2024)

Code Setup

Dataset Setup

Experiments

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages