Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/yossigandelsman/second_order_lens

Official pytorch implementation of "Interpreting the Second-Order Effects of Neurons in CLIP"
https://github.com/yossigandelsman/second_order_lens

Last synced: about 2 months ago
JSON representation

Official pytorch implementation of "Interpreting the Second-Order Effects of Neurons in CLIP"

Host: GitHub
URL: https://github.com/yossigandelsman/second_order_lens
Owner: yossigandelsman
License: mit
Created: 2024-05-24T22:59:49.000Z (9 months ago)
Default Branch: main
Last Pushed: 2024-11-15T00:50:56.000Z (3 months ago)
Last Synced: 2024-11-15T01:29:41.117Z (3 months ago)
Language: Jupyter Notebook
Homepage: https://yossigandelsman.github.io/clip_neurons/index.html
Size: 5.96 MB
Stars: 28
Watchers: 3
Forks: 3
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE.txt

Awesome Lists containing this project

README

## Interpreting the Second-Order Effects of Neurons in CLIP
Official PyTorch Implementation

### [Paper](https://arxiv.org/abs/2406.04341) | [Project Page](https://yossigandelsman.github.io/clip_neurons/)

[Yossi Gandelsman](https://yossigandelsman.github.io/), [Alexei A. Efros](https://people.eecs.berkeley.edu/~efros/), and [Jacob Steinhardt](https://jsteinhardt.stat.berkeley.edu/)

![Teaser](figures/teaser.png)

🔥 Check out [our previous paper](https://yossigandelsman.github.io/clip_decomposition/) on interpreting attention heads in CLIP with text.
### Setup
We provide an [`environment.yml`](environment.yml) file that can be used to create a Conda environment:

```bash
conda env create -f environment.yml
conda activate prsclip
```

### Obtaining training representations, second-order decomposition, and labels
To pre-compute the representations and the labels for the subsampled ImageNet data, execute:
```bash
datapath='...'
outputdir='...'

python compute_representations.py --model ViT-B-32 --pretrained openai --data_path $datapath --output_dir $outputdir # for representations and classes
python compute_classifier_projection.py --model ViT-B-32 --pretrained openai --output_dir $outputdir # for classifier weights

# Get second order for neurons in layer 9:
python compute_second_order_neuron_prs.py --model ViT-B-32 --pretrained openai --mlp_layer 9 --output_dir $outputdir --data_path $datapath # second order effect
```
### Computing approximation for each neuron
To obtain the first PC that approximates most of the behavior of a single neuron, execute:
```bash
outputdir='...'
inputdir='...' # Set it to be the output of the previous stage

python compute_pcas.py --model ViT-B-32 --pretrained openai --mlp_layer 9 --output_dir $outputdir --input_dir $inputdir
```

### Ablating the second-order effects
To repeat our empirical analysis of the second-order effects, execute:
```bash
python compute_ablations.py --model ViT-B-32 --pretrained openai --mlp_layer 9 --output_dir $outputdir --input_dir $inputdir --data_path $datapath
```

### Decomposing the representation into text descriptions
To decompose the neuron second-order effects into text, execute:
```bash
python compute_text_set_projection.py --model ViT-B-32 --pretrained openai --output_dir $outputdir --data_path text_descriptions/30k.txt # get the text representations
# run the decomposition:
python compute_sparse_decomposition.py --model ViT-B-32 --pretrained openai --output_dir $outputdir --mlp_layer 9 --components 128 --text_descriptions 30k --device cuda:0
```
To verify the reconstruction quality, add the `--evaluate` flag.

### Visualizing top-activated images
Please see a demo for visualizing the images with the largest second-order effects per neuron in `visualize_neurons.ipynb`.

## Applications

### Generating semantic adversarial images
To get adversarial images, please run:

```bash
CUDA_VISIBLE_DEVICES=0 python generate_adversarial_images.py --device cuda:0 --class_0 "vacuum cleaner" --class_1 "cat" --model ViT-B-32 --pretrained openai --dataset_path $outputdir --text_descriptions 30k --mlp_layers 9 --neurons_num 100 --overall_words 50 --results_per_generation 1
```
Note that we used other hyperparameters in the paper, including `--mlp_layers 8 9 10`.

### Evaluating ImageNet segmentation
Please download the dataset from [here](http://calvin-vision.net/bigstuff/proj-imagenet/data/gtsegs_ijcv.mat):

```bash
mkdir imagenet_seg
cd imagenet_seg
wget http://calvin-vision.net/bigstuff/proj-imagenet/data/gtsegs_ijcv.mat
```

To get the evaluation results, please run:

```bash
CUDA_VISIBLE_DEVICES=0 python generate_adversarial_images.py --device cuda:0 --class_0 "stop sign" --class_1 "yield" --model ViT-B-32 --pretrained openai --dataset_path $outputdir --text_descriptions 30k --mlp_layers 9 --neurons_num 100 --overall_words 50 --results_per_generation 4
```
Note that we used other hyperparameters in the paper, including `--mlp_layers 8 9 10`.

### Concept discovery in images
Please see a demo for image concept discovery in `concept_discovery.ipynb`.

## BibTeX
```bibtex
@misc{gandelsman2024interpretingsecondordereffectsneurons,
title={Interpreting the Second-Order Effects of Neurons in CLIP},
author={Yossi Gandelsman and Alexei A. Efros and Jacob Steinhardt},
year={2024},
eprint={2406.04341},
archivePrefix={arXiv},
primaryClass={cs.CV},
url={https://arxiv.org/abs/2406.04341},
}
```