DiG-IN: Diffusion Guidance for Investigating Networks - Uncovering Classifier Differences, Neuron Visualisations, and Visual Counterfactual Explanations [CVPR2024]

This is the official implementation of our CVPR 2024 paper: DiG-IN: Diffusion Guidance for Investigating Networks - Uncovering Classifier Differences, Neuron Visualisations, and Visual Counterfactual Explanations

Setup

To create a conda environment for this project, please run:

conda create --name dig_in python=3.10
conda activate dig_in
conda install nvidia/label/cuda-12.1.0::cuda-nvcc
pip install -r requirements.txt
python -m spacy download en_core_web_sm

To allow for SAM-HQ segmentation, please download the model from here and put it into a sam_hq folder in the root directory.

All scripts are meant to be executed from the root directory. Most scripts support parallel execution on multiple GPUs via torchrun. Make sure to specify the number of GPUs via:

--nproc-per-node N

You can specify the CUDA device ids via CUDA_VISIBLE_DEVICES, for example:

CUDA_VISIBLE_DEVICES=0,2,7
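The two settings above have to agree: the number passed to --nproc-per-node should match the number of ids listed in CUDA_VISIBLE_DEVICES. A minimal sketch of one way to keep them in sync is shown below. The helper name `torchrun_command` is hypothetical and not part of this repository; it only assembles the command and environment and does not launch anything.

```python
import os
import shlex


def torchrun_command(script, gpu_ids, overrides=()):
    """Return (env, argv) for launching `script` with one process per GPU.

    Hypothetical helper for illustration; not part of the DiG-IN code base.
    """
    if not gpu_ids:
        raise ValueError("at least one GPU id is required")
    env = dict(os.environ)
    # Restrict the visible devices; CUDA renumbers them as 0..N-1
    # inside each launched process.
    env["CUDA_VISIBLE_DEVICES"] = ",".join(str(i) for i in gpu_ids)
    argv = [
        "torchrun",
        "--nproc-per-node", str(len(gpu_ids)),  # one process per visible GPU
        "--standalone",
        script,
        *overrides,
    ]
    return env, argv


env, argv = torchrun_command(
    "src/imagenet_cog_neuron_visualisation_stage1.py", gpu_ids=[0, 2, 7]
)
# Print the equivalent shell command line.
print(f"CUDA_VISIBLE_DEVICES={env['CUDA_VISIBLE_DEVICES']} {shlex.join(argv)}")
```

To actually start the job from Python, the returned values could be passed to `subprocess.run(argv, env=env, check=True)`.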

Synthetic Neuron Activations using CogVLM

First, you have to calculate the activations on the ImageNet train set. To do so, use:

python src/imagenet_cog_neuron_activations.py imagenet_folder=YOUR/PATH/TO/IMAGENET

Next, we use CogVLM to name the objects in the highest activating train images:

CUDA_VISIBLE_DEVICES=... torchrun --nproc-per-node N --standalone src/imagenet_cog_neuron_visualisation_stage1.py

Finally, you can generate visualisations using DiG-IN via:

CUDA_VISIBLE_DEVICES=... torchrun --nproc-per-node N --standalone src/imagenet_cog_neuron_visualisation_stage2.py

By default, all results will be saved in:

./output_cvpr/imagenet_cogvlm_neurons/

If you want to change the target neurons, you can do so via the argument

target_neurons=[...]
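Square brackets are glob characters in most shells, and zsh in particular aborts with "no matches found" when an unquoted bracket pattern matches no file, so it is safest to quote list arguments like this one. The sketch below formats such an argument; the helper `list_override` is hypothetical, the neuron indices are placeholders, and it is an assumption that the script expects a comma-separated list of integer indices.

```python
import shlex


def list_override(key, values):
    """Format key=[v1,v2,...] without spaces and quote it for the shell.

    Hypothetical helper for illustration; not part of the DiG-IN code base.
    """
    body = ",".join(str(v) for v in values)
    # shlex.quote wraps the string in single quotes because "[" and "]"
    # are not shell-safe characters.
    return shlex.quote(f"{key}=[{body}]")


# The neuron indices below are placeholders, not values from the paper.
print(list_override("target_neurons", [12, 340, 1997]))
```

The printed string can be appended as-is to the stage 2 command line above.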

Img2Img: Automatic Captioning and Null-Text Inversion

For the following Img2Img tasks (Neuron Counterfactuals and UVCEs), we use Null-Text Inversion as initialization. Since Null-Text Inversion requires a prompt for each image, we use Open-Flamingo to caption the images.

You can either download the captions used in our experiments and extract them to the default result folder, "./output_cvpr", or create them yourself via:

ImageNet:

CUDA_VISIBLE_DEVICES=... torchrun --nproc-per-node N --standalone src/open_flamingo_imagenet.py imagenet_folder=YOUR/PATH/TO/IMAGENET

For CUB, Cars and Food-101 use the following code and set the dataset argument to food101, cars or cub:

CUDA_VISIBLE_DEVICES=... torchrun --nproc-per-node N --standalone src/open_flamingo_cub_cars_food.py dataset=DATASET dataset_folder=YOUR/DATASET/PATH

Once you have obtained the captions, you can invert the images via:

CUDA_VISIBLE_DEVICES=... torchrun --nproc-per-node N --standalone src/imagenet_inversion.py guidance_scale=3.0 imagenet_folder=YOUR/PATH/TO/IMAGENET

For Food-101, CUB and Cars you can use:

CUDA_VISIBLE_DEVICES=... torchrun --nproc-per-node N --standalone src/inversion_cub_cars_food.py guidance_scale=3.0 dataset=DATASET dataset_folder=YOUR/DATASET/PATH

If you only want to invert a subset of the images in each class, you can use the "images_per_class" argument.

Neuron Counterfactuals

To generate neuron counterfactuals starting from real images from the ImageNet validation set, you can run:

Universal Visual Counterfactual Explanations (UVCE)

ImageNet:

CUDA_VISIBLE_DEVICES=... torchrun --nproc-per-node N --standalone src/uvces_imagenet.py regularizers=[latent_background_l2,px_background_l2] regularizers_ws=[25.0,250.0] imagenet_folder=YOUR/PATH/TO/IMAGENET

For CUB, Cars and Food:

CUDA_VISIBLE_DEVICES=... torchrun --nproc-per-node N --standalone src/uvces_cub_cars_food.py regularizers=[latent_background_l2,px_background_l2] regularizers_ws=[25.0,250.0] dataset=DATASET dataset_folder=YOUR/DATASET/PATH results_sub_folder=uvces random_images=True num_images=1000
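In the UVCE commands, regularizers and regularizers_ws are two lists of the same length. The sketch below assumes that the weights pair with the regularizer names by position (so latent_background_l2 gets 25.0 and px_background_l2 gets 250.0); this pairing is inferred from the commands and is not confirmed here. Building both arguments from a single mapping keeps the two lists aligned. The helper `regularizer_overrides` is hypothetical.

```python
def regularizer_overrides(weights):
    """Build both list arguments from one {name: weight} mapping.

    Hypothetical helper for illustration; assumes weights pair with
    regularizer names by position.
    """
    names = ",".join(weights)  # dicts preserve insertion order
    values = ",".join(str(float(w)) for w in weights.values())
    return [f"regularizers=[{names}]", f"regularizers_ws=[{values}]"]


overrides = regularizer_overrides(
    {"latent_background_l2": 25.0, "px_background_l2": 250.0}
)
print(" ".join(overrides))
```

With the values shown, this prints the same two arguments used in the commands above.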
