This repository investigates the impact of adversarial attacks on the explainability of Deep Residual Neural Networks (ResNets), specifically focusing on Saliency Map explanations.
The project systematically generates adversarial examples using various methods, calculates the corresponding Saliency Maps for both the original and adversarial inputs, and quantifies the change in these explanations using a set of comparative metrics. The goal is to assess the robustness of Saliency Maps as a trustworthy explainability method in the presence of minor input perturbations.
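The actual saliency method and comparison metrics are implemented in the repository's pipelines and notebooks; purely as an illustration of the idea, a vanilla gradient saliency map and one possible comparison metric (cosine similarity between the flattened maps) could be sketched as follows. Both helper functions here are hypothetical and not part of the repository:

```python
import torch
import torch.nn.functional as F

def saliency_map(model, image):
    """Vanilla gradient saliency: |d top-class logit / d input|, max over channels."""
    image = image.clone().detach().requires_grad_(True)
    logits = model(image.unsqueeze(0))
    logits[0, logits.argmax()].backward()
    return image.grad.abs().max(dim=0).values  # shape (H, W)

def saliency_change(sal_original, sal_adversarial):
    """One possible comparison metric: cosine similarity of the flattened maps."""
    return F.cosine_similarity(
        sal_original.flatten(), sal_adversarial.flatten(), dim=0
    ).item()
```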
You need Python 3.13 and a working environment manager (like conda or venv) to run the experiments.
The necessary dependencies are listed in the `requirements.txt` file.
If you use Conda, you can set up the required environment using the following commands:
- Create a Conda environment: `conda create -n adv_saliency python=3.13`
- Activate the environment: `conda activate adv_saliency`
- Install the dependencies from the provided `requirements.txt` file: `pip install -r requirements.txt`
- Deactivate the environment when finished: `conda deactivate`
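If you prefer `venv` over Conda, an equivalent setup could look like the following, assuming a Python 3.13 interpreter is available on your system as `python3.13`:

```bash
python3.13 -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt
deactivate
```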
The core logic is divided into two Python scripts that handle adversarial example generation and metric calculation. The results are stored in CSV files.
The `batch_wise_pipeline.py` script generates untargeted adversarial examples using various attacks available in the Foolbox library and computes saliency maps and comparison metrics for each image. It takes a command-line argument indicating the index of the attack group to use; all attacks are given in a list of dictionaries, each containing the Foolbox attack and the epsilon values to use for the adversarial attack.
The `targeted_batch_wise_pipeline.py` script works the same way but generates targeted adversarial examples instead.
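The exact attack list is defined inside the scripts; purely as an illustration of the structure described above (the attack classes and epsilon values shown here are hypothetical and may differ from those in the repository), an attack group could look like this:

```python
import foolbox as fb

# Hypothetical attack groups: each dictionary pairs a Foolbox attack with the
# epsilon values used when generating the adversarial examples.
ATTACK_GROUPS = [
    {"attack": fb.attacks.FGSM(), "epsilons": [0.001, 0.01, 0.03]},
    {"attack": fb.attacks.LinfPGD(), "epsilons": [0.001, 0.01, 0.03]},
    {"attack": fb.attacks.LinfDeepFoolAttack(), "epsilons": [0.001, 0.01, 0.03]},
]
```

A script would then be started with the index of the group to process, e.g. something like `python batch_wise_pipeline.py 0` (the exact argument parsing is defined in the scripts).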
The analysis of the calculated metrics is carried out in various Jupyter notebooks, ranging from exploratory data analysis (EDA) to clustering of the methods/attacks.
Different datasets can be used by editing the `load_and_transform_images()` function in `batch_wise_pipeline.py`. The function should return a tensor of preprocessed images and a tensor of labels (class ids).
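A minimal sketch of such a replacement, assuming an ImageFolder-style directory layout and reusing the model's own preprocessing transforms (the dataset path is a placeholder, and note that `ImageFolder` labels are folder indices, which must correspond to the model's class ids):

```python
import torch
from torchvision import datasets
from torchvision.models import ResNet50_Weights

def load_and_transform_images():
    """Return a tensor of preprocessed images and a tensor of labels (class ids)."""
    preprocess = ResNet50_Weights.DEFAULT.transforms()
    dataset = datasets.ImageFolder("path/to/your/dataset", transform=preprocess)
    images = torch.stack([image for image, _ in dataset])
    labels = torch.tensor([label for _, label in dataset])
    return images, labels
```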
Other models can be used by modifying the following code (using any PyTorch model):
import foolbox as fb
import torch
from torchvision.models import ResNet50_Weights, resnet50

# Pretrained ResNet-50 in evaluation mode, plus its preprocessing transforms
device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
weights = ResNet50_Weights.DEFAULT
model = resnet50(weights=weights).eval().to(device)
preprocess = weights.transforms()
# Bounds must cover the value range of the (normalized) input tensors
bounds = (-2.4, 2.8)
fmodel = fb.PyTorchModel(model, bounds=bounds, preprocessing=None)
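For example, swapping in a different torchvision classifier (ResNet-18 here, purely as an illustration) only requires changing the weights/constructor pair; `device`, `bounds`, and `fb` are reused from the snippet above, and `bounds` may need adjusting if the preprocessing differs:

```python
from torchvision.models import ResNet18_Weights, resnet18

weights = ResNet18_Weights.DEFAULT
model = resnet18(weights=weights).eval().to(device)
preprocess = weights.transforms()
fmodel = fb.PyTorchModel(model, bounds=bounds, preprocessing=None)
```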