This repository contains the solution for Assignment 1 of the Deep Learning course at the University of Tehran, focusing on image classification, adversarial attacks, and defensive techniques.
The project explores the robustness of ResNet models against noise and contrasts it with the performance of Vision Transformers (ViT). A significant part of the work involves implementing adversarial attacks (like FGSM) and evaluating defensive methods, specifically adversarial training.
This assignment was designed to provide hands-on experience with:
- Implementing and training standard models like ResNet on image datasets.
- Evaluating model robustness against simple perturbations like Gaussian Noise.
- Understanding and implementing Adversarial Attacks to exploit model vulnerabilities.
- Applying Defensive Techniques (e.g., Adversarial Training) to build more robust models.
- Fine-tuning and training Vision Transformers (ViT) and comparing their behavior to CNNs.
- `Q1.ipynb`: The main Jupyter Notebook containing all the code, training loops, attack implementations, and visualizations.
- `Q1.pdf`: The detailed Persian report explaining the theory, methodology, and results.
- `README.md`: This file.
We analyzed the models' performance not just on accuracy, but on why they make certain decisions, especially under attack.
We first established a baseline by training a ResNet model. We found that adding simple Gaussian noise significantly degraded performance, highlighting the sensitivity of standard models.
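The noise-robustness check described above can be sketched roughly as follows. This is a minimal illustration, not the notebook's exact code; the helper names and the `std` value are hypothetical.

```python
import torch


def add_gaussian_noise(images, std=0.1):
    """Perturb a batch of images with zero-mean Gaussian noise,
    then clamp back to the valid [0, 1] pixel range."""
    noisy = images + torch.randn_like(images) * std
    return noisy.clamp(0.0, 1.0)


@torch.no_grad()
def accuracy_under_noise(model, loader, std, device="cpu"):
    """Top-1 accuracy of `model` on `loader` with noise injected."""
    model.eval()
    correct = total = 0
    for x, y in loader:
        x, y = x.to(device), y.to(device)
        preds = model(add_gaussian_noise(x, std)).argmax(dim=1)
        correct += (preds == y).sum().item()
        total += y.numel()
    return correct / total
```

Sweeping `std` over a few values and plotting accuracy against it makes the degradation curve of the standard ResNet visible at a glance.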
We then trained two ViT models: one fine-tuned from pre-trained weights and one trained from scratch. The fine-tuned model achieved superior results, demonstrating the power of transfer learning.
[Image Placeholder]
This was the core of the project. We observed that standard models are extremely vulnerable to adversarial attacks, even when the perturbations are imperceptible to the human eye.
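A minimal sketch of the FGSM attack used in this kind of experiment: take one gradient of the loss with respect to the input and step each pixel by `epsilon` in the sign direction. The `epsilon` value here is illustrative, not necessarily the one used in the notebook.

```python
import torch
import torch.nn.functional as F


def fgsm_attack(model, images, labels, epsilon=0.03):
    """Fast Gradient Sign Method: x_adv = clamp(x + eps * sign(grad_x loss))."""
    images = images.clone().detach().requires_grad_(True)
    loss = F.cross_entropy(model(images), labels)
    loss.backward()
    adv = images + epsilon * images.grad.sign()
    return adv.clamp(0.0, 1.0).detach()
```

Because every pixel moves by at most `epsilon`, the perturbation stays visually negligible while often flipping the model's prediction.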
Our key result, shown through Grad-CAM, is that Adversarial Training fundamentally changes how the model "sees" an image.
- Standard Model (ViT-Finetuned): Focuses on small, high-frequency textures (e.g., a few specific petals). This is a "brittle" strategy.
- Defended Model (ViT-Finetuned-Adv): Learns to look at the overall, holistic shape of the object (e.g., the entire cluster of flowers). This is a much more robust and human-like strategy.
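The defense behind the robust model above, adversarial training, can be sketched as a single training step: craft FGSM examples on the fly, then update the model on the perturbed batch. This is a simplified single-step sketch; the notebook's attack strength, optimizer, and schedule may differ.

```python
import torch
import torch.nn.functional as F


def adversarial_train_step(model, optimizer, images, labels, epsilon=0.03):
    """One adversarial-training step: attack the current model, then
    minimize the loss on the resulting adversarial examples."""
    # Craft FGSM examples against the current weights.
    model.eval()  # keep batch-norm statistics frozen during the attack
    x = images.clone().detach().requires_grad_(True)
    F.cross_entropy(model(x), labels).backward()
    adv = (x + epsilon * x.grad.sign()).clamp(0.0, 1.0).detach()

    # Standard supervised update, but on the perturbed batch.
    model.train()
    optimizer.zero_grad()
    loss = F.cross_entropy(model(adv), labels)
    loss.backward()
    optimizer.step()
    return loss.item()
```

Training on worst-case perturbations pushes the model away from brittle high-frequency cues toward the holistic shape features seen in the Grad-CAM maps.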
[Image Placeholder]
To run this project locally, ensure you have the necessary libraries.
- Python 3.9+
- PyTorch
- Torchvision
- NumPy
- Matplotlib
- Tqdm
1. Clone the repository:

   ```bash
   git clone https://github.com/[YourUsername]/[Your-Repo-Name].git
   cd [Your-Repo-Name]
   ```

2. Create a virtual environment (recommended):

   ```bash
   python -m venv venv
   source venv/bin/activate  # On Windows: venv\Scripts\activate
   ```

3. Install dependencies:

   ```bash
   pip install torch torchvision numpy matplotlib tqdm jupyter
   ```
All the code is contained within the Jupyter Notebook:

```bash
jupyter notebook Q1.ipynb
```

You can run the cells sequentially to reproduce the training, attacks, and visualizations.
- Course: Deep Learning (Neural Networks) - University of Tehran
- Authors:
- Ali Ghorbani Bargani (810103209)
- Mobin Tirafkan (810103091)
This project is licensed under the MIT License.