# Inception Net Implementation from Scratch using PyTorch

This repository contains an implementation of the Inception network (GoogLeNet) from scratch using PyTorch. The Inception architecture is a convolutional neural network (CNN) originally proposed by Szegedy et al. in the paper ["Going Deeper with Convolutions"](https://arxiv.org/abs/1409.4842).

![Inception Architecture](./assets/Inception_archpng.png)

## Overview

The Inception architecture introduces a novel approach to convolutional neural networks: within each block, convolutions of several kernel sizes (1x1, 3x3, 5x5) and a pooling branch are applied in parallel and their outputs are concatenated along the channel dimension. This allows the network to capture features at different scales, leading to improved performance on image classification tasks.
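
As a rough illustration of this parallel-branch idea (a standalone toy snippet, not code taken from this repository), each branch processes the same input and the results are concatenated along the channel dimension:

```python
import torch
import torch.nn as nn

# Toy example: four parallel branches whose outputs are concatenated
# along the channel dimension, as in an Inception block.
x = torch.randn(1, 192, 28, 28)                                        # (batch, channels, H, W)
branch1 = nn.Conv2d(192, 64, kernel_size=1)(x)                          # 1x1 conv
branch2 = nn.Conv2d(192, 128, kernel_size=3, padding=1)(x)              # 3x3 conv
branch3 = nn.Conv2d(192, 32, kernel_size=5, padding=2)(x)               # 5x5 conv
branch4 = nn.Conv2d(192, 32, kernel_size=1)(                            # pool followed by 1x1 conv
    nn.MaxPool2d(3, stride=1, padding=1)(x))
out = torch.cat([branch1, branch2, branch3, branch4], dim=1)             # 64 + 128 + 32 + 32 = 256 channels
print(out.shape)  # torch.Size([1, 256, 28, 28])
```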

### Model Architecture

Below is a simplified diagram of the Inception Block, which is the core component of the network:

![Inception Block](https://production-media.paperswithcode.com/methods/Screen_Shot_2020-06-22_at_3.22.39_PM.png)

The full network is composed of several such blocks stacked together:

1. **Initial Convolutional Layer:** A basic convolutional layer to process the input image.
2. **Inception Blocks:** A series of Inception Blocks that progressively extract features.
3. **Pooling Layers:** Max pooling layers to reduce the spatial dimensions.
4. **Fully Connected Layer:** A final fully connected layer for classification.

### Code Structure

- **ConvBlock:** A basic convolutional block consisting of a convolutional layer followed by batch normalization and ReLU activation.
- **InceptionBlock:** Implements the Inception module, combining 1x1, 3x3, 5x5 convolutions, and max pooling in parallel.
- **Inception:** The full Inception network, combining multiple Inception blocks.
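
For illustration, here is a minimal sketch of what the `ConvBlock` and `InceptionBlock` modules described above could look like; the exact constructor arguments and layer sizes in `inception_net.py` may differ:

```python
import torch
import torch.nn as nn

class ConvBlock(nn.Module):
    """Conv2d -> BatchNorm -> ReLU, as described in the Code Structure section."""
    def __init__(self, in_channels, out_channels, **kwargs):
        super().__init__()
        self.conv = nn.Conv2d(in_channels, out_channels, bias=False, **kwargs)
        self.bn = nn.BatchNorm2d(out_channels)
        self.relu = nn.ReLU(inplace=True)

    def forward(self, x):
        return self.relu(self.bn(self.conv(x)))

class InceptionBlock(nn.Module):
    """Four parallel branches (1x1, 1x1->3x3, 1x1->5x5, pool->1x1) concatenated on channels.
    The argument names below are assumptions, not necessarily the repository's exact signature."""
    def __init__(self, in_channels, out_1x1, red_3x3, out_3x3, red_5x5, out_5x5, out_pool):
        super().__init__()
        self.branch1 = ConvBlock(in_channels, out_1x1, kernel_size=1)
        self.branch2 = nn.Sequential(
            ConvBlock(in_channels, red_3x3, kernel_size=1),
            ConvBlock(red_3x3, out_3x3, kernel_size=3, padding=1),
        )
        self.branch3 = nn.Sequential(
            ConvBlock(in_channels, red_5x5, kernel_size=1),
            ConvBlock(red_5x5, out_5x5, kernel_size=5, padding=2),
        )
        self.branch4 = nn.Sequential(
            nn.MaxPool2d(kernel_size=3, stride=1, padding=1),
            ConvBlock(in_channels, out_pool, kernel_size=1),
        )

    def forward(self, x):
        branches = (self.branch1, self.branch2, self.branch3, self.branch4)
        return torch.cat([branch(x) for branch in branches], dim=1)
```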

## Installation

To run this code, you need to have Python and PyTorch installed. You can install the required libraries using:

```bash
pip install torch
```

## Usage

To test the network, you can run the `main()` function in the `inception_net.py` file. The `main()` function initializes the network, creates a random input tensor, and prints the output size.

```bash
python inception_net.py
```

Example output:

```
y.size() = torch.Size([3, 10])
```
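
For reference, a `main()` of that kind might look roughly like the following; the constructor signature `Inception(in_channels, num_classes)` and the input resolution are assumptions and may not match the repository exactly:

```python
import torch

def main():
    device = torch.device('cuda' if torch.cuda.is_available() else 'cpu')
    model = Inception(in_channels=3, num_classes=10).to(device)  # Inception is defined in inception_net.py; signature assumed
    x = torch.randn(3, 3, 224, 224, device=device)               # batch of 3 RGB images; resolution assumed
    y = model(x)
    print(f'{y.size() = }')                                      # expected: torch.Size([3, 10])

if __name__ == '__main__':
    main()
```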

## Customization

You can customize the architecture by modifying the parameters of the Inception blocks in the `Inception` class. For example, you can change the number of output channels for each convolutional path to adjust the capacity of the network.
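
For instance, assuming the `InceptionBlock` constructor takes per-branch channel counts (as in the sketch above), widening one path of a block could look like this; the values are purely illustrative:

```python
# Purely illustrative: widen the 3x3 path of one block (assumed constructor order:
# in_channels, out_1x1, red_3x3, out_3x3, red_5x5, out_5x5, out_pool).
block = InceptionBlock(192, 64, 96, 256, 16, 32, 32)  # 3x3 path widened from 128 to 256 channels
```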

## Training the Model

This implementation is designed as a template. You can integrate it into your training pipeline by adding a dataset, loss function, and optimizer. Here's a simple example:

```python
# Example training loop (num_epochs and train_loader are assumed to be defined elsewhere)
import torch
import torch.nn as nn
import torch.optim as optim

# Initialize model, loss function, and optimizer
device = torch.device('cuda' if torch.cuda.is_available() else 'cpu')
model = Inception(3).to(device)
criterion = nn.CrossEntropyLoss()
optimizer = optim.Adam(model.parameters(), lr=0.001)

# Training loop
for epoch in range(num_epochs):
    for data in train_loader:
        inputs, labels = data
        inputs, labels = inputs.to(device), labels.to(device)

        optimizer.zero_grad()

        outputs = model(inputs)
        loss = criterion(outputs, labels)
        loss.backward()
        optimizer.step()

    print(f'Epoch {epoch+1}, Loss: {loss.item()}')
```

## References

- Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S., Anguelov, D., ... & Rabinovich, A. (2015). Going deeper with convolutions. In *Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition* (pp. 1-9). [arXiv:1409.4842](https://arxiv.org/abs/1409.4842)