https://github.com/muthukamalan/dogbreedsclassifier
Helps to classify dog breeds.
- Host: GitHub
- URL: https://github.com/muthukamalan/dogbreedsclassifier
- Owner: Muthukamalan
- License: mpl-2.0
- Created: 2024-11-21T07:27:29.000Z (7 months ago)
- Default Branch: main
- Last Pushed: 2024-12-19T19:01:16.000Z (6 months ago)
- Last Synced: 2025-01-01T23:18:58.193Z (6 months ago)
- Topics: aws, docker, fastapi, gradio, html, huggingface, hydra, lightning, onnx, prouction, pytorch, torchscript, torchserve
- Language: Python
- Homepage:
- Size: 6.21 MB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE
# DogBreedsClassifier

[pre-commit](https://github.com/pre-commit/pre-commit) | [PyTorch](https://pytorch.org/get-started/locally/) | [PyTorch Lightning](https://pytorchlightning.ai/) | [Hydra](https://hydra.cc/) | [Black](https://black.readthedocs.io/en/stable/) | [isort](https://pycqa.github.io/isort/) | [codecov](https://codecov.io/gh/ashleve/lightning-hydra-template) | [license](https://github.com/ashleve/lightning-hydra-template#license)

## Main Technologies
- [PyTorch Lightning](https://github.com/PyTorchLightning/pytorch-lightning) - a lightweight PyTorch wrapper for high-performance AI research. Think of it as a framework for organizing your PyTorch code.
- [Hydra](https://github.com/facebookresearch/hydra) - a framework for elegantly configuring complex applications. The key feature is the ability to dynamically create a hierarchical configuration by composition and override it through config files and the command line (see the sketch after this list).
- [DVC](https://dvc.org/) - a tool designed to handle large datasets and machine learning models in a version-controlled workflow.
- [TensorBoard](https://www.tensorflow.org/tensorboard) - provides visualization and debugging capabilities for TensorFlow and PyTorch experiments; a popular choice for monitoring training runs in real time.
- [AWS | EC2 | S3 | Lambda | ECR](https://aws.amazon.com/ec2/) - AWS services: EC2 for scalable virtual compute in the cloud, with S3, Lambda, and ECR covering object storage, serverless functions, and container images.
- [Docker](https://www.docker.com/) - a platform for creating, deploying, and managing lightweight, portable, and scalable containers.
- [Gradio](https://www.gradio.app/) - a Python library for building simple, interactive web interfaces for machine learning models and APIs.
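
A minimal sketch of how these pieces come together at the command line, assuming the entry point is `src/train.py` and the config group/option names mirror the `configs/` directory shown under Project Structure (the exact defaults in `train.yaml` may differ):

```sh
# Compose a run from Hydra config groups present under configs/:
# trainer/gpu.yaml, model/timm_classify.yaml, experiment/finetune.yaml
python src/train.py trainer=gpu model=timm_classify experiment=finetune

# Any value can be overridden inline from the CLI, e.g. a short smoke-test run
python src/train.py trainer=cpu trainer.max_epochs=1
```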
## Workflows
- [ci-train](https://github.com/Muthukamalan/DogBreedsClassifier/actions/workflows/ci-train.yml)
- [ci-eval](https://github.com/Muthukamalan/DogBreedsClassifier/actions/workflows/ci-eval.yml)
- [ci-infer](https://github.com/Muthukamalan/DogBreedsClassifier/actions/workflows/ci-infer.yml)
- [cd-codecov](https://github.com/Muthukamalan/DogBreedsClassifier/actions/workflows/cd-codecov.yml)
- [cd-deploy](https://github.com/Muthukamalan/DogBreedsClassifier/actions/workflows/cd-deploy.yml)

## Project Structure
```bash
.
├── .devcontainer <- VS Code dev container
│ └── devcontainer.json
|
├── .github <- Github Actions workflows
│ ├── ci-eval.yml
│ ├── ci-codecov.yml
│ ├── ci-test.yml
│ ├── ci-train.yml
│ └── ci-deploy.yml
├── assets
│ ├── hparams-artifacts.png
│ ├── MambaOutHparamSearch.png
│ ├── MambaOutHparamsTestScores.png
│ ├── OptunaHparams.png
│ ├── runner-ec2-training.png
│ └── self-hosted-runners.png
|
├── configs <- Hydra configs
│ ├── callbacks <- callback config
│ │ ├── default.yaml
│ │ ├── early_stopping.yaml
│ │ ├── learning_rate_monitor.yaml
| │ ├── model_checkpoint.yaml
│ │ ├── model_summary.yaml
│ │ ├── none.yaml
│ │ └── rich_progress_bar.yaml
│ ├── data <- data config
│ │ └── dogs.yaml
│ ├── debug <- debug config
│ │ ├── default.yaml
│ │ ├── fdr.yaml
│ │ ├── limit.yaml
│ │ ├── overfit.yaml
│ │ └── profiler.yaml
│ ├── experiment <- experiment config
│ │ └── finetune.yaml
│ ├── extras <- extras config
│ │ └── default.yaml
│ ├── hparams_search <- hparams config
│ │ └── mnist_optuna.yaml
│ ├── hydra <- hydra config
│ │ └── default.yaml
│ ├── logger <- logger config
│ │ ├── aim.yaml
│ │ ├── comet.yaml
│ │ ├── csv.yaml
│ │ ├── default.yaml
│ │ ├── many_loggers.yaml
│ │ ├── mlflow.yaml
│ │ ├── neptune.yaml
│ │ ├── tensorboard.yaml
│ │ └── wandb.yaml
│ ├── model <- model config
│ │ ├── mamba.yaml
│ │ ├── mnist.yaml
│ │ └── timm_classify.yaml
│ ├── paths <- path config
│ │ └── default.yaml
│ ├── trainer <- trainer config
│ │ ├── cpu.yaml
│ │ ├── ddp_sim.yaml
│ │ ├── ddp.yaml
│ │ ├── default.yaml
│ │ ├── gpu.yaml
│ │ └── mps.yaml
│ ├── __init__.py
│ ├── eval.yaml <- evaluation config
│ └── train.yaml <- training config
|
├── data <- DATASET
│ ├── dogs_dataset
│ │ ├── test
│ │ ├── train
│ │ └── validation
│ └── dogs_dataset.dvc
├── dvc.lock
├── dvc.yaml <- DVC
├── environment.yaml <- conda export `conda env export|grep -v "^prefix: " > environment.yml`
|
|
├── LICENSE
├── logs <- Logs generated by hydra and lightning loggers
├── multirun <- Logs for Hparams Search
├── outputs <- Logs for eval/fastrun
|
├── notebooks <- Jupyter notebooks
|
├── reports
│ ├── lr-Adam.png
│ ├── test-report.png
│ ├── train-report.png
│ └── val-report.png
|
|
├── samples <- inference
│ ├── checkpoints
│ │ └── epoch_019.ckpt
│ ├── inputs
│ │ ├── guess1.jpg
│ │ └── guess2.jpg
│ └── outputs
|
├── scripts <- Shell scripts
├── setup.py
|
├── src
│ ├── datamodules
│ │ └── dogs_datamodule.py
│ ├── models
│ │ └── dogs_classifier.py
│ ├── utils
│ │ ├── __init__.py
│ │ ├── instantiators.py
│ │ ├── logging_utils.py
│ │ ├── pylogger.py
│ │ ├── rich_utils.py
│ │ └── utils.py
│ ├── __init__.py
│ ├── inference.py
│ ├── train.py
| └── eval.py
|
├── gradio <- Gradio app for Hugging Face Space
│ ├── .gradio/workflows
│ │ └── update-space.yaml
│ ├── examples <- examples
│ │ ├── guess1.jpg
│ │ └── guess2.jpg
│ ├── app.py
│ ├── best_model.pt
│ ├── dvc.lock
│ ├── README.md
│ └── requirements.txt
|
├── tests <- Pytest
│ ├── datamodules
│ │ └── test_dogs_datamodule.py
│ ├── models
│ │ └── test_dogs_classifier.py
│ ├── test_eval.py
│ └── test_train.py
│
├── Makefile
├── requirements.txt <- requirements+GPU
├── requirements.txt.cpu <- requirements+CPU
|
├── Dockerfile <- Dockerfile+GPU
├── Dockerfile.cpu <- Dockerfile+CPU
├── compose.yml <- docker-compose
│
├── pyproject.toml
├── ruff.toml <- ruff check --fix
├── pytest.ini <- pytest config
|
├── .env
├── coverage.xml
|
└── README.md

79 directories, 107 files
```

## Logs
Hydra creates a new output directory for every executed run.
Default logging structure:
```log
├── logs
│ ├── task_name
│ │ ├── runs # Logs generated by single runs
│ │ │ ├── YYYY-MM-DD_HH-MM-SS # Datetime of the run
│ │ │ │ ├── .hydra # Hydra logs
│ │ │ │ ├── csv # Csv logs
│ │ │ │ ├── wandb # Weights&Biases logs
│ │ │ │ ├── checkpoints # Training checkpoints
│ │ │ │ └── ... # Any other thing saved during training
│ │ │ └── ...
│ │ │
│ │ └── multiruns # Logs generated by multiruns
│ │ ├── YYYY-MM-DD_HH-MM-SS # Datetime of the multirun
│ │ │ ├──1 # Multirun job number
│ │ │ ├──2
│ │ │ └── ...
│ │ └── ...
│ │
│ └── debugs # Logs generated when debugging config is attached
│ └── ...
```

## Data Setup
Provide the credentials below in a `.env` file at the project root:
```.env
#AWS
AWS_ACCESS_KEY_ID=
AWS_SECRET_ACCESS_KEY=
#Dockerhub
DOCKER_USERNAME=
DOCKER_PASSWORD=
#Code-coverage
CODECOV=
#HF
HF_TOKEN=
```
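
With the AWS keys in place, the DVC-tracked dataset can be pulled from the remote. A hedged sketch (the remote itself is defined in the repo's DVC config, which is not reproduced here):

```sh
# Load the credentials from .env (simple KEY=VALUE lines, comments skipped)
export $(grep -v '^#' .env | xargs)

# Fetch the dataset tracked by data/dogs_dataset.dvc
dvc pull data/dogs_dataset.dvc
```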
## Runner Setup
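A minimal sketch of registering an EC2 instance as a self-hosted GitHub Actions runner for the training workflows; the runner version and registration token below are placeholders from GitHub's standard runner setup, not values taken from this repo:

```sh
# Download and unpack the GitHub Actions runner on the EC2 box
curl -o actions-runner.tar.gz -L \
  https://github.com/actions/runner/releases/download/v2.320.0/actions-runner-linux-x64-2.320.0.tar.gz
tar xzf actions-runner.tar.gz

# Register against the repository and start listening for jobs
./config.sh --url https://github.com/Muthukamalan/DogBreedsClassifier --token <REGISTRATION_TOKEN>
./run.sh
```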
## Clean
```sh
make trash
make clean
```

## Training
#### fastrun
Trains a simple model as a quick end-to-end check:
```sh
make fastrun
make sshow
```

### Hparams Search: Optuna
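A hedged way to launch the sweep through Hydra's multirun mode; the only option shown in the `hparams_search` config group is `mnist_optuna.yaml` (carried over from the template), so the name below may need adapting:

```sh
# Optuna hyperparameter sweep; per-job logs land under multirun/
python src/train.py --multirun hparams_search=mnist_optuna
```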
##### Loss & Accuracy Curves
- Train DataLoader
- Val DataLoader
- Test DataLoader

#### Learning Rate
### Artifacts in S3 🪣
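One hedged way to get run artifacts into a bucket with the AWS CLI; the bucket name and prefix are placeholders, since the repo's actual upload step is not reproduced here:

```sh
# Sync local logs and checkpoints to S3 (bucket and prefix are placeholders)
aws s3 sync logs/ s3://<your-bucket>/dogbreedsclassifier/logs/
```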
## Test - PyTest
```sh
make test
============================================================================== test session starts ==============================================================================
platform linux -- Python 3.11.9, pytest-8.3.3, pluggy-1.5.0
rootdir: /home/muthu/GitHub/DogBreedsClassifier
configfile: pytest.ini
plugins: cov-5.0.0, anyio-3.7.1, time-machine-2.15.0, hydra-core-1.3.2
collected 6 items

tests/datamodules/test_dogs_datamodule.py ...                                                                                                                        [ 50%]
tests/models/test_dogs_classifier.py .                                                                                                                               [ 66%]
tests/test_eval.py .                                                                                                                                                 [ 83%]
tests/test_train.py .                                                                                                                                                [100%]

=========================================================================================== warnings summary ============================================================================================
../../miniconda3/envs/venv/lib/python3.11/site-packages/jupyter_client/connect.py:22
  /home/muthu/miniconda3/envs/venv/lib/python3.11/site-packages/jupyter_client/connect.py:22: DeprecationWarning: Jupyter is migrating its paths to use standard platformdirs
  given by the platformdirs library. To remove this warning and
  see the appropriate new directories, set the environment variable
  `JUPYTER_PLATFORM_DIRS=1` and then run `jupyter --paths`.
  The use of platformdirs will be the default in `jupyter_core` v6
    from jupyter_core.paths import jupyter_data_dir, jupyter_runtime_dir, secure_write

tests/test_eval.py::test_catdog_ex_testing
tests/test_train.py::test_catdog_ex_training
  /home/muthu/miniconda3/envs/venv/lib/python3.11/site-packages/lightning/fabric/connector.py:571: `precision=16` is supported for historical reasons but its usage is discouraged. Please set your precision to 16-mixed instead!

tests/test_train.py::test_catdog_ex_training
  /home/muthu/miniconda3/envs/venv/lib/python3.11/site-packages/lightning/pytorch/loops/fit_loop.py:298: The number of training batches (8) is smaller than the logging interval Trainer(log_every_n_steps=50). Set a lower value for log_every_n_steps if you want to see logs for the training epoch.

tests/test_train.py::test_catdog_ex_training
  /home/muthu/miniconda3/envs/venv/lib/python3.11/site-packages/torch/optim/lr_scheduler.py:224: UserWarning: Detected call of `lr_scheduler.step()` before `optimizer.step()`. In PyTorch 1.1.0 and later, you should call them in the opposite order: `optimizer.step()` before `lr_scheduler.step()`. Failure to do this will result in PyTorch skipping the first value of the learning rate schedule. See more details at https://pytorch.org/docs/stable/optim.html#how-to-adjust-learning-rate
    warnings.warn(

-- Docs: https://docs.pytest.org/en/stable/how-to/capture-warnings.html
==================================================================================== 6 passed, 5 warnings in 33.11s =====================================================================================
```

## Eval
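A hedged example of evaluating a saved checkpoint with the `eval.yaml` config; `ckpt_path` is the usual override in this template, but the exact key may differ here:

```sh
# Evaluate the checkpoint bundled under samples/ against the test split
python src/eval.py ckpt_path=samples/checkpoints/epoch_019.ckpt
```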
#### Confusion Matrix

| Train Matrix | Val Matrix | Test Matrix |
|--------------|------------|-------------|

## Prediction
```
args:
  --input_folder    # folder of images to run inference on
  --output_folder   # where to save the prediction images
  --ckpt_path       # model checkpoint to load
```
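
A hedged invocation, assuming `src/inference.py` is the script that consumes these arguments; the paths come from the `samples/` directory in the project tree:

```sh
python src/inference.py \
  --input_folder samples/inputs \
  --output_folder samples/outputs \
  --ckpt_path samples/checkpoints/epoch_019.ckpt
```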
## Clean
```sh
make trash
make clean
```

## Inference
```log
"conv_ratio": 1.2
"depths": [3, 3, 15, 3]
"dims": [6, 12, 24, 36]
"head_fn": default
"in_chans": 3
"lr": 0.001
"min_lr": 1e-06
"model_name": Mamba
"num_classes": 10
"pretrained": False
"scheduler_factor": 0.1
"scheduler_patience": 5
"trainable": False
"weight_decay": 1e-05

2024-11-10 20:22:17 | INFO | utils.logging_utils:wrapper:22 - Starting load_image
2024-11-10 20:22:17 | INFO | utils.logging_utils:wrapper:25 - Finished load_image
2024-11-10 20:22:17 | INFO | utils.logging_utils:wrapper:22 - Starting infer
2024-11-10 20:22:17 | INFO | utils.logging_utils:wrapper:25 - Finished infer
2024-11-10 20:22:17 | INFO | utils.logging_utils:wrapper:22 - Starting save_prediction_image
2024-11-10 20:22:17 | INFO | utils.logging_utils:wrapper:25 - Finished save_prediction_image
Processed guess2.jpg: Poodle (0.89)

2024-11-10 20:22:17 | INFO | utils.logging_utils:wrapper:22 - Starting load_image
2024-11-10 20:22:17 | INFO | utils.logging_utils:wrapper:25 - Finished load_image
2024-11-10 20:22:17 | INFO | utils.logging_utils:wrapper:22 - Starting infer
2024-11-10 20:22:17 | INFO | utils.logging_utils:wrapper:25 - Finished infer
2024-11-10 20:22:17 | INFO | utils.logging_utils:wrapper:22 - Starting save_prediction_image
2024-11-10 20:22:17 | INFO | utils.logging_utils:wrapper:25 - Finished save_prediction_image
Processed guess1.jpg: Boxer (0.96)

2024-11-10 20:22:17 | INFO | utils.logging_utils:wrapper:25 - Finished main
```

## Gradio
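The `gradio/` folder holds the Hugging Face Space app. A minimal sketch of running it locally, assuming `app.py` loads `best_model.pt` from the same folder as laid out in the project tree:

```sh
cd gradio
pip install -r requirements.txt
python app.py   # Gradio serves on http://localhost:7860 by default
```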
