Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/ksasi/see-with-sound


https://github.com/ksasi/see-with-sound

Last synced: 17 days ago
JSON representation

Awesome Lists containing this project

README

        

# See-with-Sound

![Made With python 3.11.5](https://img.shields.io/badge/Made%20with-Python%203.11.5-brightgreen)![pytorch](https://img.shields.io/badge/Made%20with-pytorch-green.svg)![librosa](https://img.shields.io/badge/Made_with-librosa-blue)![OpenCV](https://img.shields.io/badge/Made_with-OpenCV-orange)

### Code:

Below are the step to setup the code and perform training

### Setup:

After setting up the code as below, update the paths appropriately

> git clone https://github.com/ksasi/See-with-Sound.git

### Install Dependencies:

> cd See-with-Sound
>
> pip install -r requirements.txt

### Dataset:

- Download [Food-101](https://data.vision.ee.ethz.ch/cvl/datasets_extra/food-101/) dataset
- Execute ipython notebook `Minor_Project_Data_Curation.ipynb` to generate audio samples for each category
- Execute ipython notebook `Minor_Project_DataSet_Setup.ipynb` to **setup food-101-small** dataset
- The **setup food-101-small** dataset consists of `Train`, `Probe`, `Gallery` and `Other` folders

Dataset **setup food-101-small** structure :

```
food-101-small/
Train/
/
.jpg
.jpg
...
...
...
Probe/
/
.wav
Gallery/
/
.jpg
.jpg
...
...
...
Other/
No_Image_Available.jpg
```

### Training:

After updating the paths, train `SGDClassifier` incrementally as below :

> nohup python model_train.py &

### Evaluation:

The trained model can be evaluated as below :

> nohup python evaluate.py &

### Results:

The CMC Curve of Probe and Gallery is shown below :

![image](cmc_curve.png)

Rank1 Identification Accuracy: 87.097%

### Demo:

Demo of Image search from audio input can be executed by running `Audio_Image_Search_Demo.ipynb` ipython notebook

![Demo1](SC1.png)
![Demo2](SC2.png)

### References

The code is adapted from the following repositories:

- Low Resolution face recognition - [Github Link](https://github.com/ksasi/face-recognition)