Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/ksasi/see-with-sound
https://github.com/ksasi/see-with-sound
Last synced: 22 days ago
JSON representation
- Host: GitHub
- URL: https://github.com/ksasi/see-with-sound
- Owner: ksasi
- Created: 2024-03-29T12:54:22.000Z (7 months ago)
- Default Branch: main
- Last Pushed: 2024-03-31T05:13:10.000Z (7 months ago)
- Last Synced: 2024-04-20T00:49:26.211Z (7 months ago)
- Language: Jupyter Notebook
- Size: 3.25 MB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# See-with-Sound
![Made With python 3.11.5](https://img.shields.io/badge/Made%20with-Python%203.11.5-brightgreen)![pytorch](https://img.shields.io/badge/Made%20with-pytorch-green.svg)![librosa](https://img.shields.io/badge/Made_with-librosa-blue)![OpenCV](https://img.shields.io/badge/Made_with-OpenCV-orange)
### Code:
Below are the step to setup the code and perform training
### Setup:
After setting up the code as below, update the paths appropriately
> git clone https://github.com/ksasi/See-with-Sound.git
### Install Dependencies:
> cd See-with-Sound
>
> pip install -r requirements.txt### Dataset:
- Download [Food-101](https://data.vision.ee.ethz.ch/cvl/datasets_extra/food-101/) dataset
- Execute ipython notebook `Minor_Project_Data_Curation.ipynb` to generate audio samples for each category
- Execute ipython notebook `Minor_Project_DataSet_Setup.ipynb` to **setup food-101-small** dataset
- The **setup food-101-small** dataset consists of `Train`, `Probe`, `Gallery` and `Other` foldersDataset **setup food-101-small** structure :
```
food-101-small/
Train/
/
.jpg
.jpg
...
...
...
Probe/
/
.wav
Gallery/
/
.jpg
.jpg
...
...
...
Other/
No_Image_Available.jpg
```### Training:
After updating the paths, train `SGDClassifier` incrementally as below :
> nohup python model_train.py &
### Evaluation:
The trained model can be evaluated as below :
> nohup python evaluate.py &
### Results:
The CMC Curve of Probe and Gallery is shown below :
![image](cmc_curve.png)
Rank1 Identification Accuracy: 87.097%
### Demo:
Demo of Image search from audio input can be executed by running `Audio_Image_Search_Demo.ipynb` ipython notebook
![Demo1](SC1.png)
![Demo2](SC2.png)### References
The code is adapted from the following repositories:
- Low Resolution face recognition - [Github Link](https://github.com/ksasi/face-recognition)