Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/yas-sim/openvino-sound-classification-demo-rt

Real-time version of sound_classification_demo in OpenVINO toolkit. Captures audio from microphone, do classification, and display result on the screen with illustration.
https://github.com/yas-sim/openvino-sound-classification-demo-rt

audio-classification deep-learning deep-learning-demo demo intel openvino python real-time sound-classification

Last synced: about 2 months ago
JSON representation

Real-time version of sound_classification_demo in OpenVINO toolkit. Captures audio from microphone, do classification, and display result on the screen with illustration.

Host: GitHub
URL: https://github.com/yas-sim/openvino-sound-classification-demo-rt
Owner: yas-sim
Created: 2021-07-19T05:33:29.000Z (over 3 years ago)
Default Branch: master
Last Pushed: 2021-07-28T07:45:13.000Z (over 3 years ago)
Last Synced: 2024-04-23T00:09:53.802Z (9 months ago)
Topics: audio-classification, deep-learning, deep-learning-demo, demo, intel, openvino, python, real-time, sound-classification
Language: Python
Homepage:
Size: 1.42 MB
Stars: 10
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

README

# Real-time version of 'sound_classification_demo.py'

### Description:
This is a real-time version of `sound_classificatioin_demo` in the OpenVINO toolkit. The original demo takes an audio file and show the classification result to the console. I modified the demo program. Now the demo program can capture audio stream from microphone and show the classified result on the display with illustration in real-time.
OpenVINO付属の`sound_classification_demo`を改造してリアルタイム版にしたデモです。オリジナルのデモはフィルから音声データを読み込み、推論結果を画面に表示するものでした。これを改造し、マイクから音声を読み込み、リアルタイムに分類し、結果を(ヘタクソな)イラストとともに画面に表示するようにしました。

![image1](resources/result_img.gif)

### Prerequisites:

- OpenVINO 2021.4
- Python modules: `pyaudio`, `numpy`, `opencv-python`
- (Ubuntu) portaudio : `portaudio19-dev`

### How to run:

1. Install Intel OpenVINO toolkit 2021.4
[Download](https://software.intel.com/content/www/us/en/develop/tools/openvino-toolkit/download.html)
[Get Started Guide](https://docs.openvinotoolkit.org/latest/get_started_guides.html)

2. Install Python prerequisites
```sh
python -m pip install --upgrade pip setuptools
python -m pip install -r requirements.in
```
Note: Ubuntu may need to install `portaudio` library. `sudo apt install portaudio19-dev`

3. Download required DL models
```sh
python %INTEL_OPENVINO_DIR%\deployment_tools\open_model_zoo\tools\downloader\downloader.py ^
--list models.lst
python %INTEL_OPENVINO_DIR%\deployment_tools\open_model_zoo\tools\downloader\converter.py ^
--list models.lst --precisions FP16
```

4. Run the demo

```sh
python sound_classification_demo-rt.py ^
-m aclnet.xml ^
--labels .\data\aclnet_53cl.txt ^
--illustration_dir .\data\acl_img
```
Note: Python 3.6 may cause an error on load_network().

### Note:
Tested on OpenVINO 2021.4 (Win10, Ubuntu 20.04)