Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/yas-sim/openvino-sound-classification-demo-rt
Real-time version of sound_classification_demo in OpenVINO toolkit. Captures audio from microphone, do classification, and display result on the screen with illustration.
https://github.com/yas-sim/openvino-sound-classification-demo-rt
audio-classification deep-learning deep-learning-demo demo intel openvino python real-time sound-classification
Last synced: about 2 hours ago
JSON representation
Real-time version of sound_classification_demo in OpenVINO toolkit. Captures audio from microphone, do classification, and display result on the screen with illustration.
- Host: GitHub
- URL: https://github.com/yas-sim/openvino-sound-classification-demo-rt
- Owner: yas-sim
- Created: 2021-07-19T05:33:29.000Z (over 3 years ago)
- Default Branch: master
- Last Pushed: 2021-07-28T07:45:13.000Z (over 3 years ago)
- Last Synced: 2024-04-23T00:09:53.802Z (7 months ago)
- Topics: audio-classification, deep-learning, deep-learning-demo, demo, intel, openvino, python, real-time, sound-classification
- Language: Python
- Homepage:
- Size: 1.42 MB
- Stars: 10
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Real-time version of 'sound_classification_demo.py'
### Description:
This is a real-time version of `sound_classificatioin_demo` in the OpenVINO toolkit. The original demo takes an audio file and show the classification result to the console. I modified the demo program. Now the demo program can capture audio stream from microphone and show the classified result on the display with illustration in real-time.
OpenVINO付属の`sound_classification_demo`を改造してリアルタイム版にしたデモです。オリジナルのデモはフィルから音声データを読み込み、推論結果を画面に表示するものでした。これを改造し、マイクから音声を読み込み、リアルタイムに分類し、結果を(ヘタクソな)イラストとともに画面に表示するようにしました。![image1](resources/result_img.gif)
### Prerequisites:
- OpenVINO 2021.4
- Python modules: `pyaudio`, `numpy`, `opencv-python`
- (Ubuntu) portaudio : `portaudio19-dev`### How to run:
1. Install Intel OpenVINO toolkit 2021.4
[Download](https://software.intel.com/content/www/us/en/develop/tools/openvino-toolkit/download.html)
[Get Started Guide](https://docs.openvinotoolkit.org/latest/get_started_guides.html)2. Install Python prerequisites
```sh
python -m pip install --upgrade pip setuptools
python -m pip install -r requirements.in
```
Note: Ubuntu may need to install `portaudio` library. `sudo apt install portaudio19-dev`3. Download required DL models
```sh
python %INTEL_OPENVINO_DIR%\deployment_tools\open_model_zoo\tools\downloader\downloader.py ^
--list models.lst
python %INTEL_OPENVINO_DIR%\deployment_tools\open_model_zoo\tools\downloader\converter.py ^
--list models.lst --precisions FP16
```4. Run the demo
```sh
python sound_classification_demo-rt.py ^
-m aclnet.xml ^
--labels .\data\aclnet_53cl.txt ^
--illustration_dir .\data\acl_img
```
Note: Python 3.6 may cause an error on load_network().### Note:
Tested on OpenVINO 2021.4 (Win10, Ubuntu 20.04)