Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/yas-sim/csm_voice_encode_synthesis_python
Experimental code for CSM voice synthesis + CSM data generation
audio-codec audio-processing composite-sinusoidal-modeling csm fm-sound vocoder voice voice-synthesis voice-synthesizer yamaha ym2203
- Host: GitHub
- URL: https://github.com/yas-sim/csm_voice_encode_synthesis_python
- Owner: yas-sim
- License: mit
- Created: 2022-09-18T12:22:35.000Z (about 2 years ago)
- Default Branch: main
- Last Pushed: 2022-09-19T14:13:10.000Z (about 2 years ago)
- Last Synced: 2024-04-23T00:09:50.968Z (7 months ago)
- Topics: audio-codec, audio-processing, composite-sinusoidal-modeling, csm, fm-sound, vocoder, voice, voice-synthesis, voice-synthesizer, yamaha, ym2203
- Language: Jupyter Notebook
- Homepage:
- Size: 14.5 MB
- Stars: 4
- Watchers: 2
- Forks: 1
- Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE.md
# CSM voice encode / synthesis experimental code
CSM stands for composite sinusoidal modeling, a speech synthesis algorithm.
Some YAMAHA FM sound devices support CSM, and several games used it to produce speech.

|program|description|
|-|-|
|`csm_encoding.ipynb`|Audio to CSM data converter. Reads an audio file and generates a CSM data file.|
|`csm_synthesis.ipynb`|CSM synthesizer. Reads a CSM data file and synthesizes an audio file from the CSM data.|
|`csm_synthesis_rt.py`|Real-time CSM synthesizer. Performs real-time CSM voice synthesis. You can change the key (frequency) of the playback sound with the keyboard, which makes it behave somewhat like a [vocoder](https://en.wikipedia.org/wiki/Vocoder). The program plays the sound in a loop until you press the 'q' key.|

### csm_synthesis_rt.py run example
```sh
>python csm_synthesis_rt.py
21.82 sec
max amplitude: 123.0367660522461
Real-time CSM audio synthesis.
Keyboard layout - You can change the key of the playing back sound.
| s d g h j |
|z x c v b n m ,|
Press 'q' to quit.
```
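
The two rows in the keyboard layout above resemble one octave of a piano keyboard, with the lower row as white keys and the upper row as black keys. The following is a minimal sketch of how such keys could be turned into pitch changes, assuming `z` plays the sound at its original pitch and each key is one equal-tempered semitone higher than the previous one. `KEY_TO_SEMITONE` and `pitch_ratio` are hypothetical names for illustration and are not taken from `csm_synthesis_rt.py`.

```python
# Assumed piano-style mapping: 'z' is the root, ',' is one octave above it.
KEY_TO_SEMITONE = {
    "z": 0, "s": 1, "x": 2, "d": 3, "c": 4, "v": 5,
    "g": 6, "b": 7, "h": 8, "n": 9, "j": 10, "m": 11, ",": 12,
}


def pitch_ratio(key):
    """Return the frequency multiplier for a key (equal temperament)."""
    return 2.0 ** (KEY_TO_SEMITONE[key] / 12.0)


if __name__ == "__main__":
    for key in "zsxdcvgbhnjm,":
        print(key, round(pitch_ratio(key), 4))
```

Multiplying every sinusoid frequency by this ratio would transpose the synthesized voice without changing its playback speed.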
### Audio samples:
- [apollo11-original](https://github.com/yas-sim/csm_voice_encode_synthesis_python/blob/main/resources/apollo11_launch.wav?raw=true)
- [apollo11-csm](https://github.com/yas-sim/csm_voice_encode_synthesis_python/blob/main/resources/apollo11_launch_out.wav?raw=true)
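
As a rough illustration of what the CSM version of the sample is made of: CSM approximates speech as a sum of a few sinusoids whose frequencies and amplitudes change from frame to frame. The sketch below shows that idea only. The frame format (a list of `(frequency_hz, amplitude)` pairs), the function name `synthesize_csm`, and the default `frame_len` are assumptions for illustration, not the data format or code used by the notebooks in this repository.

```python
import math


def synthesize_csm(frames, sample_rate=32000, frame_len=320):
    """Sum-of-sinusoids synthesis.

    frames: list of frames; each frame is a list of
            (frequency_hz, amplitude) pairs (assumed format).
    Returns a list of float samples.
    """
    n_voices = max((len(frame) for frame in frames), default=0)
    phases = [0.0] * n_voices  # keep phase continuous across frames
    out = []
    for frame in frames:
        for _ in range(frame_len):
            sample = 0.0
            for i, (freq, amp) in enumerate(frame):
                sample += amp * math.sin(phases[i])
                step = 2.0 * math.pi * freq / sample_rate
                phases[i] = (phases[i] + step) % (2.0 * math.pi)
            out.append(sample)
    return out


if __name__ == "__main__":
    # Two made-up frames with four sinusoids each.
    demo = [
        [(300.0, 0.5), (900.0, 0.3), (2200.0, 0.15), (3100.0, 0.05)],
        [(350.0, 0.5), (1000.0, 0.3), (2300.0, 0.15), (3200.0, 0.05)],
    ]
    print(len(synthesize_csm(demo)), "samples")
```

Carrying the phase of each sinusoid over from one frame to the next avoids clicks at frame boundaries when the frequencies change.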
### Memo:
Convert the audio data into a mono WAV file with a 32 kHz sample rate:
```sh
ffmpeg\bin\ffmpeg.exe -i input.wav -ar 32000 -ac 1 -f wav "output.wav"
```
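
To confirm that a converted file really is mono at 32 kHz, the header can be read with Python's standard `wave` module. This is a minimal sketch: `check_wav_format` is a hypothetical helper, and treating mono/32000 Hz as the required input format is an assumption based on the ffmpeg command above.

```python
import wave


def check_wav_format(path, expected_rate=32000, expected_channels=1):
    """Return True if the WAV file has the expected sample rate and channel count."""
    with wave.open(path, "rb") as wav:
        rate = wav.getframerate()
        channels = wav.getnchannels()
    return rate == expected_rate and channels == expected_channels


if __name__ == "__main__":
    import sys

    if len(sys.argv) > 1:
        print(check_wav_format(sys.argv[1]))
```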