Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/v-iashin/SpecVQGAN
Source code for "Taming Visually Guided Sound Generation" (Oral at the BMVC 2021)
https://github.com/v-iashin/SpecVQGAN
audio audio-generation bmvc evaluation-metrics gan melgan multi-modal pytorch transformer vas vggsound video video-features video-understanding vqvae
Last synced: 7 days ago
JSON representation
Source code for "Taming Visually Guided Sound Generation" (Oral at the BMVC 2021)
- Host: GitHub
- URL: https://github.com/v-iashin/SpecVQGAN
- Owner: v-iashin
- License: mit
- Created: 2021-10-17T11:20:59.000Z (about 3 years ago)
- Default Branch: main
- Last Pushed: 2024-07-12T09:05:58.000Z (4 months ago)
- Last Synced: 2024-08-02T16:51:53.798Z (3 months ago)
- Topics: audio, audio-generation, bmvc, evaluation-metrics, gan, melgan, multi-modal, pytorch, transformer, vas, vggsound, video, video-features, video-understanding, vqvae
- Language: Jupyter Notebook
- Homepage: https://v-iashin.github.io/SpecVQGAN
- Size: 163 MB
- Stars: 335
- Watchers: 8
- Forks: 37
- Open Issues: 9
-
Metadata Files:
- Readme: README.md
- License: LICENSE