https://github.com/zimmerrol/vggvox-pytorch
Implementation of the VGGVox network in PyTorch.
https://github.com/zimmerrol/vggvox-pytorch
Last synced: about 1 year ago
JSON representation
Implementation of the VGGVox network in PyTorch.
- Host: GitHub
- URL: https://github.com/zimmerrol/vggvox-pytorch
- Owner: zimmerrol
- License: mit
- Created: 2018-08-27T05:23:10.000Z (almost 8 years ago)
- Default Branch: master
- Last Pushed: 2018-08-27T05:38:33.000Z (almost 8 years ago)
- Last Synced: 2025-05-07T23:43:20.575Z (about 1 year ago)
- Language: Python
- Homepage:
- Size: 5.86 KB
- Stars: 8
- Watchers: 1
- Forks: 3
- Open Issues: 1
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# VGGVox for `pytorch`
Implementation of the `VGGVox` network using `pytorch`. The implementation is based on the descriptions given in the papers
-A. Nagrani, J. S. Chung, A. Zisserman, VoxCeleb: a large-scale speaker identification dataset, INTERSPEECH, 2017
-S. Albanie, A. Nagrani, A. Vedaldi: Emotion Recognition in Speech using Cross-Modal Transfer in the Wild
# Content
This repository contains the implementation of the `VGGVox` network itself, some utility functions for audio processing and an example `DataLoader` for audio files.