https://github.com/abdouaziz/wavautoencoder

WavAutoEncoder
https://github.com/abdouaziz/wavautoencoder

asr masked ssl

Last synced: about 2 months ago
JSON representation

WavAutoEncoder

Host: GitHub
URL: https://github.com/abdouaziz/wavautoencoder
Owner: abdouaziz
License: mit
Created: 2022-11-24T21:37:10.000Z (over 2 years ago)
Default Branch: main
Last Pushed: 2023-05-08T10:33:18.000Z (about 2 years ago)
Last Synced: 2025-01-30T00:32:12.292Z (4 months ago)
Topics: asr, masked, ssl
Language: Python
Homepage:
Size: 34.2 KB
Stars: 0
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

        ## Masked WavAutoEncoder 

This repository is an implementation of  masked wavautoencoder based on Transformer architecture for self supervised speech representation . 

## Install

```bash

$ pip install wav-autoencoder

```

## Usage

Simple example for using the model

```python

import torch

from wav_autoencoder import WavAutoEncoderConfig ,  WavAutoEncoderModel 

>> config = WavAutoEncoderConfig()

>> model = WavAutoEncoderModel(config)

>> wav_signal = torch.randn(2, 1, 16000)

>> outputs = model(wav_signal)

>> print(outputs.shape)

```

## Todo

## Citations

```bibtex

@misc{

  title  = {WavAutoencoder: A Self-Supervised Framework for Learning Audio Representations},

  author = {Abdou Aziz DIOP},

  year   = {2022}

}

```

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/abdouaziz/wavautoencoder

Awesome Lists containing this project

README