https://github.com/ajinkyat/lip_reading_in_the_wild_avsr

Audio-Visual Speech Recognition using Deep Learning

Host: GitHub
URL: https://github.com/ajinkyat/lip_reading_in_the_wild_avsr
Owner: ajinkyaT
Created: 2018-06-04T13:37:51.000Z (about 7 years ago)
Default Branch: master
Last Pushed: 2018-11-14T15:38:11.000Z (over 6 years ago)
Last Synced: 2025-04-15T21:52:56.922Z (about 2 months ago)
Language: Python
Size: 91 MB
Stars: 60
Watchers: 5
Forks: 21
Open Issues: 4
Metadata Files:
- Readme: README.md

README

## Audio-Visual Speech Recognition using Deep Learning

Work done during [Google Summer of Code](https://summerofcode.withgoogle.com).

- `SyncNet/`: Detects the current (active) speaker in a video frame using deep learning, without any labelled speaker data, as described in the paper [Out of time: automated lip sync in the wild (SyncNet)](http://www.robots.ox.ac.uk/~vgg/software/lipsync/).

- `LRW_Sentences/`: The AVSR model described in the paper [Lip Reading Sentences in the Wild](https://arxiv.org/abs/1611.05358).
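
To illustrate how speaker detection can work without labelled speaker data, the sketch below scores each face track by how well its lip-motion embeddings line up with the audio embeddings. It is a minimal illustration, not this repository's code: the function names, the `max_offset` parameter, and the assumption that per-window embeddings have already been computed by the two network streams are all hypothetical. The confidence measure (median distance minus minimum distance across temporal offsets) follows the general idea described in the SyncNet paper.

```python
import math
import statistics


def l2(a, b):
    """Euclidean distance between two equal-length embedding vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def sync_confidence(video_emb, audio_emb, max_offset=2):
    """Return (best_offset, confidence) for one face track.

    video_emb and audio_emb are lists of per-window embedding vectors
    (assumed to be precomputed; hypothetical inputs). For every temporal
    offset, the audio-video distance is averaged over the overlapping
    windows. Confidence is the median distance minus the minimum distance.
    """
    dists = {}
    for off in range(-max_offset, max_offset + 1):
        pairs = [
            (video_emb[t], audio_emb[t + off])
            for t in range(len(video_emb))
            if 0 <= t + off < len(audio_emb)
        ]
        if pairs:
            dists[off] = sum(l2(v, a) for v, a in pairs) / len(pairs)
    best = min(dists, key=dists.get)
    confidence = statistics.median(dists.values()) - dists[best]
    return best, confidence


def active_speaker(face_tracks, audio_emb, max_offset=2):
    """Pick the face track whose lip motion is most confidently in sync."""
    scores = {
        name: sync_confidence(emb, audio_emb, max_offset)[1]
        for name, emb in face_tracks.items()
    }
    return max(scores, key=scores.get), scores
```

A track that matches the audio at one offset and mismatches at the others gets a sharp minimum and therefore a high confidence. A silent face produces a flat distance profile and a confidence near zero.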
