https://github.com/ajinkyat/lip_reading_in_the_wild_avsr
Audio-Visual Speech Recognition using Deep Learning
https://github.com/ajinkyat/lip_reading_in_the_wild_avsr
Last synced: about 2 months ago
JSON representation
Audio-Visual Speech Recognition using Deep Learning
- Host: GitHub
- URL: https://github.com/ajinkyat/lip_reading_in_the_wild_avsr
- Owner: ajinkyaT
- Created: 2018-06-04T13:37:51.000Z (about 7 years ago)
- Default Branch: master
- Last Pushed: 2018-11-14T15:38:11.000Z (over 6 years ago)
- Last Synced: 2025-04-15T21:52:56.922Z (about 2 months ago)
- Language: Python
- Size: 91 MB
- Stars: 60
- Watchers: 5
- Forks: 21
- Open Issues: 4
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
## Audio-Visual Speech Recognition using Deep Learning
Work done during [Google Summer of Code](https://summerofcode.withgoogle.com)
- ```SyncNet\``` : Current speaker recognition in a video frame using Deep Learning, without any labelled speaker data, as described in the paper, [Out of time: automated lip sync in the wild (SyncNet)](http://www.robots.ox.ac.uk/~vgg/software/lipsync/)- ```LRW_Sentences\``` : AVSR model described in the paper, [Lip Reading Sentences in the Wild](https://arxiv.org/abs/1611.05358)