https://github.com/gagan3012/image2audio
Convert an image to audio using ViT, GPT, and FastSpeech
- Host: GitHub
- URL: https://github.com/gagan3012/image2audio
- Owner: gagan3012
- License: MIT
- Created: 2022-02-07T09:09:33.000Z (over 4 years ago)
- Default Branch: master
- Last Pushed: 2026-04-06T18:26:56.000Z (2 months ago)
- Last Synced: 2026-04-06T20:23:49.369Z (2 months ago)
- Topics: fastspeech2, gpt-2, image-captioning, imagecaptioning, pytorch, speech-to-text, vit
- Language: Python
- Homepage:
- Size: 30.3 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 1
Metadata Files:
- Readme: README.md
- License: LICENSE
- Codeowners: .github/CODEOWNERS
README
# image2audio
My personal Python project template.
I will use it in all my new projects.