https://github.com/gagan3012/image2audio
Convert Image to audio using ViT, GPT and FastSpeech
https://github.com/gagan3012/image2audio
fastspeech2 gpt-2 image-captioning imagecaptioning pytorch speech-to-text vit
Last synced: about 2 months ago
JSON representation
Convert Image to audio using ViT, GPT and FastSpeech
- Host: GitHub
- URL: https://github.com/gagan3012/image2audio
- Owner: gagan3012
- License: mit
- Created: 2022-02-07T09:09:33.000Z (over 3 years ago)
- Default Branch: master
- Last Pushed: 2025-01-06T20:14:17.000Z (5 months ago)
- Last Synced: 2025-02-13T20:13:48.267Z (3 months ago)
- Topics: fastspeech2, gpt-2, image-captioning, imagecaptioning, pytorch, speech-to-text, vit
- Language: Python
- Homepage:
- Size: 30.3 KB
- Stars: 0
- Watchers: 3
- Forks: 0
- Open Issues: 1
-
Metadata Files:
- Readme: README.md
- License: LICENSE
- Codeowners: .github/CODEOWNERS
Awesome Lists containing this project
README
# image2audio
My personal python projects templateWill be used in all new projects by me