https://github.com/csuoc/breaking_vlad_audio_AI
https://github.com/csuoc/breaking_vlad_audio_AI
ai audio aws openai polly python pytube selenium transcription whisper
Last synced: over 1 year ago
JSON representation
- Host: GitHub
- URL: https://github.com/csuoc/breaking_vlad_audio_AI
- Owner: csuoc
- License: mit
- Created: 2023-01-05T17:23:39.000Z (over 3 years ago)
- Default Branch: main
- Last Pushed: 2023-01-25T11:01:09.000Z (over 3 years ago)
- Last Synced: 2024-10-24T10:07:43.878Z (over 1 year ago)
- Topics: ai, audio, aws, openai, polly, python, pytube, selenium, transcription, whisper
- Language: Python
- Homepage:
- Size: 52.7 KB
- Stars: 1
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# Automatic extraction, transcription and translation of Youtube channel's videos using Python and Artificial Intelligence.
I keep learning and progressing. These last weeks I have had the honour of collaborating with Vladimir Sánchez Gonzaga, CEO of **BreakingVlad** (https://breakingvlad.com/) with more than 350.000 subscribers on Youtube, for the expansion of his educational project in English.
Digging into the field of automation and AI, I have written a couple of scripts in Python that allow the extraction of all the links from his published videos by webscraping with **Selenium**, extract the audio with **pytube** and use the new AI **Whisper** for transcription and translation of the audios. The text is then read by **Polly**, an **AWS** service. The final file is an English translation of the original audio with an almost natural tone.
In the following video you can see an example of how it would look like. Thank you very much Vladimir for the opportunity.
https://user-images.githubusercontent.com/20581587/214352603-e595d659-eede-44f9-b4f4-76da875d00b0.mp4