https://github.com/csuoc/breaking_vlad_audio_AI

ai audio aws openai polly python pytube selenium transcription whisper

Last synced: over 1 year ago
JSON representation

Host: GitHub
URL: https://github.com/csuoc/breaking_vlad_audio_AI
Owner: csuoc
License: mit
Created: 2023-01-05T17:23:39.000Z (over 3 years ago)
Default Branch: main
Last Pushed: 2023-01-25T11:01:09.000Z (over 3 years ago)
Last Synced: 2024-10-24T10:07:43.878Z (over 1 year ago)
Topics: ai, audio, aws, openai, polly, python, pytube, selenium, transcription, whisper
Language: Python
Homepage:
Size: 52.7 KB
Stars: 1
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

# Automatic extraction, transcription and translation of Youtube channel's videos using Python and Artificial Intelligence.

I keep learning and progressing. These last weeks I have had the honour of collaborating with Vladimir Sánchez Gonzaga, CEO of **BreakingVlad** (https://breakingvlad.com/) with more than 350.000 subscribers on Youtube, for the expansion of his educational project in English.

Digging into the field of automation and AI, I have written a couple of scripts in Python that allow the extraction of all the links from his published videos by webscraping with **Selenium**, extract the audio with **pytube** and use the new AI **Whisper** for transcription and translation of the audios. The text is then read by **Polly**, an **AWS** service. The final file is an English translation of the original audio with an almost natural tone.

In the following video you can see an example of how it would look like. Thank you very much Vladimir for the opportunity.

https://user-images.githubusercontent.com/20581587/214352603-e595d659-eede-44f9-b4f4-76da875d00b0.mp4

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/csuoc/breaking_vlad_audio_AI

Awesome Lists containing this project

README