https://github.com/runtime-error786/vidtech-insight
https://github.com/runtime-error786/vidtech-insight
chromadb huggingface langchain llava python youtube
Last synced: 10 months ago
JSON representation
- Host: GitHub
- URL: https://github.com/runtime-error786/vidtech-insight
- Owner: runtime-error786
- Created: 2024-08-11T12:28:42.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2024-08-11T12:41:06.000Z (over 1 year ago)
- Last Synced: 2025-02-01T08:22:20.123Z (12 months ago)
- Topics: chromadb, huggingface, langchain, llava, python, youtube
- Language: Jupyter Notebook
- Homepage:
- Size: 36.9 MB
- Stars: 1
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
Video Content Analyzer
This project is a comprehensive tool that allows users to download a video from YouTube, extract frames and audio, transcribe the audio into text, and analyze both the video frames and the audio text using AI models to provide a summary of the content. The summaries are focused on programming languages and concepts.
Features
Download YouTube Video: Downloads the video from a provided YouTube URL.
Frame Extraction: Extracts frames from the video at a rate of one frame every four seconds.
Audio Extraction: Extracts the audio from the video.
Audio Transcription: Converts the extracted audio into text using Google Speech Recognition.
Frame Analysis: Uses an AI model to generate summaries of the video frames in the context of programming languages.
Audio Analysis: Summarizes the transcribed audio, focusing on programming-related concepts.
Combined Summary: Merges the audio and frame summaries to provide a comprehensive analysis based on a user-provided query.
Content Search: Uses a vector database to store and retrieve relevant content based on the user's query.