Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/403errors/tubequery
TubeQuery is a LLM based model, fetching all the queries related to your video. Just input the video link and all the qestiones are welcomed!
https://github.com/403errors/tubequery
huggingface-transformers langchain nlp-machine-learning pipeline python3 tiktoken whisper yt-dlp
Last synced: about 14 hours ago
JSON representation
TubeQuery is a LLM based model, fetching all the queries related to your video. Just input the video link and all the qestiones are welcomed!
- Host: GitHub
- URL: https://github.com/403errors/tubequery
- Owner: 403errors
- Created: 2024-12-02T21:10:39.000Z (2 months ago)
- Default Branch: main
- Last Pushed: 2025-02-12T23:30:25.000Z (2 days ago)
- Last Synced: 2025-02-13T00:27:20.328Z (2 days ago)
- Topics: huggingface-transformers, langchain, nlp-machine-learning, pipeline, python3, tiktoken, whisper, yt-dlp
- Language: Jupyter Notebook
- Homepage: https://www.kaggle.com/code/sitama/tubequery
- Size: 336 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# TubeQuery
**TubeQuery** is an LLM-based model designed to fetch queries related to YouTube videos. By simply inputting a video link, you can ask questions about the video's content and receive answers.
[![Kaggle](https://img.shields.io/badge/Kaggle-View%20on%20Kaggle-blue?logo=kaggle)](https://www.kaggle.com/code/sitama/tubequery)
Here is the [publication](https://app.readytensor.ai/publications/sw0uANUAdEXE) of TubeQuery.
## Overview
TubeQuery leverages advanced Natural Language Processing (NLP) techniques to analyze video content. It extracts audio, transcribes it, and then uses a language model to answer questions related to the video's content. This makes it easy to get quick answers and summaries from video lectures, tutorials, interviews, and more.
## Features
Here's a breakdown of the key features of TubeQuery:
1. **Video Analysis and Processing:**
- Accepts video links (e.g., YouTube) as input.
- Automatically extracts audio from the provided video link.
- Performs speech-to-text conversion using OpenAI's Whisper model to generate accurate transcripts.
- Enables querying of the video content based on the generated transcript.![Input Video Link](imgs/input_video_link.png)
2. **Natural Language Query Support:**
- Allows users to ask questions in natural language, making it intuitive and user-friendly.
- Provides accurate and contextually relevant answers derived directly from the video content.![Question and Answers](imgs/question_answers.png)
3. **Transcript Summarization:**
- Offers the capability to summarize lengthy videos into concise summaries.
- Highlights the most important key points, saving users valuable time.![Summary](imgs/summary.png)
4. **Multi-Language Support:**
- Supports transcription and querying in multiple languages, depending on the language spoken in the video.5. **Adaptable Framework:**
- Designed to work with videos from various platforms, as long as they are accessible via a public link.
- Compatible with a wide range of video content, including educational tutorials, academic lectures, and engaging interviews.---
## Future Improvements
The following are planned enhancements for future versions of TubeQuery:
1. **Enhanced Accuracy:**
- Integrating more advanced AI models to further improve the accuracy of both transcription and query responses.
- Implementing context-aware models to better understand complex or potentially ambiguous queries.2. **Real-Time Processing:**
- Exploring the possibility of enabling live video analysis to provide real-time query responses during live events or streams.3. **Support for Multiple Video Sources:**
- Expanding support to include private videos, YouTube playlists, and the option for users to upload custom media files.4. **Improved Interface:**
- Developing a more interactive and user-friendly interface, potentially including features like voice input for queries and enhanced visualization of results.5. **Advanced Analytics:**
- Incorporating advanced analytics features to provide insights into video content, such as sentiment analysis, keyword extraction, and topic detection.6. **Integration with External Tools:**
- Aiming to integrate with popular note-taking applications, learning management systems (LMS), and collaboration platforms to enhance usability and workflow.7. **Cloud-Based Deployment:**
- Transitioning to a cloud-based deployment to allow for scalable processing of larger datasets and to handle high traffic usage efficiently.8. **Personalization:**
- Implementing personalization features to enable user-specific recommendations and adjustments based on individual preferences and past query history.---
## Tech Stack
TubeQuery is built using the following technologies:
1. **Backend:**
- **Programming Language**: Python
- **Framework**: Utilizes standard libraries and direct implementation without relying on extensive API frameworks.2. **Speech-to-Text:**
- **Libraries**: OpenAI Whisper for accurate and efficient speech-to-text conversion.3. **Natural Language Processing:**
- **Libraries/Models**: Hugging Face Transformers for advanced NLP tasks and question answering.4. **Video Processing:**
- **Tools**: FFMPEG is used for robust audio extraction and general video handling.