Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/jalvarezz13/summarai

SummarAI utilizes PyMovie and Whisper to transcribe videos, enabling you to ask questions about the content using Llama2 and Llama-index for insightful interaction.
https://github.com/jalvarezz13/summarai

llama-index llama2 pymovie whisper

Last synced: about 1 month ago
JSON representation

SummarAI utilizes PyMovie and Whisper to transcribe videos, enabling you to ask questions about the content using Llama2 and Llama-index for insightful interaction.

Host: GitHub
URL: https://github.com/jalvarezz13/summarai
Owner: jalvarezz13
License: mit
Created: 2023-10-13T16:44:54.000Z (over 1 year ago)
Default Branch: main
Last Pushed: 2023-10-13T20:31:03.000Z (over 1 year ago)
Last Synced: 2024-10-18T22:01:20.348Z (3 months ago)
Topics: llama-index, llama2, pymovie, whisper
Language: Python
Homepage:
Size: 62 MB
Stars: 0
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

🦙 SummarAI 🤖

### Table of Contents

- [📢 What is _SummarAI_?](#-what-is-summarAI)
- [⚙️ Execution](#%EF%B8%8F-execution)
- [⭐ Current Features](#-current-features)
- [⚒️ Troubleshooting](#-troubleshooting)

## 📢 What is _SummarAI_?

**SummarAI** employs the combination of _PyMovie_ and _Whisper_ to provide video transcription services, allowing users to ask questions about the video's content with the help of _Llama2_ and _LlamaIndex_ for more engaging and insightful interactions.

**SummarAI** is divided into three different files:

- `1_extract_audio.py`: This file utilizes the _PyMovie_ library to extract audio from the specified video in a matter of seconds.

- `2_transcribe_audio.py`: In this case, the "speech recognition" model from _OpenAI_, known as _Whisper_, is used to transcribe the audio into a text file.

- `3_llm_over_data`: Finally, this file makes use of _LlamaIndex_, which, based on the _Llama2-13B_ model, provides context to the data from the previously prepared video and responds accurately within that context.

## ⚙️ Execution

To run the **SummarAI**, you can use `pipenv`. First, ensure you have `pipenv` installed. Then, navigate to the root directory of the project, where you can install the dependencies and activate the virtual environment using the following commands:

```bash
pipenv install
pipenv shell
```

Afterwards, simply run the file you want using one of these commands:

```bash
python 1_extract_audio.py
python 2_transcribe_audio.py
python 3_llm_over_data.py
```

## ⭐ Current features

✅ Extract audio from videos

✅ Transcribe audio using Whisper

✅ Resolve your inquiries with the assistance of Llama2

## ⚒️ Troubleshooting

If you encounter issues while installing the `llama-cpp-python` library, you may need to run the following commands:

```bash
set CMAKE_ARGS=-DLLAMA_CUBLAS=on
set FORCE_CMAKE=1
```

If the error persists, please check your C compiler (_gcc_) or, if you are using Windows, update/install the _"Desktop development with C++"_ feature by utilizing the [Microsoft C++ Build Tools](https://visualstudio.microsoft.com/es/visual-cpp-build-tools/)

Made with ❤️ by jalvarezz13

Logo generated using DALL-E 3