https://github.com/TWIML/TWIML-RAG


Host: GitHub
URL: https://github.com/TWIML/TWIML-RAG
Owner: TWIML
Created: 2023-07-07T16:05:06.000Z (almost 2 years ago)
Default Branch: main
Last Pushed: 2024-03-16T17:54:25.000Z (over 1 year ago)
Last Synced: 2024-10-27T23:59:34.435Z (8 months ago)
Language: Jupyter Notebook
Size: 104 MB
Stars: 3
Watchers: 5
Forks: 3
Open Issues: 0
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

awesome-ai-engineering-reads - TWIML-RAG - a TWIML generative_ai community project.

README

# TWIML-RAG - a TWIML Generative AI community project.

This project aims to build a generative AI dialog application as a learning exercise for our community. The application consists of two parts: a transcription pipeline that transcribes TWIML podcast episodes for human and bot consumption, and the dialog agent/bot itself, which will be available to our community to answer questions about the podcast and its subject area. Both are meant to give podcast listeners and community members additional resources for learning about ML/AI.

## Architecture

```mermaid
flowchart TB
  subgraph TWIML Podcast RAG
    subgraph Transcription Pipeline
        direction LR
        A[Podcast Audio File - mp3 fa:fa-file-audio] --> B["AUDIO-TO-TEXT
        - WhisperX -"]
        A --> C["SPEAKER IDENTIFICATION
        - pyannote.audio -"]
        G[Shownotes] --> H[METADATA]
        I[Web Crawls] --> H
        H --> D
        B --> D[FUSION]
        C --> D
        D --> E[(fa:fa-file Transcription Data Files)]
        E --> F[Index]
    end
    subgraph "Chatbot using Augmented Generation"
        direction LR
            P[QUERY] ---> Q[RETRIEVER]
            Q ---> R[GENERATOR]
            Q <-- augmentation --> F
    end
  end
```
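The FUSION node in the diagram combines the audio-to-text output with the speaker identification output. The README does not say how the project does this, so the following is only a minimal sketch of one common approach: label each transcribed segment with the speaker whose turn overlaps it the most in time. The `Segment` and `Turn` classes and the `fuse` function are hypothetical names for illustration and are not the output formats of WhisperX or pyannote.audio.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    """A transcribed span of audio (hypothetical shape, not WhisperX's format)."""
    start: float
    end: float
    text: str


@dataclass
class Turn:
    """A diarized speaker turn (hypothetical shape, not pyannote.audio's format)."""
    start: float
    end: float
    speaker: str


def overlap(a_start, a_end, b_start, b_end):
    """Length in seconds of the intersection of two intervals (0 if disjoint)."""
    return max(0.0, min(a_end, b_end) - max(a_start, b_start))


def fuse(segments, turns, unknown="UNKNOWN"):
    """Label each segment with the speaker whose turn overlaps it the most."""
    fused = []
    for seg in segments:
        best_speaker, best_overlap = unknown, 0.0
        for turn in turns:
            shared = overlap(seg.start, seg.end, turn.start, turn.end)
            if shared > best_overlap:
                best_speaker, best_overlap = turn.speaker, shared
        fused.append({"start": seg.start, "end": seg.end,
                      "speaker": best_speaker, "text": seg.text})
    return fused
```

Segments that overlap no speaker turn are labeled `UNKNOWN` rather than being dropped, so the transcript text stays complete even where diarization has gaps.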

## Project/Repo Overview

There are four components to the project/repository. The first is the Speech-to-Text Pipeline (`speech_to_text`), which creates transcripts from the podcast episodes. Next is the Embeddings Pipeline (`embeddings`), which creates embeddings from the transcripts. Third is the RAG Backend (`fn_rag`), which runs Azure Functions locally to serve as endpoints for the RAG client. Finally, the RAG Frontend (`web_rag`) is the website that runs the RAG client, which calls the Azure endpoints.
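To make the flow from embeddings to retrieval to generation concrete, here is a minimal, self-contained sketch of the retrieve-then-augment step. It is an illustration under stated assumptions, not the project's code: a toy bag-of-words vector stands in for a real embedding model, an in-memory list stands in for the vector index, and `embed`, `retrieve`, and `build_prompt` are hypothetical names.

```python
import math
from collections import Counter


def embed(text):
    """Toy 'embedding': lowercase token counts (a stand-in for a real model)."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(count * b[token] for token, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(query, chunks, k=2):
    """Return the k transcript chunks most similar to the query, best first."""
    q = embed(query)
    return sorted(chunks, key=lambda c: cosine(q, embed(c)), reverse=True)[:k]


def build_prompt(query, context_chunks):
    """Assemble the augmented prompt that a generator model would receive."""
    context = "\n".join(f"- {c}" for c in context_chunks)
    return f"Answer using only this context:\n{context}\n\nQuestion: {query}"
```

In a real deployment the prompt returned by `build_prompt` would be sent to a language model; that call is left out here because the README does not specify which model or API the backend uses.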

## Repo Setup Instructions

Please see the individual component README files for setup instructions.

### Speech to Text Pipeline

See [speech_to_text/README.md](proj/speech_to_text/README.md)

### Qdrant Embeddings

See [embeddings/README.md](proj/embeddings/README.md)

### RAG Backend (Azure Functions)

See [fn_rag/README.md](proj/fn_rag/README.md)

### RAG Frontend (Browser)

See [web_rag/README.md](proj/web_rag/README.md)
