https://github.com/godeltech/semantic_qa

A small semantic Q&A demo using langchain
https://github.com/godeltech/semantic_qa

Last synced: 9 months ago
JSON representation

A small semantic Q&A demo using langchain

Host: GitHub
URL: https://github.com/godeltech/semantic_qa
Owner: GodelTech
License: mit
Created: 2023-09-25T23:34:51.000Z (almost 3 years ago)
Default Branch: main
Last Pushed: 2024-05-11T09:53:03.000Z (about 2 years ago)
Last Synced: 2024-12-26T18:19:05.666Z (over 1 year ago)
Language: Python
Homepage:
Size: 66.4 KB
Stars: 0
Watchers: 3
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

          # semantic_qa

A small semantic Q&A demo using langchain and openai

## Prerequisites

The only hard requirements are:

1. Python 3.10+ with pip and virtualenv.

2. An [OpenAI API](https://openai.com/product) key. Although this will require setting up a payment plan with a credit card, per-call costs are [very low](https://openai.com/pricing).

  

Although it's not a pre-requisite, having a CUDA-compatible GPU is strongly advised to generate text embeddings locally using larger models.

## Dependencies

After cloning this repo, create and activate a Python virtual environment, then install the required Python packages using pip:

```Powershell

PS > virtualenv venv

PS > venv\scripts\activate.ps1

(venv) PS > pip install -r requirements_dev.txt

(venv) PS > pip install -r requirements.txt

```

```sh

$ virtualenv venv

$ source venv/bin/activate

(venv) $ pip install -r requirements_dev.txt

(venv) $ pip install -r requirements.txt

```

If you have a CUDA-capable GPU, install the right torch packages as described at https://pytorch.org/get-started/locally/ 

```

pip install -U --force-reinstall --no-deps torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu118

```

## Document corpus

Choose whatever document folder you fancy. Why not try a local copy of Godel's security policies from Sharepoint? 

## Vector database

The demo currently supports the following vector stores:

 * [ChromaDB](https://www.trychroma.com/)

 * [Pgvector](https://github.com/pgvector/pgvector). [Set-up instructions](vector_stores_howtos/pgvector.md)

 * [Redis Stack](https://redis.io/docs/about/about-stack/). [Set-up instructions](vector_stores_howtos/redis-stack.md)

 * [Pinecone](https://www.pinecone.io/). [Set-up instructions](vector_stores_howtos/pinecone.md)

 * [MongoDB Atlas](https://www.mongodb.com/atlas/database). [Set-up instructions](vector_stores_howtos/mongodb_atlas.md)

 * [Elasticsearch](https://www.elastic.co/elasticsearch/vector-database). [Set-up instructions](vector_stores_howtos/elasticsearch.md)

 * [Neo4j](https://neo4j.com/docs/cypher-manual/current/indexes-for-vector-search/). [Set-up instructions](vector_stores_howtos/neoj4.md)

The first one uses file-based SQLite for storage and does not require any work. All the others need some set up, detailed in the links above.

## Embeddings generator models

The demo currently supports:

 * Calling the [OpenAI embeddings API](https://platform.openai.com/docs/api-reference/embeddings), which requires an API key and a payment plan, using the model ["text-embedding-ada-002"](https://openai.com/blog/new-and-improved-embedding-model) by default

 * Generating embeddings locally using torch and a pre-trained model downloaded from [Hugging Face](https://huggingface.co/models). The default model is ["all-MiniLM-L6-v2"](https://huggingface.co/sentence-transformers/all-MiniLM-L6-v2)

 * Generating embeddings locally using torch and one of the pre-trained [Instructor](https://github.com/HKUNLP/instructor-embedding) models. The default used is ["hkunlp/instructor-large"](https://github.com/HKUNLP/instructor-embedding#model-list)

## Running the demo

```

(venv) $> python semantic_qa.py

```

The first time it runs, leave REBUILD = True to ensure the script iterates over the files in the corpus and generates the embeddings. In successive runs, you can change REBUILD = False and just test different values of QUERY_STR or tweaks to the GPT prompt.

## Web UI

```

(venv) $> chainlit run ./chainlit_app.py -w

```

This will run a small web UI on port 8000 by default.

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/godeltech/semantic_qa

Awesome Lists containing this project

README