Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/apurvjha123/aiquest

npm package for semantic search & chatbot QnA using OpenAI/ChatGPT. Includes web parsing, chunking, embedding, and AWS storage
https://github.com/apurvjha123/aiquest

Last synced: 12 days ago
JSON representation

npm package for semantic search & chatbot QnA using OpenAI/ChatGPT. Includes web parsing, chunking, embedding, and AWS storage

Awesome Lists containing this project

README

        

# πŸ“– AIQuest

### Aiquest also known as RAG-AIQuest is an npm package that streamlines the process of parsing websites, splitting content into manageable chunks, embedding these chunks into machine-friendly vectors, and subsequently storing and retrieving these embeddings from AWS. This documentation outlines its design and possibilities.

### πŸš€ Table of Contents
- πŸ”§ [Design](#-Design)
- πŸ” [Installation](#-Installation)
- πŸ› οΈ [Usage](#%EF%B8%8F-Usage)
- πŸ•ΈοΈ [Parsing](#%EF%B8%8F-Parsing)
- βœ‚οΈ [Chunking](#%EF%B8%8F-Chunking)
- 🧬 [Embedding](#-Embedding)
- ☁️ [Storing on AWS](#%EF%B8%8F-Storing-on-AWS)
- πŸ”Ž [Retrieval](#-Retrieval)
- πŸ“ [Examples](#-Examples)
- 🌟 [Future Enhancements](#-Future-Enhancements)
- 🀝 [Contribution](#-Contribution)
- πŸ› [Bug Reporting](#-Bug-Reporting)

## πŸ”§ Design
#### rag-aiquest integrates several utilities under one package:

**UnifiedParser**: For parsing content from URLs,PDF or Text File.

**ChunkUtility**: To split the parsed content into chunks.

**EmbeddingUtility**: Utilizes the OpenAI API to embed the chunks into vectors.

**VectorStoreAWS**: A utility for AWS operations related to embedding storage.

**Retrival**: Provides functionality to retrieve knowledge and run QnA.

## πŸ” Installation

```javascript
npm install rag-aiquest
```

## πŸ› οΈ Usage

### πŸ•ΈοΈ Parsing

***Use the UnifiedParser to parse content from a URL.***

```javascript
const parser = new UnifiedParser();
const parsedValue = await parser.parse('YOUR_URL_HERE');
```

### βœ‚οΈ Chunking

***To split the parsed content into chunks:***

```javascript
const chunks = ChunkUtility.splitIntoChunks(parsedValue, chunkSize, overlapSize);
```

### 🧬 Embedding

***Embed chunks using OpenAI API.***

```javascript
const embedding = new EmbeddingUtility('YOUR_OPENAI_API_KEY');
const embedded = await embedding.createEmbedding(chunks);
```

### ☁️ Storing on AWS

***To upload the embedded model to AWS:***

```javascript
const aws = new VectorStoreAWS(AWS_ACCESS_KEY_ID, AWS_ACCESS_SECRET, AWS_BUCKET_NAME);
await aws.uploadEmbededModeltoAWS(embedded, 'YOUR_FILE_NAME');
```

### πŸ”Ž Retrieval
***To retrieve and query the knowledge:***

```javascript
const knowledge = await aws.getKnowledgeData('YOUR_FILE_NAME');
const retrive = new Retrival('YOUR_OPENAI_API_KEY');
const search = await retrive.QnARetrival(knowledge, 'YOUR_QUERY');
console.log(search.choices[0].message);
```

## πŸ“ Examples
**As given in the provided code, you can easily integrate the utilities to parse, chunk, embed, store, and retrieve knowledge.**

## 🌟 Future Enhancements
***Compression***: Improve storage efficiency by compressing embedded vectors.

***Batch Processing***: Enhance the library to handle batch processing of URLs.

***Support for More Embeddings***: Plan to add support for other embedding APIs.

## 🀝 Contribution
If you wish to contribute to rag-aiquest, please refer to the CONTRIBUTING.md file.

## πŸ› Bug Reporting

Feel free to [open an issue](https://github.com/apurvjha123/aiquest) on GitHub if you find any bug.

## ⭐ Feature Request

- Feel free to [Open an issue](https://github.com/apurvjha123/aiquest) on GitHub to request any additional features you might need for your use case.
- Connect with me on [LinkedIn](https://www.linkedin.com/in/apurv-jha-7367b1236/). I'd love ❀️️ to hear where you are using this library.

## πŸ“‹ Release Notes

Check [here](https://github.com/apurvjha123/aiquest/releases) for release notes.

## πŸ›  Prerequisites
* Active OpenAI API Key
* AWS BUCKET SECRET_KEY,Bucket Name, AUTH KEY

# πŸ’Œ Support
##### If you encounter any issues or require further assistance, please reach out to our support team at [email protected].