Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/saritaphd/end-to-end-document-q---a-using-gemma
The application leverages the power of large language models (LLMs) and document embeddings to perform document retrieval and question answering.
https://github.com/saritaphd/end-to-end-document-q---a-using-gemma
Last synced: 3 days ago
JSON representation
The application leverages the power of large language models (LLMs) and document embeddings to perform document retrieval and question answering.
- Host: GitHub
- URL: https://github.com/saritaphd/end-to-end-document-q---a-using-gemma
- Owner: SaritaPhD
- License: mit
- Created: 2024-07-25T07:52:15.000Z (4 months ago)
- Default Branch: main
- Last Pushed: 2024-07-25T08:05:49.000Z (4 months ago)
- Last Synced: 2024-07-26T09:19:30.834Z (4 months ago)
- Language: Python
- Homepage:
- Size: 4.12 MB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# End-To-End-Document-Q&A-Using-Gemma
## Overview
This Streamlit application allows users to interact with documents by asking questions and receiving accurate answers based on the provided context. The application leverages the power of large language models (LLMs) and document embeddings to perform document retrieval and question answering.## Features
- Load documents from a specified directory
- Split documents into chunks for better processing
- Embed documents using Google Generative AI Embeddings
- Create a vector store for efficient document retrieval
- Answer questions based on the content of the documents## Requirements
- Python 3.9+
- Streamlit
- langchain
- langchain_groq
- langchain_community
- dotenv
- PyPDF2
- faiss## Installation
- Clone the repository: git clone [email protected]:SaritaPhDEnd-To-End-Document-Q---A-Using-Gemma.git
- cd End-To-End-Document-Q---A-Using-Gemma
- Create a virtual environment: python3.10 -m venv venv
- source venv/bin/activate # On Windows use `venv\Scripts\activate`
- Install the required packages: pip install -r requirements.txt## Set up your environment variables:
- Create a .env file in the root directory of the project.
- Add your GROQ and Google API keys:
GROQ_API_KEY=your_groq_api_key
GOOGLE_API_KEY=your_google_api_key## Usage
Run the Streamlit application: streamlit run app.pyThe application interface will open in your web browser.
## Load the documents:
- Click on the "Documents Embedding" button to load and process the documents from the ./us_census directory.
- Ask a question: Enter your question in the text input field and press Enter. The application will process your question and provide an answer based on the content of the documents.## Project Structure
- app.py: Main application file
- requirements.txt: List of required Python packages
- .env: Environment variables file (not included in the repository)![Alt text]()