https://github.com/gokulgowthams/askdocs_gen-ai
Private Document Questioning and Answering Application which can answer any question that has been asked, by uploading the desired document the user can ask questions
https://github.com/gokulgowthams/askdocs_gen-ai
chromadb deeplearning docx2txt faiss generativeai huggingface langchain llama2 llamaindex openaiembeddings pdfplumber pinecone pymupdf sentencetransformer streamlit
Last synced: about 2 months ago
JSON representation
Private Document Questioning and Answering Application which can answer any question that has been asked, by uploading the desired document the user can ask questions
- Host: GitHub
- URL: https://github.com/gokulgowthams/askdocs_gen-ai
- Owner: GokulGowthamS
- Created: 2025-04-08T09:46:30.000Z (6 months ago)
- Default Branch: main
- Last Pushed: 2025-04-12T14:02:30.000Z (6 months ago)
- Last Synced: 2025-07-22T04:47:32.950Z (3 months ago)
- Topics: chromadb, deeplearning, docx2txt, faiss, generativeai, huggingface, langchain, llama2, llamaindex, openaiembeddings, pdfplumber, pinecone, pymupdf, sentencetransformer, streamlit
- Language: Python
- Homepage:
- Size: 2.1 MB
- Stars: 1
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# ๐ฌ AskDocs AI
> *From documents to decisions โ powered by AI, secured locally.*
## ๐ What is AskDocs AI?
**AskDocs AI** is a early stage startup comprising a powerful and private document question-answering app powered by **LLaMA 2** and **LangChain**. Upload your PDF, DOCX, or TXT files, and ask intelligent questions. All data is processed locally to maintain maximum privacy.
---
## ๐ฏ Features
- ๐ Supports PDF, DOCX, and TXT files
- ๐ง Local LLaMA 2 model inference (no cloud dependencies)
- ๐ Intelligent search & similarity-based retrieval
- ๐ฆ Built with Streamlit for an interactive UI
- ๐ผ๏ธ Custom dark-themed background support
- ๐งต Efficient multi-threaded document parsing---
## ๐ Getting Started
### ๐ฆ Prerequisites
- Python 3.9+
- Download the LLaMA 2 model from [HERE!](https://huggingface.co/gokulgowtham01/AskDocs_GEN-AI/tree/main)### ๐ Folder Structure
```
project_root/
โโโ gen_ai.py # Streamlit application
โโโ images/
โ โโโ freepik__adjust__9850.jpeg
โ โโโ 767.jpg
โโโ faiss_index/ # Generated vector store (auto-created)
โโโ evaluation_logs/ # Generated json file (auto-created)
โโโ documents/
โ โโโ empty.txt
โ โโโ sample_doc.docx
โ โโโ Advanced_Facts_Octopus.pdf
โ โโโ sample_txt.txt
โโโ requirements.txt
```---
## ๐งฐ Installation
```bash
# Clone the repo
git clone https://github.com/yourusername/AskDocs_GEN-AI.git
cd AskDocs_GEN-AI# (Optional) Create a virtual environment
python -m venv venv
source venv/bin/activate # On Windows: venv\Scripts\activate# Install dependencies
pip install -r requirements.txt```
## ๐ง Usage```bash
# Run
streamlit run app.py
```1. Upload one or more PDF, DOCX, or TXT documents.
2. Click Start The Fun!
3. Ask your question in natural language.
4. Get answers with cited document sources.## ๐ฆ Dependencies
- streamlit
- PyMuPDF
- python-docx
- docx2txt
- requests
- numpy
- langchain
- sentence-transformers
- nltk
- rouge-score
- ctransformers
- faiss-cpu
- huggingface-hub## ๐ Notes
- You need to download and manually place the LLaMA model.
- Ensure GPU support is configured if needed (set gpu_layers accordingly).
- Image background is customizable โ make sure the image path is valid.