https://github.com/arjun-ms/annual-report-reader
https://github.com/arjun-ms/annual-report-reader
Last synced: 10 months ago
JSON representation
- Host: GitHub
- URL: https://github.com/arjun-ms/annual-report-reader
- Owner: arjun-ms
- Created: 2023-08-31T06:33:10.000Z (almost 3 years ago)
- Default Branch: main
- Last Pushed: 2023-09-03T06:00:06.000Z (almost 3 years ago)
- Last Synced: 2025-05-06T18:08:39.537Z (about 1 year ago)
- Language: Python
- Size: 32.6 MB
- Stars: 1
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README

# Annual Report Reader
Our project aims to create an interactive and user-friendly web application that allows users to extract insights and information from Annual Reports of a company PDF documents.
We integrate Langchain, OpenAI, and ChromaDB to deliver accurate and relevant responses.
## Team members
1. [Arjun M S](https://github.com/arjun-ms)
2. [Muhammed Ajmal](https://github.com/ajmalmohad)
3. [Namitha S](https://github.com/Namitha-S-11465)
## Link to product walkthrough
[link to video](https://youtu.be/8ZkNIeTjeLI)
## How it Works ?
This web application harnesses the power of Langchain, OpenAI, and ChromaDB to facilitate efficient PDF document interaction.
Users upload PDFs, which are automatically processed into smaller text chunks for analysis.
OpenAI's language model is employed to handle user questions with precision, providing contextually relevant answers.
ChromaDB stores document embeddings, enabling retrieval of information from the PDFs.
Users simply input their queries, and the system matches these inquiries with the PDF content, offering concise and accurate responses.
## Libraries used
1. Chromadb
2. Langchain
3. OpenAI
4. PyPDF
## How to configure
1. Clone this repository
```
$git clone https://github.com/arjun-ms/Annual-Report-Reader.git
```
2. Install the libraries given in requirements.txt
```
$pip install -r requirements.txt
```
## How to Run
```
$streamlit run annualreport.py
```