https://github.com/mrahmed14/retrieve

This project is an information retrieval system using TF-IDF vectorization


## About This Project
This project is an **information retrieval system** that lets users upload and search documents (PDF and TXT files). It uses **TF-IDF vectorization** to match search queries against the uploaded documents. The backend is built with **Flask**, and the frontend is built with **Next.js**.
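To illustrate how TF-IDF retrieval works in general, here is a minimal pure-Python sketch. It is not the project's actual code: the function names (`tokenize`, `build_index`, `to_vector`, `cosine`, `search`), the smoothed IDF formula, and the use of cosine similarity for ranking are all assumptions made for illustration, and the real backend may rely on a library instead.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def to_vector(tokens, idf):
    """Build a sparse TF-IDF vector (dict), ignoring terms not in the index."""
    counts = Counter(t for t in tokens if t in idf)
    return {term: tf * idf[term] for term, tf in counts.items()}


def build_index(docs):
    """Return (idf, vectors): IDF weights and one TF-IDF vector per document."""
    tokenized = [tokenize(d) for d in docs]
    n_docs = len(docs)
    # Document frequency: how many documents contain each term.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    # Smoothed IDF (an assumed variant) so common terms keep a positive weight.
    idf = {t: math.log((1 + n_docs) / (1 + c)) + 1 for t, c in df.items()}
    return idf, [to_vector(tokens, idf) for tokens in tokenized]


def cosine(a, b):
    """Cosine similarity between two sparse vectors."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def search(query, docs, idf, vectors):
    """Return (document, score) pairs with a nonzero score, best match first."""
    q = to_vector(tokenize(query), idf)
    scored = [(cosine(q, v), i) for i, v in enumerate(vectors)]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [(docs[i], score) for score, i in scored if score > 0]


if __name__ == "__main__":
    docs = [
        "Information retrieval finds relevant documents",
        "Flask is a Python web framework",
        "TF-IDF weighs terms for information retrieval",
    ]
    idf, vectors = build_index(docs)
    for doc, score in search("flask framework", docs, idf, vectors):
        print(f"{score:.3f}  {doc}")
```

Query terms that never appear in the indexed documents are dropped, so a query with no known terms returns an empty list rather than an arbitrary ranking.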

### Prerequisites
Ensure you have the following installed:
- [**Node.js**](https://nodejs.org/)
- [**Python**](https://www.python.org/)

## API Endpoints
### Upload Files
- **Endpoint:** `POST /upload`
- **Description:** Uploads PDF and TXT files.
- **Usage:**
```bash
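# The form field name ("file") and the file path are placeholders; check the Flask route for the field name it expects.
curl -X POST -F "file=@/path/to/document.pdf" http://127.0.0.1:5000/upload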
```

### Search Documents
- **Endpoint:** `GET /search?q=<keyword>`
- **Description:** Searches for a keyword in the uploaded files.
- **Usage:**
```bash
curl -X GET "http://127.0.0.1:5000/search?q=information"
```
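The same search request can be made from Python. The sketch below uses only the standard library and URL-encodes the query so that multi-word or special-character searches are sent correctly. The helper names (`build_search_url`, `search_documents`) are hypothetical, and the assumption that the endpoint returns JSON is not confirmed by this README.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Local development address taken from the curl examples.
BASE_URL = "http://127.0.0.1:5000"


def build_search_url(query, base_url=BASE_URL):
    """Build the /search URL with the query safely encoded in the q parameter."""
    return f"{base_url}/search?{urlencode({'q': query})}"


def search_documents(query, base_url=BASE_URL):
    """Call the search endpoint and parse the body (assumes a JSON response)."""
    with urlopen(build_search_url(query, base_url)) as response:
        return json.loads(response.read().decode("utf-8"))


if __name__ == "__main__":
    # Requires the Flask backend to be running locally.
    print(search_documents("information retrieval"))
```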

## Contributing
Feel free to submit **issues or pull requests** to improve this project!