https://github.com/rohit-2301/hiresense
HireSense is an AI-powered resume classifier that uses NLP and Machine Learning to predict the best-fit job role from a PDF resume. Built with Streamlit, it features a clean UI for uploading resumes and instantly suggests roles like Data Scientist, Full Stack Developer, and DevOps Engineer.
https://github.com/rohit-2301/hiresense
joblib ml nlp pymupdf python scikit-learn streamlit tfidfvectorizer
Last synced: 3 months ago
JSON representation
HireSense is an AI-powered resume classifier that uses NLP and Machine Learning to predict the best-fit job role from a PDF resume. Built with Streamlit, it features a clean UI for uploading resumes and instantly suggests roles like Data Scientist, Full Stack Developer, and DevOps Engineer.
- Host: GitHub
- URL: https://github.com/rohit-2301/hiresense
- Owner: Rohit-2301
- Created: 2025-07-13T06:23:32.000Z (3 months ago)
- Default Branch: main
- Last Pushed: 2025-07-13T06:54:24.000Z (3 months ago)
- Last Synced: 2025-07-13T08:39:30.428Z (3 months ago)
- Topics: joblib, ml, nlp, pymupdf, python, scikit-learn, streamlit, tfidfvectorizer
- Language: Python
- Homepage: https://hiresense-peqy6gcp2srkufdpmh7mbp.streamlit.app
- Size: 301 KB
- Stars: 0
- Watchers: 0
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# 🤖 HireSense – AI-Powered Resume Classifier
[](https://streamlit.io)
[](https://www.python.org/)
[](LICENSE)> An intelligent resume classification system that predicts the best-fit job role from a given resume PDF using NLP + Machine Learning.
---
## Live Demo
**Try it here**:
🔗 [https://hiresense-peqy6gcp2srkufdpmh7mbp.streamlit.app](https://hiresense-peqy6gcp2srkufdpmh7mbp.streamlit.app)---
## Preview
---
## Features
- Upload a resume PDF
- Extracts clean text using PyMuPDF
- Predicts job role using Logistic Regression
- Supports job categories like:
- Data Scientist
- Backend Developer
- Frontend Developer
- Full Stack Engineer
- DevOps Engineer
- Clean interactive UI powered by Streamlit---
## 🛠 Tech Stack
| Area | Tools & Libraries |
|--------------|-------------------------------------------------------------|
| Language | Python |
| ML/NLP | `scikit-learn`, `pandas`, `joblib`, `TfidfVectorizer` |
| PDF Parsing | `PyMuPDF` (`fitz`) |
| Web UI | `Streamlit` |---
## 📁 Folder Structure
```bash
HireSense/
├── app.py # Streamlit App
├── requirements.txt # Dependencies
├── model/ # Trained ML model + vectorizer
├── preprocess/ # Data processing scripts
├── data/ # labels.csv, resume texts
├── .gitignore
└── README.md---
## How to Run Locally
### 1. Install dependencies
```bash
pip install -r requirements.txt
````### 2. Run the Streamlit app
```bash
streamlit run app.py
```---
## How to Train Your Own Model
If you want to retrain the model using the Kaggle dataset:
```bash
python preprocess/convert_kaggle_dataset.py
python model/train_model.py
```* The first script converts the CSV into labeled text files.
* The second script trains a logistic regression model and saves it as `.pkl`.````