Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/abinashsahoo007/project-resume-classification
The document classification solution should significantly reduce the manual human effort in the HRM. It should achieve a higher level of accuracy and automation with minimal human intervention.
https://github.com/abinashsahoo007/project-resume-classification
corpus count-vectorizer label-encoding lemmitization machine-learning nltk part-of-speech-tagging resume-classification spacy stemming text-mining text-preprocessing textract tfidf-vectorizer tokenization wordcloud
Last synced: 6 days ago
JSON representation
The document classification solution should significantly reduce the manual human effort in the HRM. It should achieve a higher level of accuracy and automation with minimal human intervention.
- Host: GitHub
- URL: https://github.com/abinashsahoo007/project-resume-classification
- Owner: abinashsahoo007
- Created: 2024-06-30T13:52:38.000Z (4 months ago)
- Default Branch: main
- Last Pushed: 2024-07-03T11:40:57.000Z (4 months ago)
- Last Synced: 2024-10-31T04:06:07.365Z (6 days ago)
- Topics: corpus, count-vectorizer, label-encoding, lemmitization, machine-learning, nltk, part-of-speech-tagging, resume-classification, spacy, stemming, text-mining, text-preprocessing, textract, tfidf-vectorizer, tokenization, wordcloud
- Language: Jupyter Notebook
- Homepage:
- Size: 11.6 MB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Project-Resume-Classification
![1_HNxc6VCVkRCWV4Rdr0SMiQ](https://github.com/abinashsahoo007/Project-Resume-Classification/assets/174187930/63ead643-6570-4e34-9b7c-6ac7a94328ad)
## PPT Presentation:
[View presentation](https://docs.google.com/presentation/d/1mDSFxBfpzJGetrCUC_Eyt_1tfaiqTE0U/edit#slide=id.p37)# Business Problem:
The document classification solution should significantly reduce the manual human effort in the HRM. It should achieve a higher level of accuracy and automation with minimal human.# Objective:
The objective of a resume classification project typically involves automating the process of categorizing resumes into different job roles, industries, or skill sets. This can streamline the hiring process for recruiters and HR departments by enabling them to quickly identify the most relevant resumes from a large pool of applicants.# Abstract:
A resume is a brief summary of your skills and experience. Companies recruiters and HR teams have a tough time scanning thousands of qualified resumes. Spending too many labor hours segregating candidates resume's manually is a waste of a company's time, money, and productivity. Recruiters, therefore, use resume classification in order to streamline the resume and applicant screening process. NLP technology allows recruiters to electronically gather, store, and organize large quantities of resumes. Once acquired, the resume data can be easily searched through and analyzed.Resumes are an ideal example of unstructured data. Since there is no widely accepted resume layout, each resume may have its own style of formatting, different text blocks and different category titles. Building a resume classification and gathering text from it is no easy task as there are so many kinds of layouts of resumes that you could imagine.
# Architecture
![Overall-Architecture-of-the-Proposed-System](https://github.com/abinashsahoo007/Project-Resume-Classification/assets/174187930/59208bed-273e-47d8-9383-9fe97514d1fb)
>
**🔹The basic data analysis process performed such as data collection, text mining, data cleaning, exploratory data analysis, data visualization.**
>
**🔹Building a Machine learning model for Resume Classification using Python and basic Natural language processing techniques.**
>
**🔹Used Python's libraries to implement various NLP techniques like tokenization, lemmatization, parts of speech tagging, etc.**
>
**🔹A resume classification analyzes resume data and extracts the information into the machine-readable output. It helps automatically store, organize, and analyze the resume data to find out the candidate for the particular job position and requirements.**
>
**🔹The aim of this project is achieved by performing the various data analysis methods and using the Machine Learning models and Natural Language Processing which will help in classifying the categories of the resume and building the Resume Classification Model.**# Different ML Algorithm used:
1. Logistic Regression
2. DecisionTree Classifier
3. KNN Classifier
4. SVM Classifier
5. NaiveBayes Classifier
6. RandomForest Classifier
7. Bagging Classifier
8. AdaBoost Classifier
9. Gradient Boosting Classifier
10. Voting Classifier![Screenshot 2024-06-30 224032](https://github.com/abinashsahoo007/Project-Resume-Classification/assets/174187930/6255bd47-c6a0-4172-a377-200f14ea13ba)
# Final Model:
- Model Selected: **Random Forest Classifier**
>
**Random Forest Classifier :** RFC works by creating multiple decision trees during the training phase and outputting the mode of the classes (classification) or mean prediction (regression) of the individual trees.>
# Deployment :
*For Deployment **Streamlit App** is Used*
![Screenshot 2024-06-30 224534](https://github.com/abinashsahoo007/Project-Resume-Classification/assets/174187930/48f342e3-b2b0-42a8-a7fd-676326123b59)
![Screenshot 2024-06-30 224550](https://github.com/abinashsahoo007/Project-Resume-Classification/assets/174187930/499f6d69-8180-4b68-8688-eddff081c3b5)
## Thank You For Visiting.....