Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/adarsh-k27r/salahkart

Collection of some of my works during my internship period at Salahkart for preview and educational purpose only.
https://github.com/adarsh-k27r/salahkart

bs4 cross-encoder database-design embeddings huggingface-transformers integration-testing ner nlp-machine-learning nltk numpy-arrays pandas python resume-parser selenium-webdriver sentence-transformers skills-assessment spacy-nlp sts transformers unit-testing

Last synced: 6 days ago
JSON representation

Collection of some of my works during my internship period at Salahkart for preview and educational purpose only.

Awesome Lists containing this project

README

        

# Internship Experience

**Software Engineer Intern**

**SalahKart- A Product of Jaspy Technologies Pvt. Ltd.**

**CIN: U78100UP2024PTC199951**

_Duration: 2 months_

## Roles and Responsibilities:

- **NLP and Web Scraping:**
- Employed **BeautifulSoup** and **Selenium WebDriver** to scrape data from LinkedIn profiles.
- Conducted **resume parsing**, extracting skills from resumes and job descriptions to calculate the matching percentage.

- **Skill Matching and Semantic Textual Similarity (STS):**
- Implemented three primary methods for skill matching:
- **SentenceTransformer Model**
- **CrossEncoder Model**
- **Dictionary Method** using a predefined skill list from a CSV file.
- Combined the methods using a normalized formula to compute the final skill matching percentage.
- Conducted **Semantic Textual Similarity (STS)** for accurate skill assessment.

- **Natural Language Processing (NLP):**
- Utilized **SpaCy** and **NLTK** for advanced NLP tasks.
- Developed a **Named Entity Recognition (NER) pipeline** using **Transformers by Hugging Face**.
- Extracted nouns and verbs from resumes on a section-wise basis for enhanced resume analysis.

- **Development and Testing:**
- Developed and tested code in **Python** using **NumPy**, **Pandas**, and regular expressions.
- Used **VS Code** and **Google Colab** for coding and collaboration.
- Designed and documented workflow using **Lucid Chart** for flow diagrams and brainstorming sessions.
- Created a backend database with a **Level 2 ER Diagram** using the **Eraser.io**.
- Performed **unit and integration testing** manually to ensure code quality and reliability.

## Technical Skills:

- Programming Languages: **Python**
- Libraries & Frameworks: **SpaCy**, **NLTK**, **NumPy**, **Pandas**, **BeautifulSoup**, **Selenium WebDriver**
- Machine Learning Models: **SentenceTransformer**, **CrossEncoder**
- Tools: **VS Code**, **Google Colab**, **Lucid Chart**, **Eraser.io**
- Database Design: **Level 2 ER Diagrams**
- Testing: **Manual Unit and Integration Testing**
- Additional Skills: **Transformers**, **NLP**, **Web Scraping**, **Semantic Textual Similarity (STS)**, **Named Entity Recognition (NER)**

## Alert

Provided Codes don't represent my complete work during internship and are Incomplete and partial. It's only for preview and educational purposes. These are not the Final production grade codes either. Kindly use it with caution ⚠