https://github.com/invictusaman/glassdoor-webscraper

I designed a scraping 🕸️ tool to extract job posting data from Glassdoor. This scraping tool will return job title, company name, job id, location, salary, language and skills and many more.

data-analyst data-collection glassdoor glassdoor-scraper python3 webscraping


# Glassdoor Scraper
I designed a scraping 🕸️ tool to extract job posting data from Glassdoor. The tool returns the job title, company name, job ID, location, salary, language, skills, and more.

*Extracting data from Glassdoor was easier than from Indeed because Glassdoor job postings are organized and properly labelled, and Glassdoor also provides an estimated salary when a posting does not list one.*

**Thank you Glassdoor**


## Step 1: Install dependencies

Install the required dependencies from your project folder.

```
pip install -r requirements.txt
```

## Step 2: Run Glassdoor_Scraper.py

Make sure the latest version of Chrome ⬇️ is installed on your system. Running the script creates `scraped_glassdoor_job_file.csv` with all columns. *You can check the sample output in this repository; I extracted it for the Data Analyst position in Canada.*
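To confirm that the run produced what you expect, you can take a quick look at the generated file. The snippet below is a minimal sketch using only the Python standard library. The helper name `summarize_csv` is made up for illustration and is not part of this repository, and it assumes the file is UTF-8 encoded with a header row. It does not assume any particular column names.

```python
import csv


def summarize_csv(path):
    """Return (column_names, row_count) for a CSV file that has a header row.

    Hypothetical helper, not part of Glassdoor_Scraper.py.
    Assumes the file is UTF-8 encoded.
    """
    with open(path, newline="", encoding="utf-8") as f:
        reader = csv.DictReader(f)
        rows = list(reader)
        # fieldnames is None when the file is completely empty.
        columns = list(reader.fieldnames or [])
    return columns, len(rows)
```

For example, `columns, count = summarize_csv("scraped_glassdoor_job_file.csv")` gives you the header names and the number of scraped postings.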

### Further Work:

Currently, only one web address can be processed per run. A future version could take a list of addresses and an index value, fetch each URL one by one, scrape it, and write either a single combined output or one output per URL.
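One possible shape for that multi-address loop is sketched below. This is only an illustration under stated assumptions: `scrape_many` and its `scrape_one` parameter are hypothetical names, and `scrape_one` stands in for whatever function in `Glassdoor_Scraper.py` turns one URL into a list of row dictionaries (the real script may be organized differently). The sketch writes a single combined CSV and tags each row with the index and URL it came from.

```python
import csv


def scrape_many(urls, scrape_one, out_path):
    """Scrape each URL in order and write all rows to one combined CSV.

    urls       -- list of address strings to process
    scrape_one -- hypothetical callable: takes a URL, returns a list of dicts
    out_path   -- where to write the combined CSV
    Returns the total number of rows written.
    """
    all_rows = []
    for index, url in enumerate(urls):
        for row in scrape_one(url):
            # Tag each row with the address it came from.
            all_rows.append({"source_index": str(index), "source_url": url, **row})

    if not all_rows:
        return 0

    # Union of all column names, in first-seen order.
    fieldnames = list(dict.fromkeys(key for row in all_rows for key in row))
    with open(out_path, "w", newline="", encoding="utf-8") as f:
        writer = csv.DictWriter(f, fieldnames=fieldnames, restval="")
        writer.writeheader()
        writer.writerows(all_rows)
    return len(all_rows)
```

To produce one output per URL instead, the same loop could call the writer inside the `for` body with a file name derived from `index`.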

#### Follow my data-analyst journey: [Portfolio_Link](https://www.amanbhattarai.com)