An open API service indexing awesome lists of open source software.

https://github.com/yo1am1/scrapy-work

Web Scraper for Python Developer Vacancies
https://github.com/yo1am1/scrapy-work

python scraper scrapy scrapy-crawler web work

Last synced: about 1 year ago
JSON representation

Web Scraper for Python Developer Vacancies

Awesome Lists containing this project

README

          

# Scrapy-work: Web Scraper for Python Developer Vacancies
[![License: MIT](https://img.shields.io/github/license/yo1am1/scrapy-work)](https://github.com/yo1am1/scrapy-work/blob/main/LICENSE)
[![GitHub last commit](https://img.shields.io/github/last-commit/yo1am1/scrapy-work)](https://github.com/yo1am1/scrapy-work/commits/main)
![Lint](https://github.com/yo1am1/scrapy-work/actions/workflows/black.yml/badge.svg?event=push)
![GitHub Latest Pre-Release)](https://img.shields.io/github/v/release/yo1am1/scrapy-work?include_prereleases&label=pre-release&logo=github)

## Installation and Usage 🧠
🥐 **Note: At the moment, the scraper only supports Work.ua. Updates and additional website support are coming soon!**

1. **_Clone_** the repository:
```bash
git clone https://github.com/yo1am1/scrapy-work.git
```

2. **_Navigate_** to the project directory and either open it in PyCharm or use the terminal.

3. **_Install_** the [requirements.txt](requirements.txt):
```bash
pip install -r requirements.txt
```
> [!IMPORTANT]\
> The `requirements.txt` file contains all the dependencies needed to run the scraper locally.

4. **_Run_** web scrapper with command:
```bash
scrapy crawl VacanciesSpider -O data.json
```
> [!NOTE]\
> Where `VacanciesSpider` - is the name of the scrapper, `-O data.json` - rewrites our data to [data.json](/workua/data.json) file.

> [!IMPORTANT]\
> Use this command inside the project folder

5. ### _Good luck at finding job_!

## License
This project is licensed under the [MIT License](LICENSE).