https://github.com/romerorodriguezd/feina-activa-job-scraper
CLI based application to parse jobs published by Servei d'Ocupació de Catalunya, throught its website, Feina Activa.
https://github.com/romerorodriguezd/feina-activa-job-scraper
automation beautifulsoup catalunya cli command-line-tool python selenium webscraping
Last synced: about 1 month ago
JSON representation
CLI based application to parse jobs published by Servei d'Ocupació de Catalunya, throught its website, Feina Activa.
- Host: GitHub
- URL: https://github.com/romerorodriguezd/feina-activa-job-scraper
- Owner: RomeroRodriguezD
- Created: 2024-06-04T14:10:56.000Z (about 2 years ago)
- Default Branch: main
- Last Pushed: 2024-06-04T16:01:31.000Z (about 2 years ago)
- Last Synced: 2024-06-05T16:16:41.392Z (about 2 years ago)
- Topics: automation, beautifulsoup, catalunya, cli, command-line-tool, python, selenium, webscraping
- Language: Python
- Homepage:
- Size: 129 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: readme.md
Awesome Lists containing this project
README
## Feina Activa (Servei d'Ocupació de Catalunya) CLI Job Scraper ##

This is a CLI based application made to parse jobs from the official webpage of Generalitat de Catalunya.
### Prerequisites ###
- Having the requirements installed (Selenium and BeautifulSoup).
- Having the webdriver downloaded and/or set to PATH, for the specific navigator. In my case I set it to Chrome.
### Arguments ###
The command line currently takes up to 4 arguments, two of them being mandatory (*output_path* and *results*), and,
at least, one of the remaining, *topic* and *province*.
- **output_path**: Absolute (preferably) path to stores JSON outputs.
- **results**: Integer for the number of results. Should be a divisor of 20 (each web updates increases the amount of jobs shown by 20).
- **topic**: Keyword/s used to search for specific words. If there is more than one, they should be space-separated and in quotes.
- **province**: Can be none, or one of the four catalan provinces (Barcelona, Girona, Tarragona, Lleida).
### Example ###
```
python .\main.py C:\feina_activa_parser\searchs --province barcelona --results 80
```
**Command line output**:

**JSON file**:

Results are indexed, starting from 0 as usual (80, in this example).