An open API service indexing awesome lists of open source software.

https://github.com/dataforgoodfr/offseason_missiontransition_datasource


https://github.com/dataforgoodfr/offseason_missiontransition_datasource

Last synced: 5 months ago
JSON representation

Awesome Lists containing this project

README

          

# offseason_missiontransition

Scrap all the aids from the API : https://aides-territoires.beta.gouv.fr/api/aids/

For each one gives the information of :
- name
- url
- if error on orig url and the type
- if error on app url and the type
- list of pdfs if available on url
- Whether a pdf contains a word from the list given here

`CRITERIA_WORDS = ["conditions", "critères", "éligible", "éligibilité"]`

## Install

```
pip install .
python scripts/scrap_pdf_files.py
```

## How to read a pdf from url
Following script shows how to read content from a sample pdf, using csv file generated
by scrap_pdf_files.py. It also creates a .json file with associated pdfs urls & contents.
```
python scripts/read_pdf_content_tutorial.py
```