https://github.com/Irio/national-congress-data
Datasets of congresspeople's attendance, votes, and propositions since the past century
- Host: GitHub
- URL: https://github.com/Irio/national-congress-data
- Owner: Irio
- License: gpl-3.0
- Created: 2017-12-09T22:44:10.000Z (almost 7 years ago)
- Default Branch: master
- Last Pushed: 2017-12-30T13:57:59.000Z (almost 7 years ago)
- Last Synced: 2024-08-03T06:02:34.745Z (3 months ago)
- Language: Jupyter Notebook
- Homepage: https://www.kaggle.com/iriomk/brazilian-federal-legislative-activity
- Size: 66.4 KB
- Stars: 10
- Watchers: 2
- Forks: 3
- Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE
# National Congress of Brazil
## Setup
```console
$ pip3 install -r requirements.txt
```

## Data collection
To download the sources from the Chamber of Deputies website, use the `src/fetch_*.py` scripts and Scrapy's `crawl` command:
```console
$ python3 src/fetch_propositions.py
$ cd data_collection
$ scrapy crawl chamber_of_deputies_sessions \
--output ../data/sources/sessions.csv
$ cd ..
```

The files will be downloaded to `data/sources`. To extract them to `data/sessions`:
```console
$ python3 src/extract_content_files.py
```
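
The collection and extraction steps above could also be chained from a single script. The following is only a minimal sketch, not part of the repository: the function names `build_steps` and `run_steps` are hypothetical, and it assumes the script is run from the repository root with `python3` and `scrapy` available on the `PATH`. It lists only the commands shown in this README; any other `src/fetch_*.py` scripts would need to be added by hand.

```python
import subprocess
from pathlib import Path


def build_steps(repo_root="."):
    """Return (command, working_directory) pairs in the order the README lists them.

    Hypothetical helper: the repository itself does not ship this function.
    """
    root = Path(repo_root)
    return [
        # 1. Download the proposition sources.
        (["python3", "src/fetch_propositions.py"], root),
        # 2. Crawl the sessions with Scrapy, run from inside data_collection/.
        (
            [
                "scrapy", "crawl", "chamber_of_deputies_sessions",
                "--output", "../data/sources/sessions.csv",
            ],
            root / "data_collection",
        ),
        # 3. Extract the downloaded files.
        (["python3", "src/extract_content_files.py"], root),
    ]


def run_steps(steps, dry_run=True):
    """Print each step; execute it only when dry_run is False."""
    for command, cwd in steps:
        print(f"[{cwd}] {' '.join(command)}")
        if not dry_run:
            # check=True stops the pipeline at the first failing step.
            subprocess.run(command, cwd=cwd, check=True)


# Dry run by default: only prints the commands, nothing is downloaded.
run_steps(build_steps(), dry_run=True)
```

Passing `dry_run=False` to `run_steps` would actually execute the commands, so it is worth checking the printed list first.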