Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/mavimo/scadenze_fiscali
Create a scraper to generate a parsable output (CSV, XML,JSON, ..) of tax deadline in Italy
https://github.com/mavimo/scadenze_fiscali
Last synced: about 2 months ago
JSON representation
Create a scraper to generate a parsable output (CSV, XML,JSON, ..) of tax deadline in Italy
- Host: GitHub
- URL: https://github.com/mavimo/scadenze_fiscali
- Owner: mavimo
- Created: 2012-10-20T16:17:49.000Z (over 12 years ago)
- Default Branch: master
- Last Pushed: 2012-10-20T17:36:08.000Z (over 12 years ago)
- Last Synced: 2024-12-01T04:52:12.356Z (about 2 months ago)
- Language: Python
- Size: 309 KB
- Stars: 1
- Watchers: 3
- Forks: 1
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Scadenze fiscali
Create a scraper to generate a parsable output (CSV, XML,JSON, ..) of tax deadline in Italy.
## Install
Install [scrapy](http://scrapy.org/) using PIP:
```bash
sudo pip install scrapy
```clone this repo:
```bash
git clone [email protected]:mavimo/scadenze_fiscali.git
```## Run it
Open a terminal and go in the project folder
```bash
cd scadenze_fiscali
```Choise the exporter, you can use ```JSON```, ```XML``` and ```CSV``` format (you can also use other exporter from scrapy, now you can run scraper using:
```bash
scrapy crawl scadenze_fiscali -o result/items.xml -t xml
```* ```-o```: parameter define the generated result file
* ```-t```: pameter define the export format## Generated data
Generated file have the following structure:
* ```items```
* ```item```
* ```when```: day of deadline
* ```who```: who need to respect this deadline
* ```what```: the deadline description
* ```how```: how user can "resolve" the deadline
* ```code```: code of this deadline type
* ```type```: type of responsability
* ```category```: Category of user that must rquire to respect the deadlineYou can find more info in the wiki (Work In Progres)
## Know issues
* This tool grab data for the specified year information. This data is wired in code, so if you require information for a different year is required to change this information directlry in the
spider file.
* Exported JSON data generate have text encoding error. Is required for you to re-convert generated file.## Disclaimer
All information will be grabbed directly from [Inland Revenue](http://www1.agenziaentrate.gov.it/documentazione/scadenzefiscali/index.htm). I don't assume responsability for data or damange that
this data can causate.