https://github.com/stef/ksh-scraper
scraper for the hungarian statistics office
https://github.com/stef/ksh-scraper
Last synced: 11 months ago
JSON representation
scraper for the hungarian statistics office
- Host: GitHub
- URL: https://github.com/stef/ksh-scraper
- Owner: stef
- Created: 2010-12-04T17:09:58.000Z (over 15 years ago)
- Default Branch: master
- Last Pushed: 2010-12-09T00:39:05.000Z (over 15 years ago)
- Last Synced: 2025-04-16T00:42:05.835Z (about 1 year ago)
- Language: Python
- Homepage:
- Size: 572 KB
- Stars: 9
- Watchers: 3
- Forks: 3
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# KSH 2 CSV
is a [H.A.C.K.](http://hspbp.org/) contribution to the [2010 Open Data Hackday](http://www.opendataday.org/). KSH is the [Hungarian Central Statistical Office](http://portal.ksh.hu/portal/page?_pageid=38,119919&_dad=portal&_schema=PORTAL) and it publishes it's data in some unprocessable html forms (btw those guys should look into sql injections to their precious Oracle DB).
## Usage
### console
ksh_scrape.py http://portal.ksh.hu/pls/ksh/docs/hun/xstadat/xstadat_eves/i_onp004b.html
### web
Simply prepend the URL from portal.ksh.hu with our online ksh-scrape webservice url **http://mx.ctrlc.hu/ksh-scrape?**, like this: [http://mx.ctrlc.hu/ksh-scrape?http://portal.ksh.hu/pls/ksh/docs/hun/xstadat/xstadat_eves/i_zoi011.html](http://mx.ctrlc.hu/ksh-scrape?http://portal.ksh.hu/pls/ksh/docs/hun/xstadat/xstadat_eves/i_zoi011.html)
### web + automagic
Using Greasemonkey [Firefox](https://addons.mozilla.org/en-US/firefox/addon/748/), [Chrome](http://blog.chromium.org/2010/02/40000-more-extensions.html), [IE](http://www.gm4ie.com/), you can install also the [userscript](https://github.com/stef/ksh-scraper/raw/master/ksh2csv.user.js) which will automatically insert "download as CSV" links on the pages with html-encumbered data.