Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/jwheare/bizparse
Python scraper for parsing the House of Commons Future Business pages
- Host: GitHub
- URL: https://github.com/jwheare/bizparse
- Owner: jwheare
- License: bsd-3-clause
- Created: 2009-08-13T00:10:26.000Z (about 15 years ago)
- Default Branch: master
- Last Pushed: 2009-08-13T07:39:06.000Z (about 15 years ago)
- Last Synced: 2023-04-13T08:18:50.918Z (over 1 year ago)
- Language: Python
- Homepage:
- Size: 125 KB
- Stars: 1
- Watchers: 3
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.txt
- License: LICENSE.txt
README
bizparse.py
A scraper for parsing the House of Commons Future Business pages
http://www.publications.parliament.uk/pa/cm/cmfbusi/fbusi.htm

Usage:
./bizparse.py

Writes an XML file named bizparseYYYY-MM-DD.xml, where YYYY-MM-DD is the ending date of the period covered.
Outputs human-readable debug logging for the extracted data to stdout.

Uses BeautifulSoup: http://www.crummy.com/software/BeautifulSoup/
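
The output step described above (a dated XML file plus readable logging on stdout) could look roughly like the sketch below. It is not taken from bizparse.py: the function names, the element names (futurebusiness, item) and the sample entry are all hypothetical, and it uses only the standard library rather than BeautifulSoup, since it covers the writing stage and not the page parsing.

```python
import datetime
import xml.etree.ElementTree as ET


def output_filename(period_end):
    """Build a bizparseYYYY-MM-DD.xml name from the period's ending date."""
    return "bizparse%s.xml" % period_end.isoformat()


def build_xml(period_end, items):
    """Wrap extracted (date, title) pairs in an XML tree.

    The element and attribute names here are assumptions made for
    illustration; the real script's schema may differ.
    """
    root = ET.Element("futurebusiness", {"enddate": period_end.isoformat()})
    for date, title in items:
        item = ET.SubElement(root, "item", {"date": date})
        item.text = title
    return root


def log_items(items):
    """Print a human-readable line per extracted item to stdout."""
    for date, title in items:
        print("%s: %s" % (date, title))


def write_xml(root, path):
    """Serialise the tree to disk as UTF-8 with an XML declaration."""
    ET.ElementTree(root).write(path, encoding="utf-8", xml_declaration=True)
```

A caller would collect the scraped entries, pass them to log_items for the debug output, and then call write_xml(build_xml(period_end, items), output_filename(period_end)) to produce the dated file.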