An open API service indexing awesome lists of open source software.

https://github.com/jwheare/bizparse

Python scraper for parsing the House of Commons Future Business pages
https://github.com/jwheare/bizparse

Last synced: about 1 year ago
JSON representation

Python scraper for parsing the House of Commons Future Business pages

Awesome Lists containing this project

README

          

bizparse.py

A scraper for parsing the House of Commons Future Business pages
http://www.publications.parliament.uk/pa/cm/cmfbusi/fbusi.htm

Usage:
./bizparse.py

Writes an XML file to bizparseYYYY-MM-DD.xml for the period ending date.
Ouputs human readable debug logging for the data extracted to stdout

Uses BeautifulSoup: http://www.crummy.com/software/BeautifulSoup/