https://github.com/edsu/data-gov-uk-harvester
tiny little project to harvest rdfa metadata from data.gov.uk
https://github.com/edsu/data-gov-uk-harvester
Last synced: over 1 year ago
JSON representation
tiny little project to harvest rdfa metadata from data.gov.uk
- Host: GitHub
- URL: https://github.com/edsu/data-gov-uk-harvester
- Owner: edsu
- Created: 2010-11-26T03:12:49.000Z (over 15 years ago)
- Default Branch: master
- Last Pushed: 2010-12-02T11:46:55.000Z (over 15 years ago)
- Last Synced: 2024-10-12T05:57:51.380Z (over 1 year ago)
- Language: Python
- Homepage:
- Size: 2.78 MB
- Stars: 6
- Watchers: 3
- Forks: 2
- Open Issues: 0
-
Metadata Files:
- Readme: README
Awesome Lists containing this project
README
This is a rdfa crawler for the data.gov.uk site. It walks the
complete listing of datasets and extracts the metadata from the
HTML that is expressed as RDFa.
After you install python and rdflib you should be able to run it:
% crawl.py
Comments, questions:
Ed Summers