https://github.com/mdneuzerling/aphhansard
https://github.com/mdneuzerling/aphhansard
Last synced: 4 months ago
JSON representation
- Host: GitHub
- URL: https://github.com/mdneuzerling/aphhansard
- Owner: mdneuzerling
- License: other
- Created: 2020-09-03T09:47:07.000Z (over 4 years ago)
- Default Branch: main
- Last Pushed: 2020-09-07T23:26:24.000Z (over 4 years ago)
- Last Synced: 2024-08-13T07:13:22.906Z (8 months ago)
- Language: R
- Size: 19.5 KB
- Stars: 2
- Watchers: 3
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
- jimsghstars - mdneuzerling/aphhansard - (R)
README
# aphhansard
[](https://github.com/mdneuzerling/aphhansard/tree/main)
[](https://github.com/mdneuzerling/aphhansard/actions)
[](https://choosealicense.com/licenses/mit/)The aim of this package is to parse [the Hansard of Australian Parliament House](https://www.aph.gov.au/Parliamentary_Business/Hansard) from XML into rectangular data frames. The Hansard is a written record of the proceedings of the Australian federal parliament.
This is a work in progress, and the vast majority of the Hansard cannot yet be parsed.
The Hansard is [provided by the Australian Parliament House](https://www.aph.gov.au/Help/Disclaimer_Privacy_Copyright) under a [Creative Commons Attribution-NonCommercial-NoDerivs 3.0 Australia](creativecommons.org/licenses/by-nc-nd/3.0/au) licence.
## Installation
You can install the latest version from GitHub with:
``` r
remotes::install_packages("mdneuzerling/aphhansard")
```## Example
The largest functional component implemented thus far is the parsing of the "chamber" node of a single day of Hansard. This functionality has only been tested for the day below:
``` r
library(aphhansard)
xml_link <- "https://parlinfo.aph.gov.au/parlInfo/download/chamber/hansardr/2005-03-16/toc_unixml/3766-5.xml;fileType=text%2Fxml"
xml <- xml2::read_xml(xml_link)
chamber <- xml2::xml_child(xml, "chamber.xscript")
parse_node(chamber)
```