https://github.com/joelkoen/simple-abns
Simplify the ABR's Australian Business Number dataset for easier analysis
- Host: GitHub
- URL: https://github.com/joelkoen/simple-abns
- Owner: joelkoen
- License: mit
- Created: 2024-04-26T06:43:14.000Z (7 months ago)
- Default Branch: main
- Last Pushed: 2024-04-26T06:43:21.000Z (7 months ago)
- Last Synced: 2024-10-05T19:34:49.752Z (about 1 month ago)
- Topics: abn, australia
- Language: Rust
- Homepage:
- Size: 11.7 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE
README
# simple-abns
simple-abns parses the [ABR's Australian Business Number dataset](https://data.gov.au/data/dataset/abn-bulk-extract) and converts it to a simpler JSON format.
You can download a copy of the [converted dataset](https://pub.joel.net.au/datasets/simple-abns/2024-04-24-simple-abns.jsonl.zst). Note that this is not updated automatically - please open an issue if it could use a refresh.
You can also find machine-readable names for the entity types the ABR uses in [./entity_types.json](./entity_types.json).
If you'd like to generate the dataset yourself, you'll need to download the raw XML data and place all 20 chunks in `./raw`. simple-abns will parse them and print each ABN record as a separate line. You can see progress and compress the output using:
```sh
cargo run --release | pv -ls 18M | zstd -T0 -9 > simple-abns.jsonl.zst
```

## Example
**Input:**
```xml
88712649015
SGE
State Government Entity
STATE EMERGENCY SERVICE (NSW)
NSW
2500
NEW SOUTH WALES STATE EMERGENCY SERVICE
```
**Output:**
```json
{
  "abn": "88712649015",
  "status": "Active",
  "status_since": "1999-11-01",
  "last_updated": "2024-04-12",
  "entity_name": {
    "type": "NonIndividual",
    "name": "STATE EMERGENCY SERVICE (NSW)"
  },
  "entity_type": "SGE",
  "trade_names": [
    "NEW SOUTH WALES STATE EMERGENCY SERVICE"
  ],
  "postcode": "2500",
  "state": "NSW",
  "gst_status": "Active",
  "gst_status_since": "2000-07-01"
}
```
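
If you are consuming the converted dataset from Rust, the output record above could be modelled roughly as follows. This is only a sketch: the field names are taken from the example output, but the type names (`AbnRecord`, `EntityName`), the helper methods, and the choice of plain `String` fields are assumptions for illustration and are not taken from the simple-abns source. Other records may omit some fields, in which case `Option` would be the better fit.

```rust
// Hypothetical types mirroring the example JSON output.
// Field names follow the example; everything else is an assumption.

#[derive(Debug, Clone, PartialEq)]
pub struct EntityName {
    /// The JSON "type" field, e.g. "NonIndividual" ("type" is a Rust keyword).
    pub kind: String,
    pub name: String,
}

#[derive(Debug, Clone, PartialEq)]
pub struct AbnRecord {
    pub abn: String,
    pub status: String,
    pub status_since: String,
    pub last_updated: String,
    pub entity_name: EntityName,
    pub entity_type: String,
    pub trade_names: Vec<String>,
    pub postcode: String,
    pub state: String,
    pub gst_status: String,
    pub gst_status_since: String,
}

impl AbnRecord {
    /// True when both the ABN status and the GST status are "Active".
    pub fn is_active_with_gst(&self) -> bool {
        self.status == "Active" && self.gst_status == "Active"
    }

    /// The entity name followed by every trade name.
    pub fn all_names(&self) -> Vec<&str> {
        let mut names = vec![self.entity_name.name.as_str()];
        names.extend(self.trade_names.iter().map(String::as_str));
        names
    }
}

/// Builds the record from the example output by hand (no JSON parsing).
pub fn example_record() -> AbnRecord {
    AbnRecord {
        abn: "88712649015".to_string(),
        status: "Active".to_string(),
        status_since: "1999-11-01".to_string(),
        last_updated: "2024-04-12".to_string(),
        entity_name: EntityName {
            kind: "NonIndividual".to_string(),
            name: "STATE EMERGENCY SERVICE (NSW)".to_string(),
        },
        entity_type: "SGE".to_string(),
        trade_names: vec!["NEW SOUTH WALES STATE EMERGENCY SERVICE".to_string()],
        postcode: "2500".to_string(),
        state: "NSW".to_string(),
        gst_status: "Active".to_string(),
        gst_status_since: "2000-07-01".to_string(),
    }
}
```

In a real consumer you would probably deserialize each line of the `.jsonl` file with a JSON library rather than build records by hand. The sketch sticks to the standard library so that it compiles without extra dependencies.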