Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/dingoeatingfuzz/noaa-cdo
Last synced: 18 days ago
- Host: GitHub
- URL: https://github.com/dingoeatingfuzz/noaa-cdo
- Owner: DingoEatingFuzz
- Created: 2022-06-23T23:43:42.000Z (over 2 years ago)
- Default Branch: main
- Last Pushed: 2023-08-08T21:22:42.000Z (over 1 year ago)
- Last Synced: 2024-11-08T22:19:19.583Z (2 months ago)
- Language: Rust
- Size: 27.3 KB
- Stars: 0
- Watchers: 2
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
README
## NOAA CDO -> CSV -> Parquet -> DuckDB
This repo is two things:
1. A Rust script for converting the specially formatted NOAA CDO data into CSVs
2. A few Node scripts for creating DuckDB databases out of the resulting Parquet files**

**This repo does not include CSV -> Parquet processing, but that's because you [can grab this utility instead](https://github.com/domoritz/csv2parquet).
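
To give a feel for what the first step involves, here is a minimal sketch of turning one fixed-width station record into a CSV row. It is not the code in this repo. The `Station` struct and the `field`, `parse_station`, and `to_csv_row` helpers are hypothetical names, and the column offsets are an assumption based on the layout described in the GHCN-Daily readme for `ghcnd-stations.txt` (ID in columns 1-11, latitude 13-20, longitude 22-30, elevation 32-37, name 42-71), so check them against the readme before relying on them.

```rust
/// One station record. Only a subset of the columns is kept in this sketch.
#[derive(Debug, PartialEq)]
struct Station {
    id: String,
    latitude: f64,
    longitude: f64,
    elevation: f64,
    name: String,
}

/// Slice a fixed-width field by 1-based, inclusive column numbers and trim
/// the padding. Returns None if the line is too short to contain the field.
fn field(line: &str, start: usize, end: usize) -> Option<&str> {
    line.get(start - 1..end.min(line.len())).map(str::trim)
}

/// Parse one line of the stations file.
/// The column offsets below are assumed, not taken from this repo.
fn parse_station(line: &str) -> Option<Station> {
    Some(Station {
        id: field(line, 1, 11)?.to_string(),
        latitude: field(line, 13, 20)?.parse().ok()?,
        longitude: field(line, 22, 30)?.parse().ok()?,
        elevation: field(line, 32, 37)?.parse().ok()?,
        name: field(line, 42, 71)?.to_string(),
    })
}

/// Render a station as a CSV row. The name is quoted because station names
/// may contain commas; embedded quotes are doubled.
fn to_csv_row(s: &Station) -> String {
    let name = s.name.replace('"', "\"\"");
    format!(
        "{},{},{},{},\"{}\"",
        s.id, s.latitude, s.longitude, s.elevation, name
    )
}
```

Calling `parse_station` on each line of the file and writing `to_csv_row` for every `Some` result would yield a CSV that a tool such as csv2parquet can pick up. Lines that are too short or that hold non-numeric coordinates come back as `None` instead of panicking, which leaves the caller free to skip or log them.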
## TODO
1. Create a Makefile for downloading the NOAA data. For now, you can [grab it from the source](https://www1.ncdc.noaa.gov/pub/data/ghcn/daily/) (you want `ghcnd-stations.txt` and `ghcnd_gsn.tar.gz`).
2. Publish the built CSV/Parquet/DuckDB artifacts to GitHub releases, maybe.