Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/nationalparkservice/romn-soilsetl
Rocky Mountain Network - Soils Extract Transform and Load Script
etl soils
- Host: GitHub
- URL: https://github.com/nationalparkservice/romn-soilsetl
- Owner: nationalparkservice
- License: gpl-3.0
- Created: 2022-09-19T17:50:18.000Z (about 2 years ago)
- Default Branch: main
- Last Pushed: 2023-06-23T18:06:00.000Z (over 1 year ago)
- Last Synced: 2024-04-14T19:24:20.332Z (7 months ago)
- Topics: etl, soils
- Language: Python
- Homepage:
- Size: 78.1 KB
- Stars: 0
- Watchers: 7
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE.md
README
# romn-SoilsETL
The GitHub 'romn-SoilsETL' repository contains the Python scripts (*ROMN_Soils_ETL_ToSoilsDB_gte2022.py* and *ROMN_Soils_ETL_To_SoilsDB_Pre2022.py*) that perform Extract, Transform, and Load (ETL) of the Colorado State University Soils Lab Electronic Data Deliverable (EDD) into the master soils dataset of the Rocky Mountain Network Soil Database, 'tbl_SoilChemistry_Dataset'. Soils ETL is used for Uplands Vegetation and Wetlands Ecological Integrity protocol workflow processing at the NPS IMD Rocky Mountain Network.
**Defines Matching Metadata for Uplands Vegetation (VCSS) and Wetlands events in the Soils database.** The Uplands Vegetation and Wetlands event tables must be linked to the most current databases in the Soils database defined by the 'soilsDB' parameter.
**Defines the matching parameter name and units** as defined in the 'tlu_NameUnitCrossWalk' lookup table.
**Appends the transformed data (i.e. ETL)** to the master soils dataset 'tbl_SoilChemistry_Dataset' via the pandas DataFrame `to_sql` method, using the [sqlalchemy-access](https://pypi.org/project/sqlalchemy-access/) SQLAlchemy dialect for Microsoft Access. Install it in your Python environment with `pip install sqlalchemy-access`.
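
The crosswalk and append steps described above can be sketched roughly as follows. This is a minimal illustration, not the repository's code: the EDD column names (`SampleID`, `LabParameter`, `Value`), the crosswalk columns, the sample values, and the helper functions `transform` and `append_to_dataset` are all hypothetical, and an in-memory SQLite connection stands in for the Access-backed Soils database so the example runs without any database driver.

```python
import sqlite3

import pandas as pd

# Hypothetical EDD rows as delivered by the lab (column names are assumptions).
edd = pd.DataFrame(
    {
        "SampleID": ["S1", "S1", "S2"],
        "LabParameter": ["pH", "NO3-N ppm", "pH"],
        "Value": [6.8, 4.2, 7.1],
    }
)

# Hypothetical stand-in for the 'tlu_NameUnitCrossWalk' lookup table.
crosswalk = pd.DataFrame(
    {
        "LabParameter": ["pH", "NO3-N ppm"],
        "ParameterName": ["pH", "Nitrate-N"],
        "Units": ["pH units", "ppm"],
    }
)


def transform(edd_df, crosswalk_df):
    """Attach standardized parameter names and units to each EDD row."""
    merged = edd_df.merge(
        crosswalk_df, on="LabParameter", how="left", validate="many_to_one"
    )
    # Fail loudly if the lab reported a parameter the crosswalk does not know.
    unmatched = merged["ParameterName"].isna()
    if unmatched.any():
        missing = sorted(merged.loc[unmatched, "LabParameter"].unique())
        raise ValueError(f"No crosswalk entry for: {missing}")
    return merged[["SampleID", "ParameterName", "Units", "Value"]]


def append_to_dataset(df, conn, table="tbl_SoilChemistry_Dataset"):
    """Append rows to the target table, creating it if it does not exist."""
    df.to_sql(table, conn, if_exists="append", index=False)


# SQLite in memory is only a stand-in for the real Soils database.
conn = sqlite3.connect(":memory:")
append_to_dataset(transform(edd, crosswalk), conn)
loaded = pd.read_sql("SELECT * FROM tbl_SoilChemistry_Dataset", conn)
```

Against the actual Soils database, the SQLite connection would be replaced by a SQLAlchemy engine created through the sqlalchemy-access dialect. `if_exists="append"` is what keeps existing records in place and adds the new rows after them.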
## ROMN_Soils_ETL_ToSoilsDB_gte2022.py
Extracts the soils EDD records for ROMN field season 2022 from the Colorado State University Soil, Water and Plant Testing laboratory, following the laboratory's move from Fort Collins to Denver in 2022. This is the most current ETL script as of 5/3/2023.
## ROMN_Soils_ETL_ToSoilsDB.py
Extracts the soils EDD records for ROMN field season 2021 from the Colorado State University Soil, Water and Plant Testing laboratory prior to the move from Fort Collins to Denver in 2022. This ETL route was used for field season 2021 soils data.