{"id":25817292,"url":"https://github.com/landscapegeoinformatics/grqa_src","last_synced_at":"2025-02-28T06:33:50.184Z","repository":{"id":128475238,"uuid":"335576210","full_name":"LandscapeGeoinformatics/GRQA_src","owner":"LandscapeGeoinformatics","description":"Scripts used during the creation of the Global River Water Quality Archive (GRQA)","archived":false,"fork":false,"pushed_at":"2022-09-07T07:06:50.000Z","size":87,"stargazers_count":4,"open_issues_count":0,"forks_count":0,"subscribers_count":3,"default_branch":"main","last_synced_at":"2024-06-05T19:27:52.621Z","etag":null,"topics":["hydrology","python","water-quality"],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/LandscapeGeoinformatics.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null}},"created_at":"2021-02-03T09:46:07.000Z","updated_at":"2024-02-26T04:50:50.000Z","dependencies_parsed_at":null,"dependency_job_id":"eb721ab6-85fa-47c6-a91e-d1c4544475d3","html_url":"https://github.com/LandscapeGeoinformatics/GRQA_src","commit_stats":{"total_commits":25,"total_committers":2,"mean_commits":12.5,"dds":0.4,"last_synced_commit":"1f6ce510e4db9f74672789a944215acf9c535702"},"previous_names":[],"tags_count":3,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/LandscapeGeoinformatics%2FGRQA_src","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/LandscapeGeoinformatics%2FGRQA_src/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/LandscapeGeoinformatics%2FGRQA_src/releases","manifests_url":"https://
repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/LandscapeGeoinformatics%2FGRQA_src/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/LandscapeGeoinformatics","download_url":"https://codeload.github.com/LandscapeGeoinformatics/GRQA_src/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":241112446,"owners_count":19911690,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["hydrology","python","water-quality"],"created_at":"2025-02-28T06:33:49.442Z","updated_at":"2025-02-28T06:33:50.174Z","avatar_url":"https://github.com/LandscapeGeoinformatics.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# GRQA_src\n\n[![DOI](https://zenodo.org/badge/DOI/10.5281/zenodo.5082147.svg)](https://doi.org/10.5281/zenodo.5082147)\n\nScripts used during the creation of the Global River Water Quality Archive (GRQA).\n\nThe dataset can be downloaded at \u003chttps://zenodo.org/record/5101057\u003e\n\nThe data description paper is available at \u003chttps://essd.copernicus.org/articles/13/5483/2021/\u003e\n\nThe scripts are divided into two folders. Folder **preprocessing** contains scripts used for preprocessing raw source data into a common structure used for GRQA. 
Folder **grqa_processing** contains scripts used for processing the merged data, generating plots and statistics.\n\n**preprocessing** contains the following scripts:\n* *\\*\\_download* used for downloading source data\n* *\\*\\_units* for collecting water quality parameter units when multiple units per parameter were present in source data\n* *\\*\\_preprocessing* for source data cleaning and parameter harmonization to convert into a common structure used in GRQA\n* *WQP\\_merge\\_stats* for merging WQP time series statistics\n\n**grqa\\_processing** contains the following scripts:\n* *\\*\\_param\\_codes* for creating a list of GRQA parameters used as an input for the parallel implementation of *\\*\\_obs\\_merging*\n* *\\*\\_obs\\_merging* used for merging harmonized source data, calculating time series statistics per site (outliers, monthly availability, continuity) and flagging potential duplicate observations\n* *\\*\\_param\\_stats* for calculating GRQA time series statistics per parameter\n* *\\*\\_plot\\_sites* for creating maps of observation site distribution, monthly availability, monthly continuity and median value per parameter\n* *\\*\\_plot\\_hist* for creating temporal distribution plots, histograms and box plots per parameter\n* *\\*\\_plot\\_sites\\_grid* for creating maps of observation site distribution, monthly availability, monthly continuity and median value of DO, DOC, TP and TSS for the paper\n* *\\*\\_plot\\_hist\\_grid* for creating temporal distribution plots, histograms and box plots of DO, DOC, TP and TSS for the paper\n\nEach Python script has a corresponding shell script that was used for submitting Slurm jobs to the HPC cluster of the University of 
Tartu.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Flandscapegeoinformatics%2Fgrqa_src","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Flandscapegeoinformatics%2Fgrqa_src","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Flandscapegeoinformatics%2Fgrqa_src/lists"}