Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/reagro/carob
Aggregating agronomic data
https://github.com/reagro/carob
Last synced: about 2 months ago
JSON representation
Aggregating agronomic data
- Host: GitHub
- URL: https://github.com/reagro/carob
- Owner: reagro
- License: gpl-3.0
- Created: 2021-05-31T18:55:24.000Z (over 3 years ago)
- Default Branch: master
- Last Pushed: 2024-10-29T09:21:59.000Z (2 months ago)
- Last Synced: 2024-10-29T09:54:43.794Z (2 months ago)
- Language: R
- Homepage:
- Size: 5.08 MB
- Stars: 20
- Watchers: 2
- Forks: 28
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
- open-sustainable-technology - Carob - Create reproducible workflows that reshape primary agricultural research data from experiments and surveys into a standard format, and to aggregate individual data sets into larger collections that can be used in further research. (Consumption / Agriculture and Nutrition)
README
# Carob
*Carob* creates reproducible workflows that standardize primary agricultural research data from experiments and surveys. Standardization includes the use of a common file format, variable names, units and accepted values according to the [terminag](https://github.com/reagro/terminag) standard. Standardized data sets are aggregated into larger collections that can be used in further research. We do this by writing an *R* script for each individual dataset. See the [website](https://carob-data.org) for more information.
Carob is an open access *Extract, Transform, and Load* (ETL) framework supported by [CGIAR](https://www.cgiar.org/initiative/excellence-in-agronomy/) to support predictive analytics (machine learning, artifical intelligence) and other types of data analysis.
Contributions are welcome from anyone, and they can be made via pull-requests. Feel free to improve these scripts, or provide new ones. See the instructions on how to write a Carob script described [here](https://carob-data.org/contribute/index.html). You can also [raise an issues](https://github.com/reagro/carob/issues). A good place to discover new data sets is the [Gardian](https://gardian.bigdata.cgiar.org/) website or our [to-do list](https://carob-data.org/contribute/todo.html).
### Get the data
Compiled versions of the dataset can be downloaded from [carob-data.org](http://carob-data.org) and some will eventually be made available on the [carob dataverse](https://dataverse.harvard.edu/dataverse/carob/).
You can also compile your own version by cloning the repo and running
```
remotes::install_github("reagro/carobiner")
ff <- carobiner::make_carob(path)
```where `path` is the folder of the cloned repo (e.g. `"d:/github/carob"`)