{"id":13737342,"url":"https://github.com/DataHerb/dataherb-flora","last_synced_at":"2025-05-08T13:33:53.751Z","repository":{"id":42203828,"uuid":"238546240","full_name":"DataHerb/dataherb-flora","owner":"DataHerb","description":"DataHerb Flora: The core of DataHerb","archived":false,"fork":false,"pushed_at":"2023-03-12T16:51:51.000Z","size":59,"stargazers_count":1,"open_issues_count":1,"forks_count":3,"subscribers_count":2,"default_branch":"master","last_synced_at":"2024-11-15T05:32:38.861Z","etag":null,"topics":["data","data-mining","data-science","datascience","dataset","datasets"],"latest_commit_sha":null,"homepage":"https://dataherb.github.io/flora","language":null,"has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/DataHerb.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null}},"created_at":"2020-02-05T20:52:27.000Z","updated_at":"2022-04-10T15:59:16.000Z","dependencies_parsed_at":"2024-01-06T15:27:05.740Z","dependency_job_id":null,"html_url":"https://github.com/DataHerb/dataherb-flora","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/DataHerb%2Fdataherb-flora","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/DataHerb%2Fdataherb-flora/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/DataHerb%2Fdataherb-flora/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/DataHerb%2Fdataherb-flora/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/DataHerb","download_url":"https
://codeload.github.com/DataHerb/dataherb-flora/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":253077662,"owners_count":21850361,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["data","data-mining","data-science","datascience","dataset","datasets"],"created_at":"2024-08-03T03:01:42.884Z","updated_at":"2025-05-08T13:33:53.744Z","avatar_url":"https://github.com/DataHerb.png","language":null,"funding_links":[],"categories":["Others"],"sub_categories":[],"readme":"# dataherb-flora\n\n\u003ch1 align=\"center\"\u003e\n  \u003cbr\u003e\n  \u003ca href=\"https://dataherb.github.io\"\u003e\u003cimg src=\"https://raw.githubusercontent.com/DataHerb/dataherb.github.io/master/assets/favicon/ms-icon-310x310.png\" alt=\"Markdownify\" width=\"200\"\u003e\u003c/a\u003e\n  \u003cbr\u003e\n  DataHerb Flora\n  \u003cbr\u003e\n\u003c/h1\u003e\n\n\u003ch4 align=\"center\"\u003eA \u003ca href=\"https://dataherb.github.io\" target=\"_blank\"\u003eDataHerb\u003c/a\u003e Core Service to Bundle the Datasets into Flora.\u003c/h4\u003e\n\n\u003cp align=\"center\"\u003e\n    \u003cimg src=\"https://github.com/DataHerb/dataherb-flora/workflows/CI%20Update%20Jekyll/badge.svg?branch=master\"\u003e\n\u003c/p\u003e\n\n## What is DataHerb\n\nDataHerb is an open data initiative to make the access of open datasets easier.\n\n- A **DataHerb** or **Herb** is a dataset. 
A dataset comes with the data files, and the metadata of the data files.\n- A **DataHerb Leaf** or **Leaf** is a data file in the DataHerb.\n- A **Flora** is the combination of all the DataHerbs.\n\nIn many data projects, finding the right datasets to enhance your data is one of the most time-consuming parts. DataHerb adds flavor to your data project.\n\n## What is DataHerb Flora\n\nWe designed the following workflow to share and index datasets.\n\n![DataHerb Workflow](https://raw.githubusercontent.com/DataHerb/dataherb.github.io/master/assets/images/dataherb-components.png)\n\nThis repository is used for listing datasets (Listings in the DataHerb Flora repository).\n\n## How to Add Your Dataset\n\n\u003e [A Complete **Tutorial**](https://dataherb.github.io/add/)\n\nSimply create a `yml` file in the `flora` folder to link to your dataset repository. Your dataset repository should have a `.dataherb` folder and a `metadata.yml` file in it.\n\nThe indexing part will be done by [GitHub Actions](https://github.com/DataHerb/dataherb-flora/actions).\n\n## How is Everything Connected\n\nThere are three components to build the dataset index.\n\n1. [dataherb-flora](https://github.com/DataHerb/dataherb-flora): Indexes datasets using yml files.\n2. [dataherb-metadata-aggregator](https://github.com/DataHerb/dataherb-metadata-aggregator): Aggregates all information about the datasets and creates the database.\n3. [dataherb.github.io](https://github.com/DataHerb/dataherb.github.io): Builds the website using the database.\n\nSome packages are also created to make the access and creation of the datasets easier. Refer to [the website](https://dataherb.github.io/) for the details.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2FDataHerb%2Fdataherb-flora","html_url":"https://awesome.ecosyste.ms/projects/github.com%2FDataHerb%2Fdataherb-flora","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2FDataHerb%2Fdataherb-flora/lists"}