{"id":21490462,"url":"https://github.com/reactual/datalibrary","last_synced_at":"2025-03-17T11:12:54.428Z","repository":{"id":42356438,"uuid":"146624016","full_name":"reactual/datalibrary","owner":"reactual","description":"An API for better datasets","archived":false,"fork":false,"pushed_at":"2022-12-08T14:23:09.000Z","size":1041,"stargazers_count":0,"open_issues_count":19,"forks_count":0,"subscribers_count":3,"default_branch":"master","last_synced_at":"2025-01-23T20:34:14.024Z","etag":null,"topics":["api","graphql","mit-license"],"latest_commit_sha":null,"homepage":"https://datalibrary.com","language":"JavaScript","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/reactual.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null}},"created_at":"2018-08-29T15:54:11.000Z","updated_at":"2020-07-30T12:57:09.000Z","dependencies_parsed_at":"2023-01-25T05:31:23.148Z","dependency_job_id":null,"html_url":"https://github.com/reactual/datalibrary","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/reactual%2Fdatalibrary","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/reactual%2Fdatalibrary/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/reactual%2Fdatalibrary/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/reactual%2Fdatalibrary/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/reactual","download_url":"https://codeload.github.com/reactual/datalibrary/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":244022715,"owners_count":20385134,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["api","graphql","mit-license"],"created_at":"2024-11-23T14:37:44.398Z","updated_at":"2025-03-17T11:12:54.408Z","avatar_url":"https://github.com/reactual.png","language":"JavaScript","readme":"# DataLibrary\n\u003e An API for better datasets -- https://datalibrary.com\n\n## Overview\nDataLibrary was created to bring datasets from a range of subjects into a single API. Our primary goal is consistency and ease of use.\n\nFor example, take a random selection of datasets:\n\n* List of Metric Units\n* List of US States\n* List of English Stopwords\n* Air Pollution Measurement Data\n* List of AWS \u0026 GCP Data Center Regions\n* Public financial data from 2 different municipalities\n\nBefore DataLibrary, you would most likely access these datasets from different sources. Beyond the technical challenges, each provider would typically use different schema patterns, naming conventions, and formatting.\n\nDataLibrary exists not only to bring datasets together into a single source, but also clean and reformat data when possible.\nFor common subjects, data could be combined from several sources to create a new, richer\ndataset, with fields and metadata carefully renamed for a better experience.\n\n## Access\nThe DataLibrary API will initially be available via GraphQL, with a RESTful HTTP API following. A frontend for searching datasets and other features will be available also.\n\n## Copyright Notes\n\u003e **DataLibrary's goal is to make data more accessible.**\n\u003e We take licensing and copyrights seriously.\n\nFor datasets where a copyright wouldn't apply, DataLibrary will typically host a formatted version of the data directly. This especially applies to common or infrequently changing datasets.\n\nDataLibrary supports datasets that contain copyrights, premium, and paid datasets, when approved by a provider.\n\n**A few example strategies:**\n\n* Maintaining our own agreement/terms with a provider.\n* Acting as a proxy where you bring your own license/token, not maintaining a local copy.\n* Providing an API or local library for formatting raw data from a dataset template we have.\n* Acting as a paid, *data* app store where we provide access to a dataset that generates revenue for a provider.\n* Providing generic utilities for cleaning \u0026 working with your own data.\n\n\n---\n\u003cimg src=\"/assets/logo_icon.png\" alt=\"Logo\" width=\"60\"\u003e\n\nA project by Reactual\n","funding_links":[],"categories":[],"sub_categories":[],"project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Freactual%2Fdatalibrary","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Freactual%2Fdatalibrary","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Freactual%2Fdatalibrary/lists"}