Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/ul-mds/gecko-data
Example data sources as a starting point for working with Gecko
https://github.com/ul-mds/gecko-data
data-science numpy pandas python record-linkage
Last synced: 18 days ago
JSON representation
Example data sources as a starting point for working with Gecko
- Host: GitHub
- URL: https://github.com/ul-mds/gecko-data
- Owner: ul-mds
- License: mit
- Created: 2024-03-12T12:13:25.000Z (10 months ago)
- Default Branch: main
- Last Pushed: 2024-03-12T12:38:36.000Z (10 months ago)
- Last Synced: 2024-03-13T09:36:11.323Z (10 months ago)
- Topics: data-science, numpy, pandas, python, record-linkage
- Language: Jupyter Notebook
- Homepage:
- Size: 4.66 MB
- Stars: 1
- Watchers: 0
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
This repository contains example data for use with [Gecko](https://github.com/ul-mds/gecko) — a Python library for generation and mutation of realistic data at scale.
All data present in this repository is collected from public data sources.
Feel free to use it to test Gecko's capabilities.# Usage
Clone this repository to an easy-to-find location.
```bash
git clone https://github.com/ul-mds/gecko-data.git
```You can now use the data in this repository in your Gecko scripts.
# Structure
Subdirectories are locales which identify the language and country that the data contained within pertains to.
Currently, there exists data for the following locales:- [English (United States)](./en_US/)
- [German (Germany)](./de_DE/)There is also a [directory called "common"](./common/) which contains data independent of language and country.
Each subdirectory contains a README file which lists the sources and additional pre-processing steps to make the data easier to work with.
If you intend to publish data generated with data from this repository, please take care to cite the correct sources from these README files.# License
Gecko is released under the MIT License.