Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/dolthub/dolthub-etl-jobs
ETL jobs that DoltHub maintained that load public data into DoltHub.
https://github.com/dolthub/dolthub-etl-jobs
Last synced: about 14 hours ago
JSON representation
ETL jobs that DoltHub maintained that load public data into DoltHub.
- Host: GitHub
- URL: https://github.com/dolthub/dolthub-etl-jobs
- Owner: dolthub
- License: apache-2.0
- Created: 2019-09-11T02:01:35.000Z (about 5 years ago)
- Default Branch: master
- Last Pushed: 2023-03-07T03:48:27.000Z (over 1 year ago)
- Last Synced: 2024-08-08T18:21:43.052Z (3 months ago)
- Language: Python
- Homepage:
- Size: 890 KB
- Stars: 18
- Watchers: 12
- Forks: 9
- Open Issues: 31
-
Metadata Files:
- Readme: README.md
- License: LICENSE.md
Awesome Lists containing this project
- awesome-starred - dolthub/dolthub-etl-jobs - ETL jobs that DoltHub maintained that load public data into DoltHub. (others)
README
# DoltHub ETL Jobs
This package contains legacy code used in an [Airflow](https://airflow.apache.org/) pipeline to update public databases under the
[`dolthub` organization](https://www.dolthub.com/organizations/dolthub) on [DoltHub](https://www.dolthub.com). It also stores some
adhoc scripts that were never meant to be run continuously but were used to import the data from its source. If you find an
interesting database on DoltHub under the `dolthub` organization, you may find the code that created it here. If you are wondering,
just ask us on our [Discord](https://discord.com/invite/RFwfYpu).[DoltHub](https://www.dolthub.com), the company, focused on making [Dolt](https://github.com/dolthub/dolt) a full-fledged
[version controlled database](https://www.dolthub.com/blog/2021-09-17-database-version-control/). We deprecated our Airflow
instance. These scripts are here for posterity sake.