https://github.com/odnodn/covidsource

The covidsource project is a data ETL pipeline and set of visualizations for Grafana for COVID-19 data.
https://github.com/odnodn/covidsource

Last synced: 4 months ago
JSON representation

The covidsource project is a data ETL pipeline and set of visualizations for Grafana for COVID-19 data.

Host: GitHub
URL: https://github.com/odnodn/covidsource
Owner: odnodn
License: mit
Created: 2020-08-30T12:38:24.000Z (almost 5 years ago)
Default Branch: master
Last Pushed: 2020-07-20T06:04:39.000Z (almost 5 years ago)
Last Synced: 2024-04-17T01:08:04.782Z (about 1 year ago)
Homepage: https://c19.dev
Size: 687 KB
Stars: 0
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE.md

Awesome Lists containing this project

README

        # covidsource

The `covidsource` project is broken up into two distinct pieces:

1. An ETL pipeline using [AWS](https://aws.amazon.com/) and the [serverless](https://serverless.com/) framework to continually ingest available COVID-19 data into an [RDS Aurora](https://aws.amazon.com/rds/aurora/) datastore.

2. A set of [Grafana](https://grafana.com/) dashboards to visualize the data.

The ETL source code and Grafana dashboards will be backed up and stored here. I will also be providing helpful links to static data is used in this project.

## the sources of data

Below are the current data sources being populated.

|                    | Website          | Function Name | Run every | Data         |

| -----------------: | --------------------- | ------------------------- | ------------------ | ------------------ |

|        The Covid Tracking Project | [covidtracking.com](https://covidtracking.com/) | ingest_ctp | `1 hour` | [CSV](https://covidtracking.com/api/v1/states/daily.csv) |

| US Census Data / 2019 projections | [census.gov](https://www.census.gov/) | ingest_us | `manual` | [API](https://api.census.gov/data/2019/pep/population?get=NAME,COUNTY,STATE,DENSITY,POP&for=county:*) |

|        New York Times County Data | [nyt.com](https://www.nytimes.com/article/coronavirus-county-data-us.html) | ingest_nyt    | `3 hours` | [CSV](https://raw.githubusercontent.com/nytimes/covid-19-data/master/us-counties.csv) |

|               Apple Mobility Data | [apple.com](https://www.apple.com/covid19/mobility)          | ingest_apl    | `12 hours` | [CSV](https://www.apple.com/covid19/mobility)                |

|              Google Mobility Data | [google.com](https://www.google.com/covid19/mobility)        | ingest_gog  | `12 hours` | [CSV](https://www.gstatic.com/covid19/mobility/Global_Mobility_Report.csv) |

| COVID Act Now OBSERVED | [covidactnow.org](https://covidactnow.org) | ingest_can | `6 hours` | [CSV](https://data.covidactnow.org/latest/us/states.OBSERVED_INTERVENTION.timeseries.csv) |

|  Eric Celeste / US County GeoJSON | [eric.clst.org](https://eric.clst.org/tech/usgeojson/)       | -             | `6 hours`    | [JSON](https://eric.clst.org/assets/wiki/uploads/Stuff/gz_2010_us_050_00_20m.json) |

## use & contribute

You can contribute or fork the code to fit your needs. Just use the following steps:

1. Make sure you have an AWS account and install and authenticate the CLI with an access key and secret.

2. Make sure you have node.js installed

3. Install the [serverless](https://serverless.com/) framework:

   ```bash

   npm i -g serverless

   ```

4. Clone the repo into a new folder.

5. Switch to the folder.

6. You will need to create a `serverconfig.yaml` file that contains your secrets. The format is as follows:

   ```yaml

   dev:

     database:

       name: c19_dev_db_example

       username: c19_user

       password: superstrongpassword

     cfnRole: arn:aws:iam::00000000:role/your-own-special-lambda-role

   

   prod:

     database:

       name: c19_prod_db_example

       username: c19_user

       password: superstrongpassword

     cfnRole: arn:aws:iam::00000000:role/your-own-special-lambda-role

   ```

   

7. Install the dependencies

   ```bash

   npm i

   ```

8. Deploy the API

   ```bash

   sls deploy

   ```

   *Keep in mind here, that I've got my serverless.yaml configured to use the custom domain (which you will not be able to use, but I'm leaving the config in the file so you can see how I did it.)*

9. Seed the database. Some ETL lambdas run on a schedule, but if you'd like to run them immediately to seed the database, you can do so by running the commands manually using the `invoke` function (see the table above).

   ```bash

   sls invoke -f [function name] -s (prod|dev)

   ```

   Use the *stage* param to specify `prod` or `dev`. If you don't specify a `-s` it will assume `dev`

## follow the live build

More information is available on the https://c19.dev website. I'm currently live streaming the build of the API and dashboards on [Twitch](https://www.twitch.tv/hrudotsh) and posting the videos afterwards on [YouTube](https://www.youtube.com/channel/UC_D6qpdhkoJJZE_CGFyVhog).

/ [Matt](https://www.linkedin.com/in/hrushka/)

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/odnodn/covidsource

Awesome Lists containing this project

README