https://github.com/xetdata/clinicaltrials.gov
Dataset repo containing all of the trials listed on Clinicaltrials.gov
- Host: GitHub
- URL: https://github.com/xetdata/clinicaltrials.gov
- Owner: xetdata
- Created: 2023-12-21T01:12:33.000Z (over 2 years ago)
- Default Branch: main
- Last Pushed: 2023-12-21T01:13:12.000Z (over 2 years ago)
- Last Synced: 2025-07-31T16:42:33.043Z (10 months ago)
- Language: Python
- Size: 4.88 KB
- Stars: 0
- Watchers: 2
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
README
# Easy ETL Template
The templated version of our [Easy ETL example](https://github.com/xetdata/easy-etl).
## Usage
Configuring the repo:
1. Click `Use this Template`, and select `Create a new repository`.
2. Fill out the repository information and click `Create Repository`.
3. Give Actions write permission so the pipeline can check files back into the repo. Open the repo's Settings, select `Actions`, then `General`. Scroll down to `Workflow permissions`, select `Read and write permissions`, and click `Save`.
4. Follow the instructions at [XetData integration for GitHub](https://github.com/apps/xetdata) and install the app on your new repo.
5. Edit `.github/workflows/etl-action.yml` and remove the comments from the schedule section (lines 8-10).
6. In `.github/workflows/etl-action.yml`, set `AUTOMATION_USERNAME` and `AUTOMATION_EMAIL` to the username and email address you want the pipeline to use (lines 5 and 6).
Making it your own:
1. Edit `src/pipeline.py` and replace the code in `extract()` with your own ETL code.
2. Save your requirements with `pip freeze > requirements.txt`.
3. Commit and push your changes.
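As a rough illustration of what step 1 might look like, here is a minimal sketch of a pipeline module. Only `extract()` is named by this template. The `transform()`, `load()`, and `run()` helpers, the sample records, and the `data/output.csv` path are all hypothetical and stand in for whatever your own ETL code does.

```python
import csv
import io
from pathlib import Path


def extract():
    """Return raw records as a list of dicts.

    Hypothetical stand-in: real code would read from an API, a file,
    or a database instead of this inline sample.
    """
    raw = "id,status\nNCT0001,Recruiting\nNCT0002,Completed\n"
    return list(csv.DictReader(io.StringIO(raw)))


def transform(rows):
    """Normalize the raw records (here: lowercase the status field)."""
    return [{"id": r["id"], "status": r["status"].lower()} for r in rows]


def load(rows, out_path):
    """Write the records to a CSV file so they can be committed to the repo."""
    out_path = Path(out_path)
    out_path.parent.mkdir(parents=True, exist_ok=True)
    with out_path.open("w", newline="") as f:
        writer = csv.DictWriter(f, fieldnames=["id", "status"])
        writer.writeheader()
        writer.writerows(rows)
    return out_path


def run(out_path="data/output.csv"):
    """Run extract, transform, and load in order; return the output path."""
    return load(transform(extract()), out_path)
```

Calling `run()` writes the CSV under `data/`. Keeping each stage in its own function makes it easier to swap in a real data source later without touching the rest.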
At the 21st minute of the next hour, your ETL pipeline will run. Couldn't be easier!
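To estimate when the first run will happen, the sketch below computes the next minute-21 slot after a given time. It assumes the workflow schedule is hourly at minute 21 (equivalent to the cron expression `21 * * * *`), which is inferred from the sentence above rather than read from the workflow file. `next_run` is a hypothetical helper, not part of the template.

```python
from datetime import datetime, timedelta


def next_run(now, minute=21):
    """Return the next time strictly after `now` whose minute equals `minute`.

    Assumes an hourly schedule; `minute=21` is an assumption based on
    the README text, not on the actual workflow file.
    """
    candidate = now.replace(minute=minute, second=0, microsecond=0)
    if candidate <= now:
        # This hour's slot has already passed, so use the next hour.
        candidate += timedelta(hours=1)
    return candidate


# Example: pushing at 10:05 would be picked up at 10:21 the same hour.
print(next_run(datetime(2024, 1, 1, 10, 5)))
```

GitHub evaluates scheduled workflows in UTC and may start them later than the nominal time when the service is busy, so treat the result as an estimate.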