Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/alertadengue/episcanner-downloader
https://github.com/alertadengue/episcanner-downloader
Last synced: 4 days ago
JSON representation
- Host: GitHub
- URL: https://github.com/alertadengue/episcanner-downloader
- Owner: AlertaDengue
- License: mit
- Created: 2023-05-04T17:20:57.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2024-05-29T04:18:04.000Z (8 months ago)
- Last Synced: 2024-05-29T04:18:15.867Z (8 months ago)
- Language: Python
- Size: 15.7 MB
- Stars: 0
- Watchers: 2
- Forks: 2
- Open Issues: 3
-
Metadata Files:
- Readme: README.md
- Changelog: CHANGELOG.md
- License: LICENSE
Awesome Lists containing this project
README
# Episcanner Downloader
Episcanner Downloader is a data downloader for the Episcanner application. It retrieves data related to diseases like dengue, zika and chikungunya and saves it in a specified directory with the formats csv, parquet or duckdb.
## Features
- Fetches data related to diseases from the Episcanner application
- Supports downloading data for specific diseases
- Saves downloaded data to a designated directory## Installation
To install Episcanner Downloader, follow these steps:
1. Clone the repository:
```shell
git clone https://github.com/AlertaDengue/episcanner-downloader.git
```
2. Navigate to the cloned directory:
```shell
cd episcanner-downloader
```
### Using Conda1. Create a Conda environment using the provided YAML file:
```shell
conda env create -f conda/env-base.yaml
```
2. conda activate episcanner-downloader
```shell
conda activate episcanner-downloader
```
3. Install the dependencies using Poetry:
```shell
poetry install
```
### Using a Virtual Environment (venv)
1. Create a virtual environment:
```shell
python -m venv env
```
2. Activate the virtual environment:
```shell
source env/bin/activate
```
3. Install the dependencies using Poetry:
```shell
poetry install
```
## Setting Environment Variables
Before running Episcanner Downloader, make sure to set the required environment variables for connecting to the PSQL database. You can use the provided Makefile to create a .env file with the exported variables:
1. Set the required environment variables for connecting to the PSQL database:
```shell
export EPISCANNER_PSQL_URI="postgresql://user:password@host:port/database"
```2. Create a .env file in the project root directory with the exported variables.
```shell
make dotenv
```
## Usage
To use Episcanner Downloader, follow these steps:1. Open the python console or another python interpreter:
```python
from scanner import Episcanner
scanner = EpiScanner(disease="dengue", uf="RJ", year=2024)
scanner.export("duckdb")
```*Replace `uf` with the desired state (e.g., 'MG') and `disease` with the specific disease you want to download ('dengue', 'chikungunya' or 'zika'). Specify the `output_dir` on the `export()` method to change where the data should be saved.*
2. In order to read the data, open the file using `duckdb`:
```python
import duckdb
db = duckdb.connect("<$HOME>/episcanner/episcanner.duckdb")
db.execute("SELECT * FROM 'RJ' WHERE disease = 'dengue' AND year = 2024").fetchdf()
```Replace <$HOME> with your actual home directory or use the `output_dir` specified in the export method
## License
Episcanner Downloader is licensed under the [MIT License](https://github.com/AlertaDengue/episcanner-downloader/blob/main/LICENSE). See the LICENSE file for more details.