https://github.com/hamedalemo/dask-tutorial
A tutorial to learn Dask DataArray and Dask DataFrames with examples from geospatial data catalogs.
https://github.com/hamedalemo/dask-tutorial
dask dask-dataframes dask-distributed geospatial geospatial-analysis geospatial-data
Last synced: 11 months ago
JSON representation
A tutorial to learn Dask DataArray and Dask DataFrames with examples from geospatial data catalogs.
- Host: GitHub
- URL: https://github.com/hamedalemo/dask-tutorial
- Owner: HamedAlemo
- License: mit
- Created: 2023-09-29T17:02:47.000Z (over 2 years ago)
- Default Branch: main
- Last Pushed: 2024-10-24T04:12:53.000Z (over 1 year ago)
- Last Synced: 2025-01-15T10:57:46.782Z (about 1 year ago)
- Topics: dask, dask-dataframes, dask-distributed, geospatial, geospatial-analysis, geospatial-data
- Language: Jupyter Notebook
- Homepage:
- Size: 18.5 MB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# Dask Tutorial
This repository contains an introduction to Dask and tutorials to use Dask arrays and `stackstac` to retrieve a large number of satellite scenes from a STAC API using Dask. This is part of the course on [Advanced Geospatial Analytics with Python](https://hamedalemo.github.io/advanced-geo-python/intro.html) taught since Fall 2023 at Clark University.
## Requirements
You need to have Docker installed on your machine.
## Instructions
It's recommended to pull the Docker image from Dockerhub. Otherwise, if you prefer, you can build your own image using the instructions in the following section.
```
docker pull hamedalemo/dask-tutorial:1.3
```
```
docker run -it -p 8888:8888 -p 8787:8787 hamedalemo/dask-tutorial:1.3
```
Port `8787` is used by Dask Dashboard.
- Copy the Jupyter Lab url and paste it in your browser.
- Open `dask_intro.ipynb`, `stackstac.ipynb` or `dask_dataframe.ipynb` and follow the instructions.
Build the Docker image:
**Windows users:** It is highly recommended that you pull the Docker image, there seems to be an issue with conda-forge on WSL.
```
docker build -t dask-tutorial .
```
Run the container as following after switching to the repository's directory locally:
```
docker run -it -p 8888:8888 -p 8787:8787 dask-tutorial
```
- Copy the Jupyter Lab url and paste it in your browser.
- Open `dask_intro.ipynb`, `stackstac.ipynb` or `dask_dataframe.ipynb` and follow the instructions.