https://github.com/dask/dask-examples

Easy-to-run example notebooks for Dask
https://github.com/dask/dask-examples

hacktoberfest

Last synced: about 1 year ago
JSON representation

Easy-to-run example notebooks for Dask

Host: GitHub
URL: https://github.com/dask/dask-examples
Owner: dask
License: cc-by-sa-4.0
Created: 2018-03-04T02:58:32.000Z (over 8 years ago)
Default Branch: main
Last Pushed: 2024-01-12T14:04:09.000Z (over 2 years ago)
Last Synced: 2025-06-06T02:35:39.644Z (about 1 year ago)
Topics: hacktoberfest
Language: Jupyter Notebook
Homepage: https://examples.dask.org/
Size: 608 MB
Stars: 378
Watchers: 24
Forks: 225
Open Issues: 21
Metadata Files:
- Readme: README.md
- Contributing: CONTRIBUTING.md
- License: LICENSE.txt

Awesome Lists containing this project

README

Dask Example Notebooks
======================

This repository includes easy-to-run example notebooks for Dask.
They are intended to be educational and give users a start on common workflows.

They should be easy to run locally if you download this repository.
They are also available on the cloud by clicking on the link below:

[![Binder](https://static.mybinder.org/badge_logo.svg)](https://mybinder.org/v2/gh/dask/dask-examples/main?urlpath=lab)
[![Build Status](https://github.com/dask/dask-examples/workflows/CI/badge.svg)](https://github.com/dask/dask-examples/actions?query=workflow%3ACI)

Contributing
------------

This repository is a great opportunity to start contributing to Dask.
Please note that examples submitted to this repository should follow these
guidelines:

1. Run top-to-bottom without intervention from the user
2. Not require external data sources that may disappear over time
(external data sources that are highly unlikely to disappear are fine)
3. Not be resource intensive, and should run within 2GB of memory
4. Be clear and contain enough prose to explain the topic at hand
5. Be concise and limited to one or two topics, such that a reader can
get through the example within a few minutes of reading
6. Be of general relevance to Dask users, and so not too specific on a
particular problem or use case

As an example "how to do dataframe joins" is a great topic while "how to
do dataframe joins in the particular case when one column is a categorical
and the other is object dtype" is probably too specific
7. If the example requires a library not included in `binder/environment.yml`
then it would be `pip` installed` in the first cell of the notebook, with a
brief explanation about what functionality the library adds. A brief
example follows:

```markdown
### Install Extra Dependencies

We first install the library X for interacting with Y
```

```python
!pip install X
```

Updating the Binder environment
-------------------------------

1. Modify `binder/environment-base.yml` with new or updated dependencies
2. Run a `linux/amd64` Docker container with `mamba` available. For example:

```shell
docker run --platform=linux/amd64 -it --rm --mount type=bind,source=$(pwd)/binder,target=/binder condaforge/mambaforge /bin/bash
```

This mounts the `./binder` folder in `/binder` in the Docker container
3. Create the environment

```shell
mamba env create -f environment-base.yml
```

This may take quite a while.
4. Export the environment specification:

```shell
mamba env export -n dask-examples --no-builds -f environment.yml
```

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/dask/dask-examples

Awesome Lists containing this project

README