Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/ssciwr/jupyter-data-exploration

Data Exploration with Python and Jupyter [SSC Compact Course]
https://github.com/ssciwr/jupyter-data-exploration

jupyter-notebook pandas python ssc-compact-course

Last synced: 7 days ago
JSON representation

Data Exploration with Python and Jupyter [SSC Compact Course]

Awesome Lists containing this project

README

        

# [Data Exploration with Python and Jupyter](https://ssciwr.github.io/jupyter-data-exploration)

## Course description

Jupyter notebooks are a great tool for exploring and interacting with data using the Python programming language and its rich ecosystem of libraries.
In this course we will cover basic usage of the Pandas library to download a dataset, explore its contents, clean up missing or invalid data, filter the data according to different criteria, and plot visualizations of the data.

## Preparation

You will need a computer with Python, Jupyter and pandas installed.

If you don't already have this, I recommend installing [Anaconda](https://www.anaconda.com/download/success) (which contains all of this and more):

After installing

- open Jupyter (e.g. on Windows: `Start Menu -> Anaconda3 -> Jupyter Notebook`)

- create a new notebook (the web browser where Jupyter appeared, in top right click on New -> Notebook Python 3)

- type `import pandas` and press Shift+Enter (or click `Cell -> Run Cells` in the menu)

- if no error message appears, you are ready to start the course!

![setup screenshot](setup.apng)

## During the course

The notebooks created during the course will be automatically updated as they are edited at [jupyter-data-exploration-live](https://github.com/ssciwr/jupyter-data-exploration-live).

## Course materials

- Part 1: Python and Jupyter - [online slides](https://ssciwr.github.io/jupyter-data-exploration), [colab interactive notebook](https://colab.research.google.com/github/ssciwr/jupyter-data-exploration/blob/main/index.ipynb), [read-only notebook](index.ipynb)
- Part 2: Pandas with toy data - [online slides](https://ssciwr.github.io/jupyter-data-exploration/pandas-toy-data.slides.html), [colab interactive notebook](https://colab.research.google.com/github/ssciwr/jupyter-data-exploration/blob/main/pandas-toy-data.ipynb), [read-only notebook](pandas-toy-data.ipynb)
- Part 3: Pandas with real data - [online slides](https://ssciwr.github.io/jupyter-data-exploration/pandas-real-data.slides.html), [colab interactive notebook](https://colab.research.google.com/github/ssciwr/jupyter-data-exploration/blob/main/pandas-real-data.ipynb), [read-only notebook](pandas-real-data.ipynb)

## External resources

- pandas
- [getting started tutorials](https://pandas.pydata.org/docs/getting_started/intro_tutorials/index.html)
- [user guide](https://pandas.pydata.org/docs/user_guide/index.html)
- kaggle courses
- [pandas](https://www.kaggle.com/learn/pandas)
- [data cleaning](https://www.kaggle.com/learn/data-cleaning)