Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/ssciwr/jupyter-data-exploration
Data Exploration with Python and Jupyter [SSC Compact Course]
https://github.com/ssciwr/jupyter-data-exploration
jupyter-notebook pandas python ssc-compact-course
Last synced: 7 days ago
JSON representation
Data Exploration with Python and Jupyter [SSC Compact Course]
- Host: GitHub
- URL: https://github.com/ssciwr/jupyter-data-exploration
- Owner: ssciwr
- License: mit
- Created: 2021-09-14T10:21:20.000Z (about 3 years ago)
- Default Branch: main
- Last Pushed: 2024-06-13T09:55:56.000Z (5 months ago)
- Last Synced: 2024-06-13T19:22:00.458Z (5 months ago)
- Topics: jupyter-notebook, pandas, python, ssc-compact-course
- Language: Jupyter Notebook
- Homepage: https://ssciwr.github.io/jupyter-data-exploration
- Size: 2.72 MB
- Stars: 1
- Watchers: 1
- Forks: 0
- Open Issues: 4
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# [Data Exploration with Python and Jupyter](https://ssciwr.github.io/jupyter-data-exploration)
## Course description
Jupyter notebooks are a great tool for exploring and interacting with data using the Python programming language and its rich ecosystem of libraries.
In this course we will cover basic usage of the Pandas library to download a dataset, explore its contents, clean up missing or invalid data, filter the data according to different criteria, and plot visualizations of the data.## Preparation
You will need a computer with Python, Jupyter and pandas installed.
If you don't already have this, I recommend installing [Anaconda](https://www.anaconda.com/download/success) (which contains all of this and more):
After installing
- open Jupyter (e.g. on Windows: `Start Menu -> Anaconda3 -> Jupyter Notebook`)
- create a new notebook (the web browser where Jupyter appeared, in top right click on New -> Notebook Python 3)
- type `import pandas` and press Shift+Enter (or click `Cell -> Run Cells` in the menu)
- if no error message appears, you are ready to start the course!
![setup screenshot](setup.apng)
## During the course
The notebooks created during the course will be automatically updated as they are edited at [jupyter-data-exploration-live](https://github.com/ssciwr/jupyter-data-exploration-live).
## Course materials
- Part 1: Python and Jupyter - [online slides](https://ssciwr.github.io/jupyter-data-exploration), [colab interactive notebook](https://colab.research.google.com/github/ssciwr/jupyter-data-exploration/blob/main/index.ipynb), [read-only notebook](index.ipynb)
- Part 2: Pandas with toy data - [online slides](https://ssciwr.github.io/jupyter-data-exploration/pandas-toy-data.slides.html), [colab interactive notebook](https://colab.research.google.com/github/ssciwr/jupyter-data-exploration/blob/main/pandas-toy-data.ipynb), [read-only notebook](pandas-toy-data.ipynb)
- Part 3: Pandas with real data - [online slides](https://ssciwr.github.io/jupyter-data-exploration/pandas-real-data.slides.html), [colab interactive notebook](https://colab.research.google.com/github/ssciwr/jupyter-data-exploration/blob/main/pandas-real-data.ipynb), [read-only notebook](pandas-real-data.ipynb)## External resources
- pandas
- [getting started tutorials](https://pandas.pydata.org/docs/getting_started/intro_tutorials/index.html)
- [user guide](https://pandas.pydata.org/docs/user_guide/index.html)
- kaggle courses
- [pandas](https://www.kaggle.com/learn/pandas)
- [data cleaning](https://www.kaggle.com/learn/data-cleaning)