Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/nafisalawalidris/data-visualization-with-python
In the final project of Data Visualisation with Python, you create impactful visualizations using popular libraries like Matplotlib, Seaborn, and Plotly. Apply skills learned throughout the course to analyze real-world data, effectively communicate insights, and enhance your data visualisation proficiency.
https://github.com/nafisalawalidris/data-visualization-with-python
dash dashboards data-visualization folium jupyter-notebook matplotlib numpy python seaborn
Last synced: about 1 month ago
JSON representation
In the final project of Data Visualisation with Python, you create impactful visualizations using popular libraries like Matplotlib, Seaborn, and Plotly. Apply skills learned throughout the course to analyze real-world data, effectively communicate insights, and enhance your data visualisation proficiency.
- Host: GitHub
- URL: https://github.com/nafisalawalidris/data-visualization-with-python
- Owner: nafisalawalidris
- Created: 2023-05-10T18:38:40.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2023-05-21T23:20:38.000Z (over 1 year ago)
- Last Synced: 2023-10-07T00:25:13.180Z (about 1 year ago)
- Topics: dash, dashboards, data-visualization, folium, jupyter-notebook, matplotlib, numpy, python, seaborn
- Language: Jupyter Notebook
- Homepage:
- Size: 4.11 MB
- Stars: 2
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Data Visualization with Python
## Objectives
* Create Data Visualization with Python
* Use various Python libraries for visualization## Introduction
The aim of these labs is to introduce you to data visualization with Python as concrete and as consistent as possible.
Speaking of consistency, because there is no *best* data visualization library available for Python - up to creating these labs - we have to introduce different libraries and show their benefits when we are discussing new visualization concepts. Doing so, we hope to make students well-rounded with visualization libraries and concepts so that they are able to judge and decide on the best visualization technique and tool for a given problem *and* audience.Please make sure that you have completed the prerequisites for this course, namely [**Python Basics for Data Science**](https://www.edx.org/course/python-basics-for-data-science-2?utm_medium=Exinfluencer&utm_source=Exinfluencer&utm_content=000026UJ&utm_term=10006555&utm_id=NA-SkillsNetwork-Channel-SkillsNetworkCoursesIBMDeveloperSkillsNetworkDV0101ENSkillsNetwork20297740-2021-01-01) and [**Analyzing Data with Python**](https://www.edx.org/course/data-analysis-with-python?utm_medium=Exinfluencer&utm_source=Exinfluencer&utm_content=000026UJ&utm_term=10006555&utm_id=NA-SkillsNetwork-Channel-SkillsNetworkCoursesIBMDeveloperSkillsNetworkDV0101ENSkillsNetwork20297740-2021-01-01).
**Note**: The majority of the plots and visualizations will be generated using data stored in *pandas* dataframes. Therefore, in this lab, we provide a brief crash course on *pandas*. However, if you are interested in learning more about the *pandas* library, detailed description and explanation of how to use it and how to clean, munge, and process data stored in a *pandas* dataframe are provided in our course [**Analyzing Data with Python**](https://www.edx.org/course/data-analysis-with-python?utm_medium=Exinfluencer&utm_source=Exinfluencer&utm_content=000026UJ&utm_term=10006555&utm_id=NA-SkillsNetwork-Channel-SkillsNetworkCoursesIBMDeveloperSkillsNetworkDV0101ENSkillsNetwork20297740-2021-01-01).
***
## Table of Contents1. [Exploring Datasets with *pandas*](#0)
1.1 [The Dataset: Immigration to Canada from 1980 to 2013](#2)
1.2 [*pandas* Basics](#4)
1.3 [*pandas* Intermediate: Indexing and Selection](#6)
2\. [Visualizing Data using Matplotlib](#8)
2.1 [Matplotlib: Standard Python Visualization Library](#10)
3\. [Line Plots](#12)
# Exploring Datasets with *pandas**pandas* is an essential data analysis toolkit for Python. From their [website](http://pandas.pydata.org/?utm_medium=Exinfluencer&utm_source=Exinfluencer&utm_content=000026UJ&utm_term=10006555&utm_id=NA-SkillsNetwork-Channel-SkillsNetworkCoursesIBMDeveloperSkillsNetworkDV0101ENSkillsNetwork20297740-2021-01-01):
> *pandas* is a Python package providing fast, flexible, and expressive data structures designed to make working with “relational” or “labeled” data both easy and intuitive. It aims to be the fundamental high-level building block for doing practical, **real world** data analysis in Python.
The course heavily relies on *pandas* for data wrangling, analysis, and visualization. We encourage you to spend some time and familiarize yourself with the *pandas* API Reference: [http://pandas.pydata.org/pandas-docs/stable/api.html](http://pandas.pydata.org/pandas-docs/stable/api.html?utm_medium=Exinfluencer&utm_source=Exinfluencer&utm_content=000026UJ&utm_term=10006555&utm_id=NA-SkillsNetwork-Channel-SkillsNetworkCoursesIBMDeveloperSkillsNetworkDV0101ENSkillsNetwork20297740-2021-01-01).
## The Dataset: Immigration to Canada from 1980 to 2013
Dataset Source: [International migration flows to and from selected countries - The 2015 revision](http://www.un.org/en/development/desa/population/migration/data/empirical2/migrationflows.shtml?utm_medium=Exinfluencer&utm_source=Exinfluencer&utm_content=000026UJ&utm_term=10006555&utm_id=NA-SkillsNetwork-Channel-SkillsNetworkCoursesIBMDeveloperSkillsNetworkDV0101ENSkillsNetwork20297740-2021-01-01).The dataset contains annual data on the flows of international immigrants as recorded by the countries of destination. The data presents both inflows and outflows according to the place of birth, citizenship or place of previous / next residence both for foreigners and nationals. The current version presents data pertaining to 45 countries.
In this lab, we will focus on the Canadian immigration data.
![Data Preview](https://cf-courses-data.s3.us.cloud-object-storage.appdomain.cloud/IBMDeveloperSkillsNetwork-DV0101EN-SkillsNetwork/labs/Module%201/images/DataSnapshot.png)
The Canada Immigration dataset can be fetched from here.
***
# Pie Charts, Box Plots, Scatter Plots, and Bubble Plots
## Objectives
* Explore Matplotlib library further
* Create pie charts, box plots, scatter plots and bubble charts## Table of Contents
1. [Exploring Datasets with *p*andas](#0)
2. [Downloading and Prepping Data](#2)
3. [Visualizing Data using Matplotlib](#4)
4. [Pie Charts](#6)
5. [Box Plots](#8)
6. [Scatter Plots](#10)
7. [Bubble Plots](#12)# Waffle Charts, Word Clouds, and Regression Plots
## Objectives
* Create Word cloud and Waffle charts
* Create regression plots with Seaborn library## Table of Contents
1. [Exploring Datasets with *p*andas](#0)
2. [Downloading and Prepping Data](#2)
3. [Visualizing Data using Matplotlib](#4)
4. [Waffle Charts](#6)
5. [Word Clouds](#8)
6. [Regression Plots](#10)# Generating Maps with Python
## Objectives
* Visualize geospatial data with Folium## Introduction
In this lab, we will learn how to create maps for different objectives. To do that, we will part ways with Matplotlib and work with another Python visualization library, namely **Folium**. What is nice about **Folium** is that it was developed for the sole purpose of visualizing geospatial data. While other libraries are available to visualize geospatial data, such as **plotly**, they might have a cap on how many API calls you can make within a defined time frame. **Folium**, on the other hand, is completely free.
## Table of Contents
1. [Exploring Datasets with *pandas*](#0)
2. [Downloading and Prepping Data](#2)
3. [Introduction to Folium](#4)
4. [Map with Markers](#6)
5. [Choropleth Maps](#8)## Story:
As a data analyst, the given a task to monitor and report US domestic airline flights performance. Goal is to analyze the performance of the reporting airline to improve flight reliability thereby improving customer reliability.Below are the key report items,
- Yearly airline performance report
- Yearly average flight delay statistics
NOTE: Year range is between 2005 and 2020.## Components of the report items
1. Yearly airline performance report
For the chosen year provide,- Number of flights under different cancellation categories using bar chart.
- Average flight time by reporting airline using line chart.
- Percentage of diverted airport landings per reporting airline using pie chart.
- Number of flights flying from each state using choropleth map.
- Number of flights flying to each state from each reporting airline using treemap chart.2. Yearly average flight delay statistics
For the chosen year provide,- Monthly average carrier delay by reporting airline for the given year.
- Monthly average weather delay by reporting airline for the given year.
- Monthly average national air system delay by reporting airline for the given year.
- Monthly average security delay by reporting airline for the given year.
- Monthly average late aircraft delay by reporting airline for the given year.### Requirements to create the expected result
- Two dropdown menus: For choosing report type and year
- Each dropdown will be designed as follows:
- An outer division with two inner divisions (as shown in the expected layout)
- One of the inner divisions will have information about the dropdown and the other one is dropdown.
- Layout for adding graphs.
- Callback function to compute data, create graph and return to the layout.