{"id":21744749,"url":"https://github.com/milaan9/10_python_pandas_module","last_synced_at":"2025-04-06T08:13:34.505Z","repository":{"id":40995728,"uuid":"377796692","full_name":"milaan9/10_Python_Pandas_Module","owner":"milaan9","description":"Pandas is a high-level data manipulation tool developed by Wes McKinney. It is built on the Numpy package and its key data structure is called the DataFrame. DataFrames allow you to store and manipulate tabular data in rows of observations and columns of variables.","archived":false,"fork":false,"pushed_at":"2022-12-09T21:06:17.000Z","size":3612,"stargazers_count":242,"open_issues_count":0,"forks_count":228,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-03-30T06:09:00.866Z","etag":null,"topics":["dataframe-drop-columns","dataframe-drop-duplicates","dataframe-from-dictionary","dataframe-from-list","dataframe-to-dictionary","pandas-dataframe","pandas-datareader","pandas-excercises","pandas-python","pandas-series","pandas-tricks-for-data-manipulation","pandas-tutorial","python-pandas","python-tutor","python-tutorial-github","python-tutorial-notebook","python-tutorials","python4beginner","python4datascience","tutor-milaan9"],"latest_commit_sha":null,"homepage":"","language":"Jupyter 
Notebook","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/milaan9.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null}},"created_at":"2021-06-17T10:48:31.000Z","updated_at":"2025-03-21T01:34:07.000Z","dependencies_parsed_at":"2023-01-25T21:45:23.413Z","dependency_job_id":null,"html_url":"https://github.com/milaan9/10_Python_Pandas_Module","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/milaan9%2F10_Python_Pandas_Module","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/milaan9%2F10_Python_Pandas_Module/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/milaan9%2F10_Python_Pandas_Module/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/milaan9%2F10_Python_Pandas_Module/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/milaan9","download_url":"https://codeload.github.com/milaan9/10_Python_Pandas_Module/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":247451665,"owners_count":20940944,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["dataframe-drop-columns"
,"dataframe-drop-duplicates","dataframe-from-dictionary","dataframe-from-list","dataframe-to-dictionary","pandas-dataframe","pandas-datareader","pandas-excercises","pandas-python","pandas-series","pandas-tricks-for-data-manipulation","pandas-tutorial","python-pandas","python-tutor","python-tutorial-github","python-tutorial-notebook","python-tutorials","python4beginner","python4datascience","tutor-milaan9"],"created_at":"2024-11-26T07:12:26.681Z","updated_at":"2025-04-06T08:13:34.486Z","avatar_url":"https://github.com/milaan9.png","language":"Jupyter Notebook","readme":"\u003cp align=\"center\"\u003e \n\u003ca href=\"https://github.com/milaan9\"\u003e\u003cimg src=\"https://img.shields.io/static/v1?logo=github\u0026label=maintainer\u0026message=milaan9\u0026color=ff3300\" alt=\"Last Commit\"/\u003e\u003c/a\u003e \n\u003c!--\u003cimg src=\"https://badges.pufler.dev/created/milaan9/10_Python_Pandas_Module\" alt=\"Created\"/\u003e--\u003e\n\u003c!--\u003ca href=\"https://github.com/milaan9/10_Python_Pandas_Module/graphs/commit-activity\"\u003e\u003cimg src=\"https://img.shields.io/github/last-commit/milaan9/10_Python_Pandas_Module.svg?colorB=ff8000\u0026style=flat\" alt=\"Last Commit\"/\u003e \u003c/a\u003e--\u003e\n\u003ca href=\"https://github.com/milaan9/10_Python_Pandas_Module/pulse\" alt=\"Activity\"\u003e\u003cimg src=\"https://img.shields.io/github/commit-activity/m/milaan9/10_Python_Pandas_Module.svg?colorB=teal\u0026style=flat\" /\u003e\u003c/a\u003e \n\u003ca href=\"https://hits.seeyoufarm.com\"\u003e\u003cimg src=\"https://hits.seeyoufarm.com/api/count/incr/badge.svg?url=https%3A%2F%2Fgithub.com%2Fmilaan9%2F10_Python_Pandas_Module\u0026count_bg=%231DC92C\u0026title_bg=%23555555\u0026icon=\u0026icon_color=%23E7E7E7\u0026title=views\u0026edge_flat=false\"/\u003e\u003c/a\u003e\n\u003ca href=\"https://github.com/milaan9/10_Python_Pandas_Module/stargazers\"\u003e\u003cimg 
src=\"https://img.shields.io/github/stars/milaan9/10_Python_Pandas_Module.svg?colorB=1a53ff\" alt=\"Stars Badge\"/\u003e\u003c/a\u003e\n\u003ca href=\"https://github.com/milaan9/10_Python_Pandas_Module/network/members\"\u003e\u003cimg src=\"https://img.shields.io/github/forks/milaan9/10_Python_Pandas_Module\" alt=\"Forks Badge\"/\u003e \u003c/a\u003e\n\u003cimg src=\"https://img.shields.io/github/repo-size/milaan9/10_Python_Pandas_Module.svg?colorB=CC66FF\u0026style=flat\" alt=\"Size\"/\u003e\n\u003ca href=\"https://github.com/milaan9/10_Python_Pandas_Module/pulls\"\u003e\u003cimg src=\"https://img.shields.io/github/issues-pr/milaan9/10_Python_Pandas_Module.svg?colorB=yellow\u0026style=flat\" alt=\"Pull Requests Badge\"/\u003e\u003c/a\u003e\n\u003ca href=\"https://github.com/milaan9/10_Python_Pandas_Module/issues\"\u003e\u003cimg src=\"https://img.shields.io/github/issues/milaan9/10_Python_Pandas_Module.svg?colorB=yellow\u0026style=flat\" alt=\"Issues Badge\"/\u003e\u003c/a\u003e\n\u003cimg src=\"https://img.shields.io/github/languages/top/milaan9/10_Python_Pandas_Module.svg?colorB=996600\u0026style=flat\" alt=\"Language\"/\u003e\u003c/a\u003e \n\u003ca href=\"https://github.com/milaan9/10_Python_Pandas_Module/blob/main/LICENSE\"\u003e\u003cimg src=\"https://img.shields.io/badge/License-MIT-blueviolet.svg\" alt=\"MIT License\"/\u003e\u003c/a\u003e\n\u003c/p\u003e \n\u003c!--\u003cimg src=\"https://badges.pufler.dev/contributors/milaan9/01_Python_Introduction?size=50\u0026padding=5\u0026bots=true\" alt=\"milaan9\"/\u003e--\u003e\n\n\u003cp align=\"center\"\u003e \n\u003ca href=\"https://mybinder.org/v2/gh/milaan9/10_Python_Pandas_Module/HEAD\"\u003e\u003cimg src=\"https://mybinder.org/badge_logo.svg\" alt=\"binder\"/\u003e\u003c/a\u003e\n\u003ca href=\"https://githubtocolab.com/milaan9/10_Python_Pandas_Module\"\u003e\u003cimg src=\"https://colab.research.google.com/assets/colab-badge.svg\" alt=\"colab\"/\u003e\u003c/a\u003e\n\u003c/p\u003e \n \n# 
10_Python_Pandas_Module\n\n\n## Introduction 👋\n\n### What is Pandas in Python?\n[![Pandas](https://img.shields.io/badge/Pandas-2C2D72?style=flat\u0026logo=pandas\u0026logoColor=white)](https://pandas.pydata.org/) is one of the most popular Python libraries, providing fast, flexible, and expressive data structures designed to make working with \"relational\" or \"labeled\" data both easy and intuitive. It aims to be the fundamental high-level building block for doing practical, **real world** data analysis in Python. Additionally, it has the broader goal of becoming **the most powerful and flexible open source data analysis / manipulation tool available in any language**. It is already well on its way towards this goal.\n\nData prepared in Pandas is commonly used to support statistical analysis in **SciPy**, plotting functions from **Matplotlib**, and machine learning algorithms in **Scikit-learn**.\n\n\n## Main Features\nHere are just a few of the things that pandas does well:\n\n  - Easy handling of [**missing data**][missing-data] (represented as `NaN`) in floating point as well as non-floating point data\n  - Size mutability: columns can be [**inserted and deleted**][insertion-deletion] from DataFrame and higher dimensional objects\n  - Automatic and explicit [**data alignment**][alignment]: objects can be explicitly aligned to a set of labels, or the user can simply\n    ignore the labels and let `Series`, `DataFrame`, etc. 
automatically align the data for you in computations\n  - Powerful, flexible [**group by**][groupby] functionality to perform split-apply-combine operations on data sets, for both aggregating\n    and transforming data\n  - Make it [**easy to convert**][conversion] ragged, differently-indexed data in other Python and NumPy data structures\n    into DataFrame objects\n  - Intelligent label-based [**slicing**][slicing], [**fancy indexing**][fancy-indexing], and [**subsetting**][subsetting] of\n    large data sets\n  - Intuitive [**merging**][merging] and [**joining**][joining] datasets\n  - Flexible [**reshaping**][reshape] and [**pivoting**][pivot-table] of datasets\n  - [**Hierarchical**][mi] labeling of axes (possible to have multiple labels per tick)\n  - Robust IO tools for loading data from [**flat files**][flat-files] (CSV and delimited), [**Excel files**][excel], [**databases**][db],\n    and saving/loading data from the ultrafast [**HDF5 format**][hdfstore]\n  - [**Time series**][timeseries]-specific functionality: date range generation and frequency conversion, moving window statistics,\n    moving window linear regressions, date shifting and lagging, etc.\n\n\n   [missing-data]: https://pandas.pydata.org/pandas-docs/stable/missing_data.html#working-with-missing-data\n   [insertion-deletion]: https://pandas.pydata.org/pandas-docs/stable/dsintro.html#column-selection-addition-deletion\n   [alignment]: https://pandas.pydata.org/pandas-docs/stable/dsintro.html?highlight=alignment#intro-to-data-structures\n   [groupby]: https://pandas.pydata.org/pandas-docs/stable/groupby.html#group-by-split-apply-combine\n   [conversion]: https://pandas.pydata.org/pandas-docs/stable/dsintro.html#dataframe\n   [slicing]: https://pandas.pydata.org/pandas-docs/stable/indexing.html#slicing-ranges\n   [fancy-indexing]: https://pandas.pydata.org/pandas-docs/stable/indexing.html#advanced-indexing-with-ix\n   [subsetting]: 
https://pandas.pydata.org/pandas-docs/stable/indexing.html#boolean-indexing\n   [merging]: https://pandas.pydata.org/pandas-docs/stable/merging.html#database-style-dataframe-joining-merging\n   [joining]: https://pandas.pydata.org/pandas-docs/stable/merging.html#joining-on-index\n   [reshape]: https://pandas.pydata.org/pandas-docs/stable/reshaping.html#reshaping-and-pivot-tables\n   [pivot-table]: https://pandas.pydata.org/pandas-docs/stable/reshaping.html#pivot-tables-and-cross-tabulations\n   [mi]: https://pandas.pydata.org/pandas-docs/stable/indexing.html#hierarchical-indexing-multiindex\n   [flat-files]: https://pandas.pydata.org/pandas-docs/stable/io.html#csv-text-files\n   [excel]: https://pandas.pydata.org/pandas-docs/stable/io.html#excel-files\n   [db]: https://pandas.pydata.org/pandas-docs/stable/io.html#sql-queries\n   [hdfstore]: https://pandas.pydata.org/pandas-docs/stable/io.html#hdf5-pytables\n   [timeseries]: https://pandas.pydata.org/pandas-docs/stable/timeseries.html#time-series-date-functionality\n\n\n### Core Components of Pandas Data Structure\nPandas has two core data structures, and all operations are based on these two objects. A data structure is a particular way of organizing data. 
Here are the two pandas data structures:\n\n* **Series**\n* **DataFrame**\n\n---\n\n## Table of contents 📋\n\n| **No.** | **Name** | \n| ------- | -------- | \n| 01 | **[Python_Pandas_DataFrame](https://github.com/milaan9/10_Python_Pandas_Module/blob/main/001_Python_Pandas_DataFrame.ipynb)** |\n|    | 1.1 **[001_Python_Pandas_DataFrame_from_Dictionary](https://github.com/milaan9/10_Python_Pandas_Module/blob/main/001_Python_Pandas_Methods/001_Python_Pandas_DataFrame_from_Dictionary.ipynb)** |\n|    | 1.2 **[Python_Pandas_DataFrame_from_List](https://github.com/milaan9/10_Python_Pandas_Module/blob/main/001_Python_Pandas_Methods/002_Python_Pandas_DataFrame_from_List.ipynb)** |\n|    | 1.3 **[Python_Pandas_DataFrame_head()_and_tail()](https://github.com/milaan9/10_Python_Pandas_Module/blob/main/001_Python_Pandas_Methods/003_Python_Pandas_DataFrame_head()_and_tail().ipynb)** |\n|    | 1.4 **[004_Python_Pandas_DataFrame_drop_columns](https://github.com/milaan9/10_Python_Pandas_Module/blob/main/001_Python_Pandas_Methods/004_Python_Pandas_DataFrame_drop_columns.ipynb)** |\n|    | 1.5 **[Python_Pandas_DataFrame_drop_duplicates](https://github.com/milaan9/10_Python_Pandas_Module/blob/main/001_Python_Pandas_Methods/005_Python_Pandas_DataFrame_drop_duplicates.ipynb)** |\n|    | 1.6 **[Python_Pandas_DataFrame_drop_columns_with_NA](https://github.com/milaan9/10_Python_Pandas_Module/blob/main/001_Python_Pandas_Methods/006_Python_Pandas_DataFrame_drop_columns_with_NA.ipynb)** |\n|    | 1.7 **[Python_Pandas_DataFrame_rename_columns](https://github.com/milaan9/10_Python_Pandas_Module/blob/main/001_Python_Pandas_Methods/007_Python_Pandas_DataFrame_rename_columns.ipynb)** |\n|    | 1.8 **[Python_Pandas_DataFrame_to_Python_dictionary](https://github.com/milaan9/10_Python_Pandas_Module/blob/main/001_Python_Pandas_Methods/008_Python_Pandas_DataFrame_to_Python_dictionary.ipynb)** |\n|    | 1.9 
**[Python_Pandas_DataFrame_set_index](https://github.com/milaan9/10_Python_Pandas_Module/blob/main/001_Python_Pandas_Methods/009_Python_Pandas_DataFrame_set_index.ipynb)** |\n|    | 1.10 **[Python_Pandas_DataFrame_reset_index](https://github.com/milaan9/10_Python_Pandas_Module/blob/main/001_Python_Pandas_Methods/010_Python_Pandas_DataFrame_reset_index.ipynb)** |\n| 02 | **[Python_Pandas_Exercise_1](https://github.com/milaan9/10_Python_Pandas_Module/blob/main/002_Python_Pandas_Exercise_1.ipynb)** |\n| 03 | **[Python_Pandas_Exercise_2](https://github.com/milaan9/10_Python_Pandas_Module/blob/main/003_Python_Pandas_Exercise_2.ipynb)** |\n|    | **[automobile_data.csv](https://github.com/milaan9/10_Python_Pandas_Module/blob/main/automobile_data.csv)** |\n|    | **[pokemon_data.csv](https://github.com/milaan9/10_Python_Pandas_Module/blob/main/pokemon_data.csv)** |\n| 04 | **[Pandas Cheat Sheet Data Wrangling in Python.pdf](https://github.com/milaan9/10_Python_Pandas_Module/blob/main/Pandas%20Cheat%20Sheet%20Data%20Wrangling%20in%20Python.pdf)** |\n| 05 | **[Pandas Cheat Sheet for Data Science in Python.pdf](https://github.com/milaan9/10_Python_Pandas_Module/blob/main/Pandas%20Cheat%20Sheet%20for%20Data%20Science%20in%20Python.pdf)** |\n\nThese are online **read-only** versions. 
However, you can **`Run ▶`** all the code **online** by clicking here ➞ \u003ca href=\"https://mybinder.org/v2/gh/milaan9/10_Python_Pandas_Module/HEAD\"\u003e\u003cimg src=\"https://mybinder.org/badge_logo.svg\" alt=\"binder\"/\u003e\u003c/a\u003e\n\n---\n\n## Install Pandas Module:\n\nOpen your [![Anaconda](https://img.shields.io/badge/Anaconda-342B029.svg?\u0026style=flate\u0026logo=anaconda\u0026logoColor=white)](https://www.anaconda.com/products/individual) Prompt \u003cimg alt=\"prompt\" src=\"https://img.shields.io/badge/-__-000000?style=flat-square\u0026logo=Plex\u0026logoColor=white\"\u003e, then type and run the following command:\n\n -       pip install pandas  \n \n\nOnce it is installed, you can import pandas in your Python code.\n\n---  \n\n## Frequently asked questions ❔\n\n### How can I thank you for writing and sharing this tutorial? 🌷\n\nYou can \u003cimg src=\"https://img.shields.io/static/v1?label=%E2%AD%90 Star \u0026message=if%20useful\u0026style=style=flat\u0026color=blue\" alt=\"Star Badge\"/\u003e and \u003cimg src=\"https://img.shields.io/static/v1?label=%E2%B5%96 Fork \u0026message=if%20useful\u0026style=style=flat\u0026color=blue\" alt=\"Fork Badge\"/\u003e this repository. Starring and forking are free for you, but they tell me and other people that the tutorial was helpful and that you like it.\n\nGo [**`here`**](https://github.com/milaan9/10_Python_Pandas_Module) if you aren't here already and click the ➞ **`✰ Star`** and **`ⵖ Fork`** buttons in the top right corner. You'll be asked to create a GitHub account if you don't already have one.\n\n---\n\n### How can I read this tutorial without an Internet connection? \u003cimg alt=\"GIF\" src=\"https://github.com/TheDudeThatCode/TheDudeThatCode/blob/master/Assets/hmm.gif\" width=\"20\" /\u003e\n\n1. 
Go [**`here`**](https://github.com/milaan9/10_Python_Pandas_Module) and click the big green ➞ **`Code`** button in the top right of the page, then click ➞ [**`Download ZIP`**](https://github.com/milaan9/10_Python_Pandas_Module/archive/refs/heads/main.zip).\n\n    ![Download ZIP](img/dnld_rep.png)\n\n2. Extract the ZIP and open it. Unfortunately, I can't give more specific instructions, because the exact steps depend on which operating system you run.\n    \n3. Launch ipython notebook from the folder that contains the notebooks. Open each notebook and select:\n  \n    **`Kernel \u003e Restart \u0026 Clear Output`**\n    \nThis clears all the outputs, so you can run each statement yourself and learn interactively.\n\nIf you have git and you know how to use it, you can also clone the repository instead of downloading a zip and extracting it. An advantage of doing it this way is that you don't need to download the whole tutorial again to get the latest version; all you need to do is pull with git and run ipython notebook again.\n\n---\n\n## Authors ✍️\n\nI'm Dr. Milaan Parmar and I have written this tutorial. If you think you can add to, correct, or enhance this tutorial, you are most welcome 🙏\n\nSee [GitHub's contributors page](https://github.com/milaan9/10_Python_Pandas_Module/graphs/contributors) for details.\n\nIf you have trouble with this tutorial, please tell me about it by [creating an issue on GitHub](https://github.com/milaan9/10_Python_Pandas_Module/issues/new), and I'll make this tutorial better. This is probably the best choice if you had trouble following the tutorial and something in it should be explained better. You will be asked to create a GitHub account if you don't already have one.\n\nIf you like this tutorial, please [give it a ⭐ star](https://github.com/milaan9/10_Python_Pandas_Module).\n\n---\n\n## Licence 📜\n\nYou may use this tutorial freely at your own risk. 
See [LICENSE](./LICENSE).\n\n","funding_links":[],"categories":[],"sub_categories":[],"project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fmilaan9%2F10_python_pandas_module","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fmilaan9%2F10_python_pandas_module","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fmilaan9%2F10_python_pandas_module/lists"}