https://github.com/mardavsj/pandas-in-python

The fundamentals of Python Pandas Library.
analysis-tool data-manipulation pandas python


[![General badge](https://img.shields.io/badge/documentation-red.svg)](https://pandas.pydata.org/docs/)
[![license](https://img.shields.io/github/license/mardavsj/NumPy-in-Python.svg)](https://github.com/mardavsj/Pandas-in-Python/blob/main/LICENSE)

![pandas_logo](https://camo.githubusercontent.com/60ef9d8e7273373857bd6fb871678ef4fc5367433589be0da330d6972a305bda/68747470733a2f2f70616e6461732e7079646174612e6f72672f7374617469632f696d672f70616e6461735f77686974652e737667)

Pandas is a powerful, open-source Python library for data manipulation and analysis. It is built on top of NumPy, so many of its data structures use or mirror NumPy arrays. It is well suited to tabular data, such as spreadsheets or SQL tables, and provides data structures and functions for working with that data efficiently.
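
As a minimal sketch of the relationship between Pandas and NumPy, the snippet below builds a small table and reads one column back as a NumPy array. The column names and values are made up for illustration.

```python
import numpy as np
import pandas as pd

# A small table, like a spreadsheet with two columns (illustrative values).
df = pd.DataFrame({"price": [10.0, 12.5, 9.0], "quantity": [3, 1, 4]})

# Each column is a Series; its values can be read as a NumPy array.
prices = df["price"].to_numpy()
print(type(prices))  # <class 'numpy.ndarray'>

# Arithmetic is applied element-wise across whole columns, as in NumPy.
df["total"] = df["price"] * df["quantity"]
print(df["total"].sum())  # 78.5
```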

[![Python](https://img.shields.io/badge/Python-14354C?style=for-the-badge&logo=python&logoColor=white&color=blue)](https://github.com/python/)
[![Pandas](https://img.shields.io/badge/pandas-%23150458.svg?style=for-the-badge&logo=pandas&logoColor=white&color=orange)](https://github.com/pandas-dev/pandas)

## Pre-requisites :

* Python
* pip or conda (whichever package manager you use)

## Installation :

Install Pandas with pip :

```bash
pip install pandas
```

Install Pandas with conda :

```bash
# Create a minimal environment with only Python and pandas installed.
conda create -c conda-forge -n name_of_my_env python pandas

# Activate the environment (conda 4.6+, same command on Linux, macOS and Windows) :
conda activate name_of_my_env

# On older conda versions, use instead :
#   source activate name_of_my_env    (Linux / macOS)
#   activate name_of_my_env           (Windows)
```
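
Whichever installer you used, a quick check that the installation worked is to import the library from Python and print its version. The version string depends on what was installed, so no particular output is assumed here.

```python
import pandas as pd

# If this import succeeds, pandas is installed in the active environment.
print(pd.__version__)
```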


## Why use Pandas ?

* Pandas lets us analyze large data sets and draw conclusions from them using statistical methods.

* Pandas can clean messy data sets, making them readable and relevant.

* Clean, relevant data is essential in data science, because every later analysis depends on it.
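
The cleaning step mentioned above might look like the following sketch. The data set is hypothetical and the choice of steps (trim text, convert types, drop incomplete and duplicate rows) is only one reasonable approach, not a fixed recipe.

```python
import pandas as pd

# Hypothetical messy data: stray spaces, inconsistent casing,
# a non-numeric reading, a missing city and a duplicate row.
raw = pd.DataFrame({
    "city": [" london", "Paris ", "paris", "Berlin", None],
    "temp_c": ["18", "21", "21", "n/a", "15"],
})

clean = raw.copy()
# Normalize the text column: trim whitespace and use consistent casing.
clean["city"] = clean["city"].str.strip().str.title()
# Convert readings to numbers; values that cannot be parsed become NaN.
clean["temp_c"] = pd.to_numeric(clean["temp_c"], errors="coerce")
# Drop rows with missing values, then drop exact duplicates.
clean = clean.dropna().drop_duplicates().reset_index(drop=True)

print(clean)
print(clean["temp_c"].mean())  # 19.5
```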

## Why Pandas over Excel ?

* Programmatic Data Manipulation :- Pandas expresses data transformations as Python code, so complex or repetitive tasks can be automated and rerun on new data instead of being repeated by hand.

* Efficiency with Large Datasets :- Pandas runs vectorized operations on top of NumPy, so it typically processes large datasets faster than Excel, which is limited to 1,048,576 rows per worksheet and can slow down well before that.

* Integration with Python Ecosystem :- Pandas integrates with other Python libraries for data visualization, machine learning, and statistical analysis, giving access to advanced analytics that may not be readily available within Excel.
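
To illustrate the first point, here is a small sketch of a repeatable transformation. The `summarize` function and the sales figures are hypothetical; the idea is that the same function can be rerun on any new table with the same columns.

```python
import pandas as pd

# Hypothetical sales records (made-up values).
sales = pd.DataFrame({
    "region": ["North", "South", "North", "South", "East"],
    "units": [10, 6, 5, 3, 8],
    "unit_price": [2.0, 3.0, 2.0, 3.0, 1.5],
})


def summarize(df):
    """Return total revenue per region, largest first."""
    return (
        df.assign(revenue=df["units"] * df["unit_price"])
        .groupby("region")["revenue"]
        .sum()
        .sort_values(ascending=False)
    )


summary = summarize(sales)
print(summary)
```

In a spreadsheet the same summary would usually need a helper column plus a pivot table, rebuilt or refreshed for each new file.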

## Video (Pandas Playlist) & Blog Tutorial :

[![Video_tutorial](https://img.shields.io/badge/YouTube-FF0000?style=for-the-badge&logo=youtube&logoColor=white)](https://www.youtube.com/playlist?list=PLjVLYmrlmjGdEE2jFpL71LsVH5QjDP5s4)
[![Blog_tutorial](https://img.shields.io/badge/Medium-12100E?style=for-the-badge&logo=medium&logoColor=black&color=white)](https://medium.com/analytics-vidhya/introduction-to-pandas-90b75a5c2278)