Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/madhurimarawat/data-wrangling

This repository contains experiments on data wrangling techniques, focusing on methods for handling missing values, filtering, aggregation, and more.
https://github.com/madhurimarawat/data-wrangling

data-filtering data-wrangling handling-missing-values python

Last synced: about 13 hours ago
JSON representation

This repository contains experiments on data wrangling techniques, focusing on methods for handling missing values, filtering, aggregation, and more.

Awesome Lists containing this project

README

        

# Data-Wrangling
This repository contains experiments on data wrangling techniques, focusing on methods for handling missing values, filtering, aggregation, and more.

## Python

Python is a high-level, interpreted programming language widely used in data science for data manipulation, analysis, and visualization. Libraries such as Pandas and NumPy provide powerful tools for data wrangling, including handling missing values, filtering, and reshaping datasets.

## Directory Structure πŸ“‚

```
Data-Wrangling/
β”‚
β”œβ”€β”€ Experiment 1 - Handling Missing Values/
β”‚ β”œβ”€β”€ Handling_Missing_Values.ipynb
β”‚
β”œβ”€β”€ Experiment 2 - Data Filtering/
β”‚ β”œβ”€β”€ Data_Filtering.ipynb
β”‚ β”œβ”€β”€ Experiment 2 Document.docx
β”‚
β”œβ”€β”€ Experiment 3 - Data Aggregation/
β”‚ β”œβ”€β”€ Data_Aggregation.ipynb
β”‚ β”œβ”€β”€ Experiment 3 Document.docx/
β”‚
β”œβ”€β”€ Experiment 4 - Data Concatenation/
β”‚ β”œβ”€β”€ Data_Concatenation.ipynb
β”‚
β”œβ”€β”€ Experiment 5 - Data Reshaping/
β”‚ β”œβ”€β”€ Data_Reshaping.ipynb
β”‚
β”œβ”€β”€ Experiment 6 - Data Sampling/
β”‚ β”œβ”€β”€ Data_Sampling.ipynb
β”‚
β”œβ”€β”€ Experiment 7 - Data Conversion/
β”‚ β”œβ”€β”€ Data_Conversion.ipynb
β”‚
└── README.md
```

## Table Of Contents πŸ“” πŸ”– πŸ“‘

### 1. [Handling Missing Values](https://github.com/madhurimarawat/Data-Wrangling/tree/main/Experiment%201)

**Description:**
Identify and fill missing values in a dataset using methods such as mean imputation or forward/backward filling to ensure data completeness and accuracy.

### 2. [Data Filtering](https://github.com/madhurimarawat/Data-Wrangling/tree/main/Experiment%202)

**Description:**
Filter rows or columns based on specified criteria, such as removing outliers or selecting data within a certain range to refine datasets for analysis.

### 3. [Data Aggregation](https://github.com/madhurimarawat/Data-Wrangling/tree/main/Experiment%203)

**Description:**
Aggregate data by grouping rows based on specific attributes and computing summary statistics, such as mean, median, count, or sum. This helps to summarize large datasets for easier analysis.

### 4. [Data Concatenation](https://github.com/madhurimarawat/Data-Wrangling/tree/main/Experiment%204)

**Description:**
Concatenate multiple datasets either along rows or columns to create a unified dataset. This method is useful when merging datasets from different sources or appending new data to an existing dataset.

### 5. [Data Reshaping](https://github.com/madhurimarawat/Data-Wrangling/tree/main/Experiment%205)

**Description:**
Reshape data by pivoting, stacking, or unstacking to convert between wide and long formats. This technique allows for better organization and analysis of data with multiple variables.

### 6. [Data Sampling](https://github.com/madhurimarawat/Data-Wrangling/tree/main/Experiment%206)

**Description:**
Randomly sample rows or columns from a dataset to create a smaller subset for analysis. Sampling is useful for exploratory data analysis, testing models, or handling large datasets efficiently.

### 7. [Data Conversion](https://github.com/madhurimarawat/Data-Wrangling/tree/main/Experiment%207)

**Description:**
Convert data types of columns, such as changing categorical variables to numerical representations or converting numerical values into categories, enabling better processing and analysis of the data.

---

## Thanks for Visiting πŸ˜„

- Drop a 🌟 if you find this repository useful.


- If you have any doubts or suggestions, feel free to reach me.


πŸ“« How to reach me: Β  [![Linkedin Badge](https://img.shields.io/badge/-madhurima-blue?style=flat&logo=Linkedin&logoColor=white)](https://www.linkedin.com/in/madhurima-rawat/) Β  Β 
Mail IllustrationπŸ“«


- **Contribute and Discuss:** Feel free to open issues πŸ›, submit pull requests πŸ› οΈ, or start discussions πŸ’¬ to help improve this repository!