Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/yaminibhole/data_cleaning_and_eda
Last synced: about 1 month ago
- Host: GitHub
- URL: https://github.com/yaminibhole/data_cleaning_and_eda
- Owner: yaminibhole
- Created: 2023-12-11T04:07:31.000Z (about 1 year ago)
- Default Branch: main
- Last Pushed: 2024-01-16T13:49:54.000Z (12 months ago)
- Last Synced: 2024-01-18T03:09:04.723Z (12 months ago)
- Language: Jupyter Notebook
- Size: 117 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
README
# DATA CLEANING AND EXPLORATORY DATA ANALYSIS (EDA)
This repository contains code for data cleaning and exploratory data analysis (EDA) on the Titanic dataset using the `train.csv` file. The dataset includes information about passengers such as their survival status, class, age, and other attributes.

## Overview
- `Task2.ipynb`: Jupyter Notebook containing the Python code for data cleaning and EDA.
- `train.csv`: The dataset used for analysis.

## Prerequisites
Make sure you have the following installed:
- Python 3.x
- Jupyter Notebook
- Required Python libraries (pandas, matplotlib, seaborn)

## Data Cleaning and EDA
The `Task2.ipynb` notebook includes step-by-step code for:
1. Loading the 'train.csv' dataset
2. Handling missing values
3. Exploring basic statistics
4. Visualizing the distribution of numerical features
5. Exploring survival rates
6. Analyzing survival by class, sex, and age
7. Creating visualizations for better understanding
8. Generating a correlation heatmap
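
The steps above can be sketched roughly as follows. This is a minimal illustration, not the code from `Task2.ipynb`: it assumes `train.csv` uses the standard Kaggle Titanic column names (`Survived`, `Pclass`, `Sex`, `Age`, `Embarked`, `Cabin`), and the helper names, the fill strategy for missing values, and the age bands are all hypothetical choices made for this example.

```python
import pandas as pd


def load_data(path="train.csv"):
    """Step 1: load the dataset from the CSV file named in this README."""
    return pd.read_csv(path)


def clean(df):
    """Step 2: handle missing values (an assumed strategy, not the notebook's)."""
    df = df.copy()  # leave the caller's DataFrame untouched
    # Fill missing ages with the median age.
    df["Age"] = df["Age"].fillna(df["Age"].median())
    # Fill missing embarkation ports with the most common port.
    df["Embarked"] = df["Embarked"].fillna(df["Embarked"].mode()[0])
    # Assumption: Cabin is too sparse to be useful, so drop it if present.
    return df.drop(columns=["Cabin"], errors="ignore")


def survival_rates(df):
    """Steps 5-6: overall survival rate, then rates by class, sex, and age band."""
    # Hypothetical age bands; each interval is closed on the left.
    age_band = pd.cut(df["Age"], bins=[0, 12, 18, 40, 60, 120], right=False)
    return {
        "overall": df["Survived"].mean(),
        "by_class": df.groupby("Pclass")["Survived"].mean(),
        "by_sex": df.groupby("Sex")["Survived"].mean(),
        "by_age": df.groupby(age_band, observed=True)["Survived"].mean(),
    }


def plot_overview(df):
    """Steps 4, 7, 8: histograms, a survival bar chart, a correlation heatmap."""
    # Imported here so the helpers above work without the plotting libraries.
    import matplotlib.pyplot as plt
    import seaborn as sns

    df.hist(figsize=(10, 8))  # distribution of each numerical feature
    plt.figure()
    sns.barplot(data=df, x="Pclass", y="Survived", hue="Sex")
    plt.figure()
    # Correlations are only defined for numeric columns.
    sns.heatmap(df.select_dtypes("number").corr(), annot=True, cmap="coolwarm")
    plt.show()
```

In a notebook cell, `df = clean(load_data())` followed by `df.describe()` would cover step 3 (basic statistics), and `survival_rates(df)` and `plot_overview(df)` would cover the remaining steps.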