https://github.com/cyberfantics/titanic-dataset-analysis
https://github.com/cyberfantics/titanic-dataset-analysis
Last synced: 5 months ago
JSON representation
- Host: GitHub
- URL: https://github.com/cyberfantics/titanic-dataset-analysis
- Owner: cyberfantics
- Created: 2024-08-23T01:20:46.000Z (almost 2 years ago)
- Default Branch: main
- Last Pushed: 2024-10-01T17:59:38.000Z (over 1 year ago)
- Last Synced: 2025-04-09T23:13:34.855Z (about 1 year ago)
- Language: Jupyter Notebook
- Size: 2.34 MB
- Stars: 3
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Titanic Dataset Analysis
## Overview
This project analyzes the Titanic dataset to explore factors affecting passenger survival. The dataset includes features such as class, sex, fare, and survival status.
## Key Steps
1. **Data Cleaning**:
- Removed columns with many missing values (`Cabin`, `Age`).
- Dropped unnecessary columns (`PassengerId`, `Name`, `Ticket`).
2. **Exploration & Visualization**:
- Analyzed survival by gender, class, and embarkation point.
- Visualized sibling distribution and its impact on survival.
## Findings
- **Gender**: Higher survival rates for females.
- **Class**: Higher-class passengers had better survival chances.
- **Embarkation**: Survival patterns varied by embarkation point.
## Usage
1. Clone the repository:
```bash
git clone https://github.com/cyberfantics/pdf-merger
```
2. Install dependencies:
```pip install pandas matplotlib seaborn```
## Contact
Presented by: Mansoor Bukhari