Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/mgobeaalcoba/survival_predictor_on_the_titanic_scikit_learn

Titanic Survival Predictor using Scikit-Learn: Machine learning model and analysis to predict passenger survival on the Titanic based on historical data.
https://github.com/mgobeaalcoba/survival_predictor_on_the_titanic_scikit_learn

matplotlib numpy pandas python3 scikit-learn seaborn titanic-dataset titanic-kaggle titanic-survival-prediction

Last synced: about 1 month ago
JSON representation

Titanic Survival Predictor using Scikit-Learn: Machine learning model and analysis to predict passenger survival on the Titanic based on historical data.

Awesome Lists containing this project

README

        

# Titanic Survival Predictor using Scikit-Learn

![Titanic](https://upload.wikimedia.org/wikipedia/commons/thumb/6/6e/St%C3%B6wer_Titanic.jpg/800px-St%C3%B6wer_Titanic.jpg)

This project was created in Google Colab and uses machine learning techniques to predict passenger survival on the Titanic based on historical data. The model achieved an accuracy of 81.5%.

## Overview

The Titanic Survival Predictor project aims to demonstrate the power of machine learning in predicting survival outcomes on the famous RMS Titanic. It utilizes popular Python libraries such as Pandas, NumPy, Matplotlib, Seaborn, and Scikit-Learn.

## Dataset

The dataset used in this project is the [Titanic dataset](https://www.kaggle.com/c/titanic) from Kaggle. It contains passenger information such as age, gender, class, and whether or not they survived.

## Methodology

1. **Data Preprocessing**: Data cleaning, handling missing values, and feature engineering.

2. **Exploratory Data Analysis (EDA)**: Visualization using Matplotlib and Seaborn to gain insights into the data.

3. **Machine Learning**: Building a predictive model using Scikit-Learn classifiers. The model achieved an accuracy of 81.5%.

4. **Model Evaluation**: Assessing model performance through accuracy and other relevant metrics.