Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/tiarmdhnt/titanic-classification-pipeline

This repository implements a classification pipeline for the Titanic dataset using Apache Spark. It covers ETL, data preprocessing, and machine learning model building with algorithms like Logistic Regression, Decision Tree, Random Forest, and Gradient-Boosted Tree. Results're presented through visualizations to support data-driven insights.
https://github.com/tiarmdhnt/titanic-classification-pipeline

apache-spark big-data decision-tree etl-pipeline gradient-boosted-trees hdfs logistic-regression machine-learning pyspark random-forest

Last synced: 18 days ago
JSON representation

This repository implements a classification pipeline for the Titanic dataset using Apache Spark. It covers ETL, data preprocessing, and machine learning model building with algorithms like Logistic Regression, Decision Tree, Random Forest, and Gradient-Boosted Tree. Results're presented through visualizations to support data-driven insights.

Awesome Lists containing this project

README

        

# Titanic-Classification-Pipeline
This repository implements a classification pipeline for the Titanic dataset using Apache Spark. It covers ETL, data preprocessing, and machine learning model building with algorithms like Logistic Regression, Decision Tree, Random Forest, and Gradient-Boosted Tree. Results're presented through visualizations to support data-driven insights.