Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/zescalante/data1030-final-project

Final project for DATA1030
https://github.com/zescalante/data1030-final-project

data-science machine-learning scikit-learn

Last synced: 7 days ago
JSON representation

Final project for DATA1030

Awesome Lists containing this project

README

        

# Binary Classification in a Galaxy Cluster Field Using Machine Learning
Fall 2024 - DATA 1030 - Final Project

_Author: Zacharias Escalante_

This project entails analyzing a catalog of source information contained in the field of the Abell 3266 galaxy cluster field, created using data from the LoVoCCS survey and other external surveys. Exploratory data analysis is first carried out, followed by the engineering of new features, as well as data splitting and preprocessing in preparation for passing into a machine learing pipeline.

## Contents

`Galaxies_Binary_Classif.ipynb`: Jupyter notebook containing data analysis (EDA, Splitting, Preprocessing).

`data_description.txt`: A Brief summary of columns (features) in the dataset.

`DATA_1030_Midterm_Presentation.pdf`: Midterm presentation slides explaining origin of dataset, EDA splitting, and preprocessing.