Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/farscent/oemoem2024-dsai-challenge

My submission for Omah-Ti's DSAI Data Analysis Challenge.
https://github.com/farscent/oemoem2024-dsai-challenge

exploratory-data-analysis jupyter-notebook linear-regression machine-learning python random-forest

Last synced: 23 days ago
JSON representation

My submission for Omah-Ti's DSAI Data Analysis Challenge.

Awesome Lists containing this project

README

        

# Oemoem2024-DSAI-Challenge
My submission for Omah-Ti UGM's DSAI Data Analysis Challenge.

The dataset given were about Clams and their physical characteristics. I made a predictive model that can predict their age.
With my submission, I obtained the Best Project winner in the Data Science and Artificial Intelligence Class of OemOem.

## Project Overview

This notebook explores a dataset containing physical measurements of clams, such as length, diameter, height, and weight. The primary objective is to predict the age of the clams using these variables through **Exploratory Data Analysis (EDA)** and **machine learning models**.

---

## Key Features

1. **Exploratory Data Analysis (EDA)**:
- Examines the relationships between physical measurements and age.
- Visualizes distributions and correlations between features.
2. **Feature Engineering**:
- Combines dimensions (e.g., length, diameter, height) to calculate volume.
- Encodes categorical variables such as gender for model training.
3. **Data Preprocessing**:
- Handles missing values and outliers.
- Scales numerical features for model compatibility.
4. **Modeling**:
- Builds machine learning models to predict clam age.
- Evaluates model performance using metrics like accuracy and F1-score.

---