{"id":22627745,"url":"https://github.com/annaanastasy/mushroom-binary-classification-eda-ml","last_synced_at":"2025-03-29T03:44:47.261Z","repository":{"id":253144900,"uuid":"841586883","full_name":"AnnaAnastasy/Mushroom-Binary-Classification-EDA-ML","owner":"AnnaAnastasy","description":"Explored and modeled a competition dataset of mushroom species, focusing on data cleaning, exploratory data analysis, and building machine learning models for accurate classification of edible and poisonous mushrooms.","archived":false,"fork":false,"pushed_at":"2024-11-21T14:54:46.000Z","size":4260,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-02-03T13:43:55.906Z","etag":null,"topics":["binary-classification","data","data-cleaning-and-preprocessing","data-science","exploratory-data-analysis","machine-learning-algorithms","xgboost-classifier"],"latest_commit_sha":null,"homepage":"https://www.kaggle.com/code/annastasy/ps4e8-data-cleaning-and-eda-of-mushrooms","language":"Jupyter Notebook","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/AnnaAnastasy.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2024-08-12T17:47:57.000Z","updated_at":"2024-11-21T14:58:04.000Z","dependencies_parsed_at":"2024-11-06T20:21:01.471Z","dependency_job_id":"dad6f1fb-6f35-4326-ac02-ce72c9b01fcf","html_url":"https://github.com/AnnaAnastasy/Mushroom-Binary-Classification-EDA-ML","commit_stats":null,"previous_names":["annaanastasy/binary-prediction-of-poisonous-mushrooms"],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/AnnaAnastasy%2FMushroom-Binary-Classification-EDA-ML","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/AnnaAnastasy%2FMushroom-Binary-Classification-EDA-ML/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/AnnaAnastasy%2FMushroom-Binary-Classification-EDA-ML/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/AnnaAnastasy%2FMushroom-Binary-Classification-EDA-ML/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/AnnaAnastasy","download_url":"https://codeload.github.com/AnnaAnastasy/Mushroom-Binary-Classification-EDA-ML/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":246135741,"owners_count":20729056,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["binary-classification","data","data-cleaning-and-preprocessing","data-science","exploratory-data-analysis","machine-learning-algorithms","xgboost-classifier"],"created_at":"2024-12-09T01:16:07.638Z","updated_at":"2025-03-29T03:44:47.241Z","avatar_url":"https://github.com/AnnaAnastasy.png","language":"Jupyter Notebook","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Mushroom Classification: Data Cleaning, EDA, and Machine Learning Model\n\nThis project was part of a competitive data science challenge, aiming to classify mushroom species as edible or poisonous based on various features. The focus was on rigorous data cleaning, exploratory data analysis (EDA), and extracting meaningful insights to support accurate classification models.\n\n## Project Overview\nMushrooms are a fascinating and diverse group of organisms, but some species can be highly toxic. This competition provided an opportunity to analyze a detailed dataset and uncover patterns that differentiate edible mushrooms from poisonous ones. Our efforts focused on:\n- **Data Cleaning**:  Addressed missing or inconsistent values and ensured data readiness for analysis.\n- **EDA**: Used advanced visualization techniques to uncover trends, patterns, and feature correlations.\n- **Machine Learning Models**: Built and evaluated multiple models to classify mushrooms as edible or poisonous.\n\n## Dataset\nThe dataset used for this project is available on Kaggle. Please download it from the following [link](https://www.kaggle.com/competitions/playground-series-s4e8).\nAfter downloading, ensure the dataset is placed in the same folder as the project.\n\n## Results\n\n### Exploratory Data Analysis:\n- Identified key features influencing mushroom edibility (e.g., odor, gill size).\n- Visualized feature distributions and correlations.\n\n### Model Performance:\n- Achieved high accuracy with Gradient Boosting model (**XGBoost**).\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fannaanastasy%2Fmushroom-binary-classification-eda-ml","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fannaanastasy%2Fmushroom-binary-classification-eda-ml","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fannaanastasy%2Fmushroom-binary-classification-eda-ml/lists"}