{"id":26868898,"url":"https://github.com/cintia0528/data_science-supervised_machine_learning_classification_housing","last_synced_at":"2025-07-26T14:37:46.682Z","repository":{"id":209511929,"uuid":"724166892","full_name":"Cintia0528/Data_Science-Supervised_Machine_Learning_Classification_Housing","owner":"Cintia0528","description":"The projects aim is to find the to best ML algorithm evaluated on its efficiency in predicting whether homes should be classified as expensive or not expensive.","archived":false,"fork":false,"pushed_at":"2023-12-01T14:49:28.000Z","size":251,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-03-31T05:35:17.948Z","etag":null,"topics":["classification","lazypredict","machinelearning-python","supervised-machine-learning","xgboost-algorithm"],"latest_commit_sha":null,"homepage":"","language":"Jupyter Notebook","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/Cintia0528.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2023-11-27T14:30:06.000Z","updated_at":"2024-02-26T05:34:19.000Z","dependencies_parsed_at":"2023-11-27T19:41:28.615Z","dependency_job_id":"19fdc09d-6ddc-4bc5-a793-3551d40f01f8","html_url":"https://github.com/Cintia0528/Data_Science-Supervised_Machine_Learning_Classification_Housing","commit_stats":null,"previous_names":["cintia0528/project-6-supervised-machine-learning---classification","cintia0528/data_science-supervised_machine_learning_classification_housing"],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Cintia0528%2FData_Science-Supervised_Machine_Learning_Classification_Housing","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Cintia0528%2FData_Science-Supervised_Machine_Learning_Classification_Housing/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Cintia0528%2FData_Science-Supervised_Machine_Learning_Classification_Housing/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Cintia0528%2FData_Science-Supervised_Machine_Learning_Classification_Housing/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/Cintia0528","download_url":"https://codeload.github.com/Cintia0528/Data_Science-Supervised_Machine_Learning_Classification_Housing/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":246423503,"owners_count":20774796,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["classification","lazypredict","machinelearning-python","supervised-machine-learning","xgboost-algorithm"],"created_at":"2025-03-31T05:35:19.938Z","updated_at":"2025-03-31T05:35:21.285Z","avatar_url":"https://github.com/Cintia0528.png","language":"Jupyter Notebook","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Supervised Machine Learning - Classification\n## Goal\nTo classify properties into \"Expensive\" / \"Not Expensive\" categories with the help of Supervised Machine Learning.  \n\n## Overview \nWe are interested in making better investment decisions, and hence evaluating properties based on 70+ features, whether they qualify as expensive or inexpensive properties.\n\n## Context\nTrying out and fine-tuning a variety of Machine Learning models to get the best prediction\n\n1. Is our Machine Learning model predicting the value of properties successfully?\n2. What type of errors are most prone for each of the models?\n\n### Task: \n* Import database of over 1500 properties\n* Explore, analyze and clean over 70 features\n* Try and fine-tune ML models for the best outcome\n\n## Deliverables\nThe **Google Colab Notebook** for trying out different ML algorithms is found [here](https://github.com/Cintia0528/Project-6-Supervised-Machine-Learning---Classification/blob/1c9d84d012a3df27d01b413010a3be04dec79acf/5_b_Housing_Model_Selection_.ipynb).\nFurther Machine Learning experimentation with LazyPredict and VotingClassifier is found [here](https://github.com/Cintia0528/Project-6-Supervised-Machine-Learning--Classification-/blob/aa13379741c4444573b21000712825789fd7ef70/5_c_Housing_Model_Selection_LazyPredict%20(1).ipynb), with a supporting Medium article [here](https://medium.com/@ubp0528/another-ml-puzzle-decoding-the-factors-behind-expensive-homes-a6f096aa91e1).\n\n## Skills \u0026 Tools\n1. Data Reading \u0026 Cleaning \n2. Data Splitting \n3. Building a Preprocessor\n4. Modelling ( Decision Tree, KNN, Random Forest, XGBoost)\n5. Fine Tuning\n6. Error Analysis\n\n## Further Analysis\n1. Perfecting the model with Lazy predict\n2. Pooling individual models' strength with Voting Classifier\n\nNote: In the notebook the Lazypredict + VotingClassifier combo gave us approximately 95%, but when applied to brand new dataset via a Streamlit application it had the highest accuracy with over 97%.  \n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fcintia0528%2Fdata_science-supervised_machine_learning_classification_housing","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fcintia0528%2Fdata_science-supervised_machine_learning_classification_housing","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fcintia0528%2Fdata_science-supervised_machine_learning_classification_housing/lists"}