{"id":15063952,"url":"https://github.com/Fedesgh/Asteorid_RandomForest_Classifier","last_synced_at":"2025-10-05T06:31:05.823Z","repository":{"id":256644022,"uuid":"850407324","full_name":"Fedesgh/Asteorid_RandomForest_Classifier","owner":"Fedesgh","description":"Classifier model trained with unbalanced dataset ready for deployment ","archived":false,"fork":false,"pushed_at":"2024-10-24T22:44:37.000Z","size":14442,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"main","last_synced_at":"2024-11-22T05:40:25.261Z","etag":null,"topics":["imblearn","pandas","pickle","seaborn","sklearn"],"latest_commit_sha":null,"homepage":"","language":"Jupyter Notebook","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"apache-2.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/Fedesgh.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2024-08-31T17:24:46.000Z","updated_at":"2024-10-24T22:44:40.000Z","dependencies_parsed_at":"2024-09-12T07:16:37.322Z","dependency_job_id":null,"html_url":"https://github.com/Fedesgh/Asteorid_RandomForest_Classifier","commit_stats":null,"previous_names":["fedesgh/asteorid_clf","fedesgh/asteorid_randomforest_classifier"],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Fedesgh%2FAsteorid_RandomForest_Classifier","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Fedesgh%2FAsteorid_RandomForest_Classifier/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Fedesgh%2FAsteorid_RandomForest_Classifier/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Fedesgh%2FAsteorid_RandomForest_Classifier/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/Fedesgh","download_url":"https://codeload.github.com/Fedesgh/Asteorid_RandomForest_Classifier/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":235170461,"owners_count":18946982,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["imblearn","pandas","pickle","seaborn","sklearn"],"created_at":"2024-09-25T00:09:14.098Z","updated_at":"2025-10-05T06:31:00.752Z","avatar_url":"https://github.com/Fedesgh.png","language":"Jupyter Notebook","funding_links":[],"categories":[],"sub_categories":[],"readme":"## The project\nThe goal of the project is to create a classifier with **high recall**, and pickle it in order to be ready for deployment.\n\n\nThe dataset was downloaded from Kaggle: https://www.kaggle.com/datasets/ivansher/nasa-nearest-earth-objects-1910-2024\n\n\nWe must build a predictive model able to detect hazardous asteroids, for its nature we are intereset in **recall** in other words we prefer false alarms instead to dont detect such dangerous asteroids.\n\n\n\n\n\n## Models\n\nWe train using **GridSearchCV** severals models: **Kneboirghs** , **SVC** , **RandomForest**, **VotingClassifiers** \n\n\nAlso we use **SMOTETomek** due to the imbalance of the data: **0.127 of the data are hazardous asteroid**, with a total data of 338166 rows.\n\n\n\n![images/pairplot.png](images/pairplot.png)\n\n\n\n## The best model\n\nOur best model is **clf4** wich is **RandomForest** with 0.70 of recall and 0.82 of ROC_AUC\n\n![images/modelsummary.jpg](images/modelsummary.jpg)\n\n\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2FFedesgh%2FAsteorid_RandomForest_Classifier","html_url":"https://awesome.ecosyste.ms/projects/github.com%2FFedesgh%2FAsteorid_RandomForest_Classifier","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2FFedesgh%2FAsteorid_RandomForest_Classifier/lists"}