{"id":15063909,"url":"https://github.com/Fedesgh/Building_Credit_Risk_Classifier_Using_Bagging_Kneighbors","last_synced_at":"2025-10-05T00:31:12.460Z","repository":{"id":256633893,"uuid":"855837717","full_name":"Fedesgh/Building_Credit_Risk_Classifier_Using_Bagging_Kneighbors","owner":"Fedesgh","description":"Problem statment about modeling target vector and attempt to improve metrics","archived":false,"fork":false,"pushed_at":"2024-10-25T21:55:12.000Z","size":46212,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-09-07T01:22:47.412Z","etag":null,"topics":["feature-selection","imblearn","information-value","sklearn"],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"apache-2.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/Fedesgh.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2024-09-11T14:37:47.000Z","updated_at":"2024-10-25T21:55:15.000Z","dependencies_parsed_at":null,"dependency_job_id":"5f662892-8062-42d5-9bf1-cb26ac036f61","html_url":"https://github.com/Fedesgh/Building_Credit_Risk_Classifier_Using_Bagging_Kneighbors","commit_stats":null,"previous_names":["fedesgh/credit_risk_clf","fedesgh/building_credit_risk_classifier_using_bagging_kneighbors"],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/Fedesgh/Building_Credit_Risk_Classifier_Using_Bagging_Kneighbors","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Fedesgh%2FBuilding_Credit_Risk_Classifier_Using_Bagging_Kneighbors","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Fedesgh%2FBuilding_Credit_Risk_Classifier_Using_Bagging_Kneighbors/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Fedesgh%2FBuilding_Credit_Risk_Classifier_Using_Bagging_Kneighbors/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Fedesgh%2FBuilding_Credit_Risk_Classifier_Using_Bagging_Kneighbors/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/Fedesgh","download_url":"https://codeload.github.com/Fedesgh/Building_Credit_Risk_Classifier_Using_Bagging_Kneighbors/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Fedesgh%2FBuilding_Credit_Risk_Classifier_Using_Bagging_Kneighbors/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":278395874,"owners_count":25979685,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","status":"online","status_checked_at":"2025-10-04T02:00:05.491Z","response_time":63,"last_error":null,"robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["feature-selection","imblearn","information-value","sklearn"],"created_at":"2024-09-25T00:08:38.962Z","updated_at":"2025-10-05T00:31:07.451Z","avatar_url":"https://github.com/Fedesgh.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"The motivation for this repository are the difficulties that the dataset present when we define the Target and features.\nAlso , there are several attempts  in kaggle with low metrics particularly when we restrict the training set to features with information before the loan was granted and we want try to improve it:\n\nhttps://www.kaggle.com/datasets/devanshi23/loan-data-2007-2014/data \n\n## Problems at defining the target\n\n\"loan_status\" has the followings values: \n\u003col\u003e\n\u003cli\u003eCurrent                                                \u003c/li\u003e\n\u003cli\u003eFully Paid                                             \u003c/li\u003e\n\u003cli\u003eCharged Off                                             \u003c/li\u003e\n\u003cli\u003eLate (31-120 days)                                       \u003c/li\u003e\n\u003cli\u003eIn Grace Period                                          \u003c/li\u003e\n\u003cli\u003eDoes not meet the credit policy. Status:Fully Paid       \u003c/li\u003e\n\u003cli\u003eLate (16-30 days)                                        \u003c/li\u003e\n\u003cli\u003eDefault                                                   \u003c/li\u003e\n\u003cli\u003eDoes not meet the credit policy. Status:Charged Off       \u003c/li\u003e\n\u003c/ol\u003e\n\nThe main point we must consider is that the values belong to differents moments in the loan life span. \n\nThose that belong to an end of the Loan:\n\u003col\u003e\n\u003cli\u003eFully Paid                                             \u003c/li\u003e\n\u003cli\u003eCharged Off                                             \u003c/li\u003e\n\u003cli\u003eDoes not meet the credit policy. Status:Fully Paid       \u003c/li\u003e\n\u003cli\u003eDefault                                                   \u003c/li\u003e\n\u003cli\u003eDoes not meet the credit policy. Status:Charged Off       \u003c/li\u003e\n\u003c/ol\u003e\n\nMiddle term of a loan:\n\n\u003col\u003e\n\u003cli\u003eCurrent                                                \u003c/li\u003e\n\u003cli\u003eLate (31-120 days)                                       \u003c/li\u003e\n\u003cli\u003eLate (16-30 days)                                        \u003c/li\u003e\n\u003c/ol\u003e\n\n\nwhile In Grace Period belongs to the beginning. \n\nOn top of this we should consider:\n\n\u003col\u003e\n\u003cli\u003eAll the loans regardless its end, were previously in time \"In Period Grace\"\u003c/li\u003e\n\u003cli\u003eAll the loans regardless its end, were previously in time Current and/or Late\u003c/li\u003e\n\u003c/ol\u003e\n\nFIRST MODEL (STRICT): \n\n\"Good loans\": \n\u003col\u003e\n\u003cli\u003eFully Paid                                             \u003c/li\u003e\n\u003c/ol\u003e\n\n\"Bad loans\": \n\n\u003col\u003e\n\u003cli\u003eCharged Off                                             \u003c/li\u003e\n\u003cli\u003eDoes not meet the credit policy. Status:Fully Paid       \u003c/li\u003e\n\u003cli\u003eDefault                                                   \u003c/li\u003e\n\u003cli\u003eDoes not meet the credit policy. Status:Charged Off       \u003c/li\u003e\n\u003c/ol\u003e\n\nWe just consider ends of loans categorys in the target, and we should consider only features in X_train set that belong before\nthe loan was granted.","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2FFedesgh%2FBuilding_Credit_Risk_Classifier_Using_Bagging_Kneighbors","html_url":"https://awesome.ecosyste.ms/projects/github.com%2FFedesgh%2FBuilding_Credit_Risk_Classifier_Using_Bagging_Kneighbors","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2FFedesgh%2FBuilding_Credit_Risk_Classifier_Using_Bagging_Kneighbors/lists"}