{"id":20276238,"url":"https://github.com/balajimohan18/loan-classification-datascience-project","last_synced_at":"2025-09-03T07:37:04.927Z","repository":{"id":179622505,"uuid":"663797835","full_name":"Balajimohan18/Loan-Classification-Datascience-Project","owner":"Balajimohan18","description":"This project uses machine learning algorithms to predict the classification of loan status. The dataset is loaded and some transformation is done using SQL for getting a proper dataset with some valid informations.","archived":false,"fork":false,"pushed_at":"2023-07-08T07:17:31.000Z","size":1123,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-03-04T01:33:34.200Z","etag":null,"topics":["classification","data-analysis","data-cleaning","data-science","data-visualization","loan-prediction","loan-status","machine-learning","sql","supervised-learning"],"latest_commit_sha":null,"homepage":"","language":"Jupyter Notebook","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/Balajimohan18.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2023-07-08T05:58:05.000Z","updated_at":"2023-07-18T09:36:20.000Z","dependencies_parsed_at":null,"dependency_job_id":"ac51a59c-1a72-43b3-b756-a41dd464f3f5","html_url":"https://github.com/Balajimohan18/Loan-Classification-Datascience-Project","commit_stats":null,"previous_names":["balajimohan18/loan-classification-datascience-project"],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/Balajimohan18/Loan-Classification-Datascience-Project","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Balajimohan18%2FLoan-Classification-Datascience-Project","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Balajimohan18%2FLoan-Classification-Datascience-Project/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Balajimohan18%2FLoan-Classification-Datascience-Project/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Balajimohan18%2FLoan-Classification-Datascience-Project/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/Balajimohan18","download_url":"https://codeload.github.com/Balajimohan18/Loan-Classification-Datascience-Project/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Balajimohan18%2FLoan-Classification-Datascience-Project/sbom","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":263437341,"owners_count":23466367,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["classification","data-analysis","data-cleaning","data-science","data-visualization","loan-prediction","loan-status","machine-learning","sql","supervised-learning"],"created_at":"2024-11-14T13:12:58.305Z","updated_at":"2025-07-04T03:06:27.846Z","avatar_url":"https://github.com/Balajimohan18.png","language":"Jupyter Notebook","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Loan-Classification-Datascience-Project\nThis project uses machine learning algorithms to predict the classification of loan status. The dataset is loaded and some transformation is done using SQL for getting a proper dataset with some valid informations.\n# Loan-Classification-Datascience-Project\nThis project uses machine learning algorithms to predict the classification of loan status. The dataset is loaded and some transformation is done using SQL for getting a proper dataset with some valid informations. The project uses a loan applications which includes information about the individuals who are applying for the loan. With that data, training the model and getting the high accuracy model and using it predicting the future loan status with the needed data.\n## Problem Statement :\nLoan Lenders need to be able to accurately predict whether the loan will be repaid inorder to minimize the risk. By predicting the loan status of every individual, the lenders can able to follow the results of it to find out the differences among the behaviourial pattern of the customer. This is challenging problem because there are many factors that can influence the loan status.\n## Solution Approach :\nWith the help of Structured Query Language (SQL), transforming the data with the bunch of datasets which has different informations on Customer's behaviourial pattern. By the following structural mapping, joining the datasets to get the final data.  \n\n\n`image` ![Dataset SQL Formatting](https://github.com/shridhar1504/Loan-Classification-Datascience-Project/assets/113985416/984a2561-6d2f-4656-bfe2-e47f970c175d)\n\n\nMachine Learning can be used to build models that can predict the loan status such as A, B, C \u0026 D (i.e., Approved, Pending, Denied, Closed). These models are trained on historical data of loan applications and using this data which learns with the relationship between factors that impacts on status of the loan. Once a model is trained, it can be used to predict the loan status of the future applicants.\n## Observations :\nThe accuracy of the loan classification model can vary depending on the dataset that is used to train the model. The Bank had given applicant's details in individual CSV Files. For the predictive modeling,by using SQL; the datasets should be joined or merged using various statements which can have all the necessary column to form the final data in a csv format. The individual datasets and details of the datasets are as follows:\n\n * Account - The dataset has account id, district id, frequency \u0026 date.\n * Card - The dataset has card id, disposition id, type \u0026 issued.\n * Client - The dataset has client id , birth number alomg with district id.\n * Disp - The dataset has disposition id, client id, account id, type.\n * District - The dataset has different factors such as A1 - A16 which includes much datas than the other datasets but not explained in a well manner.\n * Loan - The dataset has loan id, account id, date, amount, duartion, payment \u0026 status.\n * Order - The dataset has order id, account id, bank to, account to, amount, k_symbol.\n * Transaction Data - The dataset has transaction id, account id, date, type, operation, amount, balance, k_symbol, bank \u0026 account.\n## Findings :\nThe most important factors in determining the status of the loan by the applicant's data such as debt to income ratio, length of credit history, whether they have bankruptcies or foreclosure on their record. The accuracy of loan classification models can be improved by using more data and by using more sophisticated machine learning algorithms.\n## Insights :\nLoan classification models can be used to improve the efficiency of loan lending by helping lenders to make more informed decisions about whether to approve or deny loan applications. Loan classification models can also be used to identify borrowers who are at a higher risk of default, which can help lenders to take steps to mitigate their risk.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fbalajimohan18%2Floan-classification-datascience-project","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fbalajimohan18%2Floan-classification-datascience-project","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fbalajimohan18%2Floan-classification-datascience-project/lists"}