{"id":20443035,"url":"https://github.com/sardhendu/data-science-projects","last_synced_at":"2025-04-12T23:45:14.888Z","repository":{"id":92417702,"uuid":"84698617","full_name":"Sardhendu/Data-Science-Projects","owner":"Sardhendu","description":"{PySpark, R, Python}: Several Data Science projects ","archived":false,"fork":false,"pushed_at":"2018-05-14T03:37:20.000Z","size":108404,"stargazers_count":15,"open_issues_count":0,"forks_count":8,"subscribers_count":3,"default_branch":"master","last_synced_at":"2025-04-12T23:44:32.806Z","etag":null,"topics":["autoencoder","bayesian-methods","boosting-algorithms","classification","credit-card-fraud","deep-neural-networks","linear-regression","logistic-regression","machine-learning-algorithms","pyspark","python-3","random-forest","regression","svm-classifier","tensorflow"],"latest_commit_sha":null,"homepage":"","language":"Jupyter Notebook","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/Sardhendu.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2017-03-12T04:03:58.000Z","updated_at":"2025-04-06T20:49:39.000Z","dependencies_parsed_at":"2023-04-03T16:33:01.519Z","dependency_job_id":null,"html_url":"https://github.com/Sardhendu/Data-Science-Projects","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Sardhendu%2FData-Science-Projects","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Sardhendu%2FData-Science-Projects/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Sardhendu%2FData-Science-Projects/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Sardhendu%2FData-Science-Projects/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/Sardhendu","download_url":"https://codeload.github.com/Sardhendu/Data-Science-Projects/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":248647257,"owners_count":21139081,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["autoencoder","bayesian-methods","boosting-algorithms","classification","credit-card-fraud","deep-neural-networks","linear-regression","logistic-regression","machine-learning-algorithms","pyspark","python-3","random-forest","regression","svm-classifier","tensorflow"],"created_at":"2024-11-15T09:43:51.499Z","updated_at":"2025-04-12T23:45:14.883Z","avatar_url":"https://github.com/Sardhendu.png","language":"Jupyter Notebook","readme":"# Data-Science-Projects:\n\n\n\n## Techniques:\n\t\n**Feature Selection**:\n\n   * PCA (Principal Component Analysis)\n   * AIC (Akiake Information criterion)\n   * BIC (Bayesian Information criterion)\n   * LASSO (Least Absolute Shrinkage and Selection Operator)\n\n\n\n1. [Credit Card Fraud Detection](https://github.com/Sardhendu/Data-Science-Projects/tree/master/CreditCardFraudDetection): {Python: Sckit-learn, Tensorflow, R} (Ongoing\n\n * Models:\n    1. Random Forest\n    2. Gradient Boosting\n    3. XGBoost\n    4. Deep Neural Nets\n    5. Autoencoders\n    6. Bayesian Methods  \n\n2. [Diabetic-Readmission Analysis](https://github.com/Sardhendu/Data-Science-Projects/blob/master/Diabetic-Readmission/DiabeticReadmission-Spark.ipynb): {PySpark, R}\n\n * Classification:\n    1. GLM {RIDGE/LASSO/ELNET}\n    2. Random Forests\n\n\n3. [Crime Prediction](https://github.com/Sardhendu/Data-Science-Projects/blob/master/Crime-Prediction/crimePrediction.ipynb): {Python: Sckit-learn}\n\n * Regression:\n    1. Linear Regression\n    2. Polynomial Regression\n\n * Classification:\n    1. Decision Trees\n    2. Gaussian Naive Bayes\n    3. Support Vector Machines, Linear SVC, POLY, RBF\n    4. Random Forests\n\n4. [Credit default](https://github.com/Sardhendu/Data-Science-Projects/blob/master/Credit-Defaulters/CreditDefault.ipynb): {R}:\n\n * Classification:\n    1. Logistic Regression (GLM): RIDGE/LASSO\n    2. Naive Bayes\n    3. Decision Trees\n    4. Random Forests      \n    \n5. [Loan Default](https://github.com/Sardhendu/Data-Science-Projects/tree/master/Loan-Defaults): {R}\n \n * Classification:\n    1.  GLM (Generalized Linear Model)\n\n--\u003e Data {source URL} : \n\t\t1. http://archive.ics.uci.edu/ml/\n\t\t2. https://www.lendingclub.com/info/download-data.action\n\n","funding_links":[],"categories":[],"sub_categories":[],"project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fsardhendu%2Fdata-science-projects","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fsardhendu%2Fdata-science-projects","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fsardhendu%2Fdata-science-projects/lists"}