https://github.com/astrosica/data-science-portfolio
https://github.com/astrosica/data-science-portfolio
Last synced: 5 months ago
JSON representation
- Host: GitHub
- URL: https://github.com/astrosica/data-science-portfolio
- Owner: astrosica
- Created: 2024-06-14T20:55:49.000Z (12 months ago)
- Default Branch: main
- Last Pushed: 2024-07-03T16:30:41.000Z (11 months ago)
- Last Synced: 2024-11-15T09:28:59.040Z (7 months ago)
- Language: Jupyter Notebook
- Size: 48.4 MB
- Stars: 3
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Data Science Portfolio
This repository contains my data science portfolio projects for continuous learning.
## Data Projects### Machine Learning (Python)
* [Credit Card Application Prediction](https://github.com/astrosica/data-science-portfolio/blob/main/Machine%20Learning/Projects/Credit%20Card%20Approval/Credit%20Card%20Approval.ipynb): Developed a prediction model that determines whether a credit card application will be approved or denied using Logistic Regression, KNN, and Random Forest models in Python.
### Reporting and Dashboards (SQL, Tableau, Excel)
* [Insurance Analysis](https://github.com/astrosica/data-science-portfolio/tree/main/Reporting%20and%20Dashboards/Insurance%20Claims%20Analysis): Developed an interactive Tableau dashboard to report and analyze 70K insurance claims, providing actionable insights to guide future marketing and budget decisions as a PowerPoint presentation.
* [Marketing Analysis](https://github.com/astrosica/data-science-portfolio/tree/main/Reporting%20and%20Dashboards/e-Commerce%20Marketing%20Analysis): Performed exploratory analysis and data validation of 100K sales records for a sample e-commerce company using Excel and SQL (Google BigQuery). Developed an interactive Tableau dashboard to report sales and marketing metrics.
* [TTC Delay Analysis](https://github.com/astrosica/data-science-portfolio/tree/main/Reporting%20and%20Dashboards/TTC%20Delay%20Analysis): Performed exploratory analysis and data cleaning of 40K subway delays for 2022-2023 using SQL and Tableau to investigate performance metrics, YoY KPIs, and performance strategies.## Learning Projects
### Machine Learning (Python)
* [Predicting loan repayments](https://github.com/astrosica/data-science-portfolio/blob/main/Machine%20Learning/Learning/Predicting%20Loan%20Repayments%20with%20Decision%20Trees%20and%20Random%20Forest.ipynb): Predicted whether a lender will repay their loan using decision trees and random forest.
* [Classifying anonymized data](https://github.com/astrosica/data-science-portfolio/blob/main/Machine%20Learning/Learning/Classifying%20Anonymized%20Data%20with%20KNN.ipynb): Classified anonymized data into two target classes using k-nearest neighbours (KNN).
* [Predicting ad clicks](https://github.com/astrosica/data-science-portfolio/blob/main/Machine%20Learning/Learning/Predicting%20Ad%20Clicks%20with%20Logistic%20Regression.ipynb): Predicted whether someone will click on an ad using logistic regression.