{"id":25975784,"url":"https://github.com/markoshb/my-data-science-learning-projects","last_synced_at":"2026-04-05T08:34:19.554Z","repository":{"id":277557111,"uuid":"932801734","full_name":"MarkosHB/My-Data-Science-Learning-Projects","owner":"MarkosHB","description":"Short but illustrative notebooks to showcase data-analysis in Python","archived":false,"fork":false,"pushed_at":"2025-03-02T15:45:40.000Z","size":214,"stargazers_count":1,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-03-02T16:33:25.162Z","etag":null,"topics":["data-science","matplotlib-pyplot","pandas","python","pythorch","scikit-learn","tensorflow"],"latest_commit_sha":null,"homepage":"","language":"Jupyter Notebook","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/MarkosHB.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2025-02-14T14:49:38.000Z","updated_at":"2025-03-02T15:45:43.000Z","dependencies_parsed_at":null,"dependency_job_id":"2a29937c-9ff8-429e-9830-c93dd8933dd5","html_url":"https://github.com/MarkosHB/My-Data-Science-Learning-Projects","commit_stats":null,"previous_names":["markoshb/my-data-science-learning-projects"],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/MarkosHB%2FMy-Data-Science-Learning-Projects","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/MarkosHB%2FMy-Data-Science-Learning-Projects/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/MarkosHB%2FMy-Data-Science-Learning-Projects/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/MarkosHB%2FMy-Data-Science-Learning-Projects/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/MarkosHB","download_url":"https://codeload.github.com/MarkosHB/My-Data-Science-Learning-Projects/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":241956646,"owners_count":20048672,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["data-science","matplotlib-pyplot","pandas","python","pythorch","scikit-learn","tensorflow"],"created_at":"2025-03-05T03:24:05.699Z","updated_at":"2025-12-30T21:33:45.560Z","avatar_url":"https://github.com/MarkosHB.png","language":"Jupyter Notebook","funding_links":[],"categories":[],"sub_categories":[],"readme":"# 🚀 My Data Science Learning Projects.\n\nWelcome to my personal collection of hands-on data science projects! This repository showcases my journey exploring and mastering various data science concepts, tools, and techniques. \n🌟 Stay tuned as I continue to expand this repository with more exciting projects!\n  \n\u003e [!Note] \n\u003e Have a look at this repo about my [Machine Learning Subject](https://github.com/MarkosHB/Machine-Learning-Subject) wich contains even more projects written in R.\n\n### 🌸 Iris Classification.\n- 📄 [Notebook](https://github.com/MarkosHB/My-Data-Science-Learning-Projects/blob/main/iris/notebook.ipynb)\n- 🛠️ **Technologies:** Pandas, PyTorch.\n- 🧪 **Summary:** A classic classification problem using the Iris dataset to practice data manipulation, visualization, and building simple neural networks.\n\n### 🩺 Diabetes Prediction.\n- 📄 [Notebook](https://github.com/MarkosHB/My-Data-Science-Learning-Projects/blob/main/diabetes/notebook.ipynb)\n- 🛠️ **Technologies:** Scikit-learn, TensorFlow.\n- 🧪 **Summary:** Predicting the likelihood of diabetes using machine learning models, focusing on data preprocessing and model evaluation.\n\n### 🧑‍⚕️ Breast Cancer.\n- 📄 [Notebook](https://github.com/MarkosHB/My-Data-Science-Learning-Projects/blob/main/breast_cancer/notebook.ipynb)\n- 🛠️ **Technologies:** Autokeras, Scikit-learn.\n- 🧪 **Summary:** Automated approach to classify breast cancer cases. The project leverages AutoKeras to find optimal deep learning models with minimal manual tuning.\n\n### 🍷 Wine.\n- 📄 [Notebook](https://github.com/MarkosHB/My-Data-Science-Learning-Projects/blob/main/wine/notebook.ipynb)\n- 🛠️ **Technologies:** Pyspark, Pandas.\n- 🧪 **Summary:** The script processes the Wine dataset using Apache Spark, performing data cleaning, exploration, and applying custom pandas UDFs for additional transformations.\n\n### 🏠 California Housing.\n- 📄 [Notebook](https://github.com/MarkosHB/My-Data-Science-Learning-Projects/blob/main/california_housing/notebook.ipynb)\n- 🛠️ **Technologies:** Dask, Scikit-learn.\n- 🧪 **Summary:** This time, we will use an alternative to Pandas so that parallel computing is considered when manipulating dataframes thanks to the library Dask.\n\n### 💭 Movie Reviews.\n- 📄 [Notebook](https://github.com/MarkosHB/My-Data-Science-Learning-Projects/blob/main/movie_reviews/notebook.ipynb)\n- 🛠️ **Technologies:** NLTK, Scikit-learn.\n- 🧪 **Summary:** The notebook analyzes movie reviews using the NLTK library, focusing on text preprocessing, feature extraction, and sentiment classification with a Naive Bayes model.\n\n### 📊 ManageYourData.\n- 💻 [Repository](https://github.com/MarkosHB/ManageYourData)\n- 🛠️ **Technologies:** Pandas, Matplotlib, FPDF, Openpyxl, Streamlit.\n- 🧪 **Summary:** A self-made tool for generating PDF reports from data files locally.\n\n---\n\n### 🎛️ Data analysis.\n- 📋 [Carprice report](https://github.com/MarkosHB/My-Data-Science-Learning-Projects/blob/main/carprice/carprice.pdf) and [Titanic report](https://github.com/MarkosHB/My-Data-Science-Learning-Projects/blob/main/titanic/titanic.pdf)\n-  🛠️ **Technologies:** Power BI.\n- 🧪 **Summary:** My first two dashboards ever made with Power BI allowed me to learn the basics of visualizing and manipulating data. \n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fmarkoshb%2Fmy-data-science-learning-projects","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fmarkoshb%2Fmy-data-science-learning-projects","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fmarkoshb%2Fmy-data-science-learning-projects/lists"}