{"id":28373994,"url":"https://github.com/samaalharbi2/project-data-science-blog-post","last_synced_at":"2026-01-26T08:33:40.784Z","repository":{"id":292252185,"uuid":"979637982","full_name":"samaalharbi2/Project-Data-Science-Blog-Post","owner":"samaalharbi2","description":"A data science project from Udacity’s Nanodegree — exploring what drives developer success","archived":false,"fork":false,"pushed_at":"2025-05-10T22:44:38.000Z","size":19122,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-06-25T18:45:59.715Z","etag":null,"topics":["crisp-dm","data-analysis","data-science","data-visualization","nanodegree","udacity"],"latest_commit_sha":null,"homepage":"","language":"Jupyter Notebook","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/samaalharbi2.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null,"zenodo":null}},"created_at":"2025-05-07T20:40:19.000Z","updated_at":"2025-05-10T22:44:41.000Z","dependencies_parsed_at":"2025-05-08T23:32:52.841Z","dependency_job_id":"206a3b35-c28c-40f7-b060-655e54585901","html_url":"https://github.com/samaalharbi2/Project-Data-Science-Blog-Post","commit_stats":null,"previous_names":["samaalharbi2/project-data-science-blog-post"],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/samaalharbi2/Project-Data-Science-Blog-Post","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/samaalharbi2%2FProject-Data-Science-Blog-Post","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/samaalharbi2%2FProject-Data-Science-Blog-Post/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/samaalharbi2%2FProject-Data-Science-Blog-Post/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/samaalharbi2%2FProject-Data-Science-Blog-Post/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/samaalharbi2","download_url":"https://codeload.github.com/samaalharbi2/Project-Data-Science-Blog-Post/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/samaalharbi2%2FProject-Data-Science-Blog-Post/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":28770842,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-01-26T07:45:00.504Z","status":"ssl_error","status_checked_at":"2026-01-26T07:45:00.070Z","response_time":59,"last_error":"SSL_read: unexpected eof while reading","robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":false,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["crisp-dm","data-analysis","data-science","data-visualization","nanodegree","udacity"],"created_at":"2025-05-29T20:38:12.310Z","updated_at":"2026-01-26T08:33:40.767Z","avatar_url":"https://github.com/samaalharbi2.png","language":"Jupyter Notebook","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Brains Behind the Code: What Makes a Developer Thrive 🧠\n\n\u003e A deep dive into Stack Overflow's 2024 Developer Survey — uncovering the habits, skills, and behaviors that shape developer success.\n\n## 📖 Table of Contents\n- [Business Understanding](#business-understanding) \n- [Project Motivation](#project-motivation)  \n- [Dataset \u0026 Link](#dataset--link)  \n- [Tools \u0026 Technologies](#tools--technologies)  \n- [Methodology](#methodology)    \n- [Key Findings](#Key--Findings)\n- [Acknowledgements](#acknowledgements)\n---\n\n## 💼 Business Understanding\n\u003cimg src=\"https://github.com/mbenetti/CRISP-DM-Rossmann/blob/master/CRISP-DM.png\" alt=\"CRISP Logo\" width=\"400\"/\u003e\n\n\nThe goal of this project is to bridge the gap between raw survey data and career-changing insights. We seek to understand how variables such as education, programming languages, and job search behavior influence salary and job outcomes. These findings are crucial for stakeholders such as:\n\n- **Hiring Managers** – To understand talent behavior trends.  \n- **Developers** – To adjust skills and strategies.  \n- **Educators** – To tailor tech education programs.  \n- **Career Coaches** – To guide job seekers based on data.\n\n---\n\n## 🎯 Project Motivation\n\nWhy do some developers seem to land jobs faster, earn more, or thrive in diverse tech stacks?  \nThis project aims to explore trends in developer behavior, education, and technology usage to answer questions like:\n- How does education affect job search efficiency?\n- Which programming languages dominate the industry?\n- What does the salary distribution tell us about developer value?\n\nWith data from over 80,000 developers worldwide, the goal is to extract actionable insights and uncover patterns that can empower aspiring and current developers to make informed career decisions.\n\n---\n\n## 📂 Dataset \u0026 Link\n\n- **Source**: [Kaggle - Stack Overflow Developer Survey 2024](https://www.kaggle.com/datasets/berkayalan/stack-overflow-annual-developer-survey-2024?select=survey_results_public.csv)  \n- **File Used**: `survey_results_public.csv`\n\n---\n\n## 🛠 Tools \u0026 Technologies\n\n- Python  \n- Pandas, NumPy  \n- Seaborn, Matplotlib  \n- Scikit-learn  \n- Jupyter Notebook  \n\n---\n\n## 🔬 Methodology\n\nOur analysis follows the **CRISP-DM** process:\n\n1. **Business Understanding** – Identify the purpose and value of the data.  \n2. **Data Understanding** – Load, explore, and summarize the data.  \n3. **Data Preparation** – Clean missing values, transform features.  \n4. **Modeling** – Train ML models where needed.  \n5. **Evaluation** – Assess insights/statistics and model performance.  \n6. **Deployment** – Share findings through visuals and blog storytelling.\n\n---\n## 🔍 Key Findings\n\nBased on the analysis of the 2024 Stack Overflow Developer Survey data:\n\n1. **Education Level**: Developers with higher education (master’s or above) tend to have slightly higher salaries, but experience has a stronger impact overall.\n\n2. **Experience and Salary**: Professional coding experience shows a clear positive correlation with salary, especially beyond 5 years.\n\n3. **Most Used Languages**: JavaScript and Python remain the most widely used languages, especially in web and data-related fields.\n\n4. **Remote Work Trends**: The majority of developers (over 60%) prefer hybrid or fully remote work, reflecting current global work trends.\n\n5. **Job Search Time**: Developers with higher education levels tend to spend less time searching for jobs each day.\n   \n---\n## 🙏 Acknowledgements\n\nHuge thanks to [Stack Overflow](https://stackoverflow.com/) for conducting the survey and [Kaggle](https://www.kaggle.com/) for hosting the data.  \nThis project was built as part of a data science learning journey to demonstrate both technical analysis and storytelling capabilities.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fsamaalharbi2%2Fproject-data-science-blog-post","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fsamaalharbi2%2Fproject-data-science-blog-post","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fsamaalharbi2%2Fproject-data-science-blog-post/lists"}