https://github.com/mxagar/data_science_udacity

My personal notes, code and projects of the Udacity Data Science Nanodegree.
https://github.com/mxagar/data_science_udacity

dashboard data-analysis data-engineering data-science machine-learning-pipelines

Last synced: 3 months ago
JSON representation

My personal notes, code and projects of the Udacity Data Science Nanodegree.

Host: GitHub
URL: https://github.com/mxagar/data_science_udacity
Owner: mxagar
Created: 2022-05-05T13:20:33.000Z (about 3 years ago)
Default Branch: main
Last Pushed: 2024-03-21T13:21:38.000Z (over 1 year ago)
Last Synced: 2025-02-15T12:50:56.743Z (5 months ago)
Topics: dashboard, data-analysis, data-engineering, data-science, machine-learning-pipelines
Language: Jupyter Notebook
Homepage:
Size: 32.6 MB
Stars: 0
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

README

        # Udacity Data Science Nanodegree: Personal Notes

These are my personal notes taken while following the [Udacity Data Science Nanodegree](https://www.udacity.com/course/data-scientist-nanodegree--nd025).

The Nanodegree asssumes basic data analysis skills with python libraries (pandas, numpy, matplotlib, sklearn, etc.) and has 5 modules that build up on those skills; each module has its corresponding folder in this repository with its guide Markdown file:

1. Introduction to Data Science: [`01_Intro_Data_Science`](./01_Intro_Data_Science/DSND_Introduction.md).

2. Software Engineering: [`02_SoftwareEngineering`](./02_SoftwareEngineering/DSND_SWEngineering.md).

3. Data Engineering: [`03_DataEngineering`](./03_DataEngineering/DSND_DataEngineering.md).

4. Experimental Design & Recommendations: [`04_ExperimentalDesign_RecSys`](./04_ExperimentalDesign_RecSys/DSND_ExperimentalDesign_RecSys.md).

5. Data Scientist Capstone (Spark): [`05_Capstone_Project`](./05_Capstone_Project/DSND_Capstone.md).

Additionally, it is necessary to submit and pass some projects to get the certification:

- Create a data science project and write a blog post: [airbnb_data_analysis](https://github.com/mxagar/airbnb_data_analysis).

- Disaster response prediction pipeline deployed on a Flask app: [disaster_response_pipeline](https://github.com/mxagar/disaster_response_pipeline).

- Recommender System which suggests new articles to the users of the IBM Watson Studio Platform: [recommendations_ibm](https://github.com/mxagar/recommendations_ibm).

- Capstone project: Prediction of customer churn of a music streaming service using Spark: [sparkify_customer_churn](https://github.com/mxagar/sparkify_customer_churn).

A regular python environment with the usual data science packages should suffice (i.e., scikit-learn, pandas, matplotlib, etc.); any special/additional packages and their installation commands are introduced in the guides. A recipe to set up a [conda](https://docs.conda.io/en/latest/) environment with my current packages is the following:

```bash

conda create --name ds pip python=3.10

conda activate ds

pip install -r requirements.txt

```

As a side note, I list here some related **free Udacity courses** on several topics:

- **Big Data**

  - [Intro to Hadoop and MapReduce](https://www.udacity.com/course/intro-to-hadoop-and-mapreduce--ud617)

  - [Deploying a Hadoop Cluster](https://www.udacity.com/course/deploying-a-hadoop-cluster--ud1000)

  - [Real-Time Analytics with Apache Storm](https://www.udacity.com/course/real-time-analytics-with-apache-storm--ud381)

  - [Big Data Analytics in Healthcare](https://www.udacity.com/course/big-data-analytics-in-healthcare--ud758)

  - [Spark](https://www.udacity.com/course/learn-spark-at-udacity--ud2002)

- **Databases and APIs**

  - [Data Wrangling with MongoDB](https://www.udacity.com/course/data-wrangling-with-mongodb--ud032)

  - [SQL for Data Analysis](https://www.udacity.com/course/sql-for-data-analysis--ud198)

  - [Designing RESTful APIs](https://www.udacity.com/course/designing-restful-apis--ud388)

- **Interview Preparation**

  - [Data Science Interview Prep](https://www.udacity.com/course/data-science-interview-prep--ud944)

  - [Machine Learning Interview Preparation](https://www.udacity.com/course/machine-learning-interview-prep--ud1001)

Mikel Sagardia, 2022.  

No guarantees.

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/mxagar/data_science_udacity

Awesome Lists containing this project

README