An open API service indexing awesome lists of open source software.

https://github.com/nafisalawalidris/data-science-methodology-


https://github.com/nafisalawalidris/data-science-methodology-

Last synced: 7 months ago
JSON representation

Awesome Lists containing this project

README

          

# Data Science Methodology
To execute any of the code cells below, you can click on the cell to select it and then press the "Shift" and "Enter" keys simultaneously. This will run the code in the selected cell.
### Insights ###
- The dataset contains 57,691 recipes, each with 384 ingredients. The data includes a categorical variable cuisine which identifies the type of cuisine for each recipe.

- The value_counts() method can be used to count the number of recipes for each cuisine. Based on the cuisine_counts output, it can be observed that the dataset includes a wide range of cuisines, with Italian being the most common, followed by Mexican and Southern US.

- The histogram of the number of ingredients used in the recipes shows that the majority of recipes use between 5 and 20 ingredients. There are a few recipes with over 50 ingredients, which may be outliers.

- The heatmap of the most commonly used ingredients in each cuisine shows that certain ingredients are more prevalent in some cuisines than others. For example, soy sauce and sesame oil are commonly used in East Asian cuisines, while oregano and cumin are commonly used in Latin American cuisines.