An open API service indexing awesome lists of open source software.

https://github.com/shanukatiyar111/project3

PYTHON + SQL PROJECT- ANALYSIS OF ORDER DATA USING JUPYTER NOTEBOOK AND SSMS
https://github.com/shanukatiyar111/project3

jupyter kaggle kaggle-dataset pandas python sql ssms

Last synced: 4 months ago
JSON representation

PYTHON + SQL PROJECT- ANALYSIS OF ORDER DATA USING JUPYTER NOTEBOOK AND SSMS

Awesome Lists containing this project

README

          

# Project3

1. About Dataset-
This dataset contains the 2022 and 2023 global mart sales datasets.
2. Used Python version 3.10.10 and Jupyter version 7.2.1 for this project.
3. Used Kaggle API to Download Data from Kaggle.
4. Used Jupyter Notebook to read data into Pandas dataframe, and transform data using Python.
5. Transformation includes-
renaming column ,
changing data type,
adding calculated columns
dropping unnecessary columns
6. Used SQLalchemy to load data into df_orders table into prebuilt Database project4python_sql.
5. Used window authentication to load data into SSMS and ODBC driver for connection.
6. Analysis of Data in SSMS includes -
find the top 10 highest revenue-generating product
find the top 5 products in each region by highest revenue-generating
find month-over-month growth for years 2022 and 2023
for each category which month has the highest sales
which subcategory has the highest YoY growth