https://github.com/dmarks84/ind_project_superstore-sales-time-series-analysis--kaggle

Independent Project - Kaggle Dataset-- I worked on the Superstore Sales Dataset, performing (as Part 1) data cleaning and preparation and exploratory data analysis. The main task was to make predictions for future sales based on time-series analysis, which is found in Part 2.

chloropleth data-modeling data-visualization eda linear-regression matplotlib numpy pandas python seaborn sklearn statistics statsmodels supervised-ml time-series-analysis

# Ind_Project_Superstore-Sales-Time-Series-Analysis--Kaggle

## Summary
I worked with the Superstore Sales Dataset from Kaggle. Part 1 covers data cleaning, preparation, and exploratory data analysis. Part 2 covers the main task: predicting future sales with time-series analysis. I identified the overarching trend, found seasonality, and tested for cyclic contributions. I then examined the residuals and used that information, along with numerous visuals, to predict an additional week's worth of data.
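
The trend, seasonality, and one-week forecast steps described above can be illustrated roughly as follows. This is a minimal sketch, not the notebook's actual code: it assumes daily data, a linear trend, and a 7-day seasonal period, and it runs on synthetic numbers rather than the Superstore dataset. The function names are hypothetical.

```python
import numpy as np


def fit_trend_and_weekly_seasonality(sales):
    """Fit a linear trend plus day-of-week offsets to a daily series.

    Assumption: the series is daily and any seasonality repeats every 7 days.
    """
    sales = np.asarray(sales, dtype=float)
    t = np.arange(len(sales))
    # Linear trend: sales ~ slope * t + intercept
    slope, intercept = np.polyfit(t, sales, deg=1)
    detrended = sales - (slope * t + intercept)
    # Average detrended value for each position in the 7-day week
    seasonal = np.array([detrended[t % 7 == d].mean() for d in range(7)])
    # Whatever trend and seasonality do not explain
    residuals = detrended - seasonal[t % 7]
    return slope, intercept, seasonal, residuals


def forecast_next_week(n_observed, slope, intercept, seasonal):
    """Extend the fitted trend and seasonal offsets 7 steps ahead."""
    future_t = np.arange(n_observed, n_observed + 7)
    return slope * future_t + intercept + seasonal[future_t % 7]


# Synthetic stand-in for daily sales (NOT the Superstore data)
rng = np.random.default_rng(0)
days = np.arange(140)
weekly_pattern = np.array([0, 5, 10, 5, 0, -10, -10], dtype=float)
synthetic_sales = (
    100 + 0.5 * days + weekly_pattern[days % 7] + rng.normal(0, 1, size=days.size)
)

slope, intercept, seasonal, residuals = fit_trend_and_weekly_seasonality(synthetic_sales)
forecast = forecast_next_week(len(synthetic_sales), slope, intercept, seasonal)
print(forecast)
```

On real sales data the residuals left after this step are typically much noisier than in this synthetic case, so the forecast mostly reflects the trend and seasonal components.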

## Results
I was able to identify a slight trend and some seasonality in the data. Unfortunately, once these were removed, a lot of "noise" was left: there were no cycles I could identify using lag features, and the residuals were randomly distributed. This is expected, for the most part, with sales data. We see some seasonality with quarters and holidays, depending on what is being sold, but there can be any number of reasons for spikes or lulls.
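
One way to check whether lagged values carry any signal is to look at the autocorrelation of the residuals at a few lags. The sketch below is an illustration under assumptions, not the analysis from the notebook: it uses synthetic residuals, a hypothetical helper `lag_autocorrelation`, and the common rule of thumb that autocorrelations within about 2/sqrt(n) of zero are consistent with white noise.

```python
import numpy as np


def lag_autocorrelation(values, lag):
    """Sample autocorrelation of a series at a given lag (lag must be >= 1)."""
    values = np.asarray(values, dtype=float)
    centered = values - values.mean()
    denom = np.dot(centered, centered)
    return float(np.dot(centered[:-lag], centered[lag:]) / denom)


n = 500
rng = np.random.default_rng(42)
# Synthetic residuals with no structure, similar to what the project found
noise_like_residuals = rng.normal(0, 1, size=n)
# Synthetic residuals with a clear cycle (period of 50 steps), for contrast
cyclic_residuals = np.sin(2 * np.pi * np.arange(n) / 50)

# Rough 95% band for white noise
threshold = 2 / np.sqrt(n)

noise_acf = [lag_autocorrelation(noise_like_residuals, k) for k in range(1, 8)]
cyclic_acf_lag1 = lag_autocorrelation(cyclic_residuals, 1)

print("white-noise band: +/-", round(threshold, 3))
print("noise-like residuals, lags 1-7:", np.round(noise_acf, 3))
print("cyclic residuals, lag 1:", round(cyclic_acf_lag1, 3))
```

When every lag stays near the band, as with the noise-like series here, lag features are unlikely to improve a forecast, which matches the conclusion above.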

## Screenshots
![screenshot3](https://github.com/dmarks84/Ind_Project_Superstore-Sales-Time-Series-Analysis--Kaggle/blob/main/superstore_screenshot3.png?raw=true)
![screenshot2](https://github.com/dmarks84/Ind_Project_Superstore-Sales-Time-Series-Analysis--Kaggle/blob/main/superstore_screenshot2.png?raw=true)
![screenshot1](https://github.com/dmarks84/Ind_Project_Superstore-Sales-Time-Series-Analysis--Kaggle/blob/main/superstore_screenshot1.png?raw=true)

## Skills (Developed & Applied)
Programming, Python, Statistics, Numpy, Pandas, Matplotlib, Scikit-learn, Dataframes, Data Modeling, EDA, Data Visualization, Data Reporting, Time-Series Analysis, Seaborn, Supervised ML, StatsModels