Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/dmarks84/ind_project_superstore-sales-time-series-analysis--kaggle
Independent Project - Kaggle Dataset-- I worked on the Superstore Sales Dataset, performing (as Part 1) data cleaning and preparation and exploratory data analysis. The main task was to make predictions for future sales based on time-series analysis, which is found in Part 2.
https://github.com/dmarks84/ind_project_superstore-sales-time-series-analysis--kaggle
chloropleth data-modeling data-visualization eda linear-regression matplotlib numpy pandas python seaborn sklearn statistics statsmodels supervised-ml time-series-analysis
Last synced: about 7 hours ago
JSON representation
Independent Project - Kaggle Dataset-- I worked on the Superstore Sales Dataset, performing (as Part 1) data cleaning and preparation and exploratory data analysis. The main task was to make predictions for future sales based on time-series analysis, which is found in Part 2.
- Host: GitHub
- URL: https://github.com/dmarks84/ind_project_superstore-sales-time-series-analysis--kaggle
- Owner: dmarks84
- License: bsd-3-clause
- Created: 2024-02-01T21:11:46.000Z (9 months ago)
- Default Branch: main
- Last Pushed: 2024-02-27T22:42:35.000Z (8 months ago)
- Last Synced: 2024-02-27T23:37:15.004Z (8 months ago)
- Topics: chloropleth, data-modeling, data-visualization, eda, linear-regression, matplotlib, numpy, pandas, python, seaborn, sklearn, statistics, statsmodels, supervised-ml, time-series-analysis
- Language: Jupyter Notebook
- Homepage: https://www.kaggle.com/danpmarks/superstore-time-series-analysis-part-1
- Size: 3.65 MB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# Ind_Project_Superstore-Sales-Time-Series-Analysis--Kaggle
## Summary
I worked on the Superstore Sales Dataset, performing (as Part 1) data cleaning and preparation and exploratory data analysis. The main task was to make predictions for future sales based on time-series analysis, which is found in Part 2. I identified overarching trends, found seasonality, and tested for cyclic contributions. I then looked at residuals and used this information, and numerous visuals, to predict an additional week's worth of data.## Results
I was able to identify a slight trend and some seasonality to the data as well. Unforutnatley, once removed, a lot of "noise" was left-- there were no cycles I could identify from using lag features, and the residuals were randomly distributed. This expected, for the most part, with sales data-- we see some seasonality with quarters and holidays, depending on what is being sold, but there can be any number of reasons why there are spikes or lulls.## Screenshots
![screenshot3](https://github.com/dmarks84/Ind_Project_Superstore-Sales-Time-Series-Analysis--Kaggle/blob/main/superstore_screenshot3.png?raw=true)
![screenshot2](https://github.com/dmarks84/Ind_Project_Superstore-Sales-Time-Series-Analysis--Kaggle/blob/main/superstore_screenshot2.png?raw=true)
![screenshot1](https://github.com/dmarks84/Ind_Project_Superstore-Sales-Time-Series-Analysis--Kaggle/blob/main/superstore_screenshot1.png?raw=true)## Skills (Developed & Applied)
Programming, Python, Statistics, Numpy, Pandas, Matplotlib, Scikit-learn, Dataframes, Data Modeling, EDA, Data Visualization, Data Reporting, Time-Series Analysis, Seaborn, Supervised ML, StatsModels