https://github.com/akotronis/qualitycontrol

HRH Quality Control app
https://github.com/akotronis/qualitycontrol

data-analysis gui-application latex newton-method oop pandas progress-bar pyinstaller pysimplegui python quality-control sqlite3

Last synced: about 2 months ago
JSON representation

HRH Quality Control app

Host: GitHub
URL: https://github.com/akotronis/qualitycontrol
Owner: akotronis
Created: 2021-10-20T13:55:26.000Z (over 4 years ago)
Default Branch: master
Last Pushed: 2022-01-17T16:15:32.000Z (over 4 years ago)
Last Synced: 2025-02-15T12:47:35.660Z (over 1 year ago)
Topics: data-analysis, gui-application, latex, newton-method, oop, pandas, progress-bar, pyinstaller, pysimplegui, python, quality-control, sqlite3
Language: Python
Homepage:
Size: 3.25 MB
Stars: 0
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

README

# SKU Quality Control Application

## Introduction

The purpose of _SKU Quality Control application_ is to investigate the atypical purchases that are
recorded in one audit period.
The process of finding atypical purchases reaches the SKU level per store
of the _Emrc Retail Audit sample._

The distance measure which is used to implement the above, based on Shanon’s theory, is the
following:

$\text{ }D(P_{t+1},P_t)=P_{t+1}\cdot\ln(P_{t+1}/P_t)+P_t-P_{t+1}\text{ }$

## An example

A typical example that describes this property is the following:
We have 2 stores where the purchases for the same SKU in 2 consecutive periods are

$\text{store A: }P_t=1, P_{t+1}=2\text{, and store B: }P_t=5, P_{t+1}=10.$

We immediately observe the doubling of the purchases in the period t + 1, that is, we have a 100%
increase in both stores. Here, the increase in store A should not worry us because it is reasonable and
expected to buy an additional SKU and we must pay attention to store B, something that the classic
distance measures do not indicate.
However, by using the above type of distance, we obtain

$D(P_{t+1},P_t)=0.386\text{ for the store A and }D(P_{t+1},P_t)=1.931\text{ for the store B,}$

## Implementation

This application implements the following 2 procedures.

1. The application builds for each SKU, per audit period (monthly, bi-monthly, etc.) and per cluster,
the distribution of the distance measure $D(P_{t+1}, P_t)$ . Each such distribution is built with the past
data and updated every audit period with the new data. From the study of these distributions the
percentiles 90%, 95% and 99% are calculated.
2. For all stores "i" of the audit period, the corresponding $D_i(P_{t+1}, P_t)$ is calculated per SKU and
compared with the 3 critical percentiles of the corresponding $D(P_{t+1}, P_t)$ .
If this Di is greater
than one of the 3 percentiles then it is characterized as atypical and the application suggests 2
optimum alternatives.

## Gui

The application performs analysis on user input files and results are stored in an **sqlite3** database. The user interacts with the application through a GUI and he can import files for analysis, export files, delete database contents and read the application documentation. Application info is displayed in the console.

### Main Interface

Below is the application main interface. The user can navigate from the top menu and select an
action.

![](resources/01_main_interface.jpg)

#### Console

The various messages produced by user actions are displayed on the console.

- The console can be cleared by selecting Console → Clear from the top level dropdown menu or by clicking on the console, pressing Ctrl+A and then Delete.
- The contents of the console can be copied by clicking on the console, pressing Ctrl+A, then Ctrl+C and then Ctrl+V to paste them anywhere you like.

#### Actions

The application expects actions in the following logical order:

1. Import Clusters
2. Import SKUS for analysis
3. Import Outlets to perform analysis based on the previous step

The application creates a database file (**db.sqlite3**) in the same folder of the executable, where the
imported items are stored. If this file is deleted, it will be automatically created again the next time the
application is launched and will have to be populated again with entries from new imports.

#### About

- Select _About → Database Current Status_ to print counts for the database tables on the console.
- Select _About → Importing_ to print information about importing files (see below).
- Select _About → SKU Quality Control Application_ to print quick info about the app.
- Select _About → Read the Docs_ to launch the documentation file.

![](resources/00_About.jpg)

## Importing files

The application accepts as input **.csv** or **.xlsx/.xls** files. The .csv files are imported much faster. In
the case of .xlsx/.xls imports, the application will try to convert the file to .csv before parsing it.

### Clusters

The imported file must follow the rules the user can see by selecting About → Importing → Clusters from the top level dropdown menu.

![](resources/02_about_importing_clusters.jpg)

Select _Import → Clusters_ from the top level dropdown and choose a cluster file to import.

![](resources/03_importing_clusters.jpg)

The appropriate messages will be displayed on the console in the case of success or failure accord-
ingly.

![](resources/04_importing_clusters_success.jpg)

![](resources/05_importing_clusters_failure.jpg)

Note that:

1. The imported file must comply with the rules mentioned above,
2. Every time a new file is submitted, **all existing clusters will be deleted and replaced by the new ones**.

### SKUs

The imported file must follow the rules the user can see by selecting _About → Importing → SKUs_ from the top level dropdown menu

![](resources/06_about_importing_skus.jpg)

Select _Import → SKUs →_ and the period type for the file you want to import from the top level dropdown.

![](resources/07_importing_skus_1.jpg)

A popup will appear with the period type that was selected from the menu. Make sure to select the file of the correct period type, or an error message will appear while parsing it.

![](resources/07_importing_skus_2.jpg)

The appropriate success/error messages will appear while parsing the file. If parsing is successful, a progress window will appear displaying the progress of the SKU analysis. Wait until the process is finished.

![](resources/07_importing_skus_3.jpg)

Note that:

1. The imported file must comply with the rules mentioned above,
2. Every time a new file is submitted, the entries will be appended to the database.
3. If any SKU from the selected period type already exists in the database, a warning message will appear and the database will not be updated. All SKUs of a specific period type must update the database from a single file.
4. Clusters must be up to date before importing SKUs.

### Outlets

The imported file must follow the rules the user can see by selecting _About → Importing → Outlets_ from the top level dropdown menu.

![](resources/08_about_importing_outlets.jpg)

Select _Import → Outlets →_ and the period type for the file you want to import from the top level
dropdown.

![](resources/09_importing_outlets_1.jpg)

the relevant pop up for selecting outlet file according to the selected period type, the success/error messages and the progress window will appear just as in the SKUs section. A report will also appear on the console about the atypical and missing entries found per outlet:

![](resources/09_importing_outlets_2.jpg)

After a successful file import, two database tables will be populated, a table for the _atypical_ entries found and a table for the _missing_ ones.

Note that:

1. The imported file must comply with the rules mentioned above,
2. Every time a new file is submitted, all existing atypicals and missing entries will be deleted and replaced by the new ones.
3. Clusters must be up to date before importing Outlets.

## Exporting files

### Clusters

Select _Export → Clusters_ from the top level dropdown and choose a cluster file name to export.

![](resources/10_exporting_clusters.jpg)

an excel file will be created with the columns

- _id_outlet_: the id of the outlet
- _mountly, food, non_food_: the cluster number for the corresponding period type.

### SKUs

Select _Export → SKUs → SKUs or SKU Analysis_ from the top level dropdown and choose a file name to export

![](resources/11_exporting_skus.jpg)

- SKUs will export all the SKUs that were imported in the database. Columns:
- _id_product, id_brand, id_sku, period_type, sku_name_: As in the imported files.
- _sku_file_name_: The name of the file rom where it was imported
- _imported_date_: The date where it was imported
-SKU Analysis will export a joined table with all the SKUs that were imported in the database and the statistics from the analysis, where present. Columns:
- _id_product, id_brand, id_sku, period_type, sku_name_: As above.
- _cluster_: The cluster from the corresponding outlet and period type
- _count, mean_diff, perc90_diff, perc95_diff, perc99_diff_: The statistics from the analysis performed

### Outlets

Select _Export → Outlets → Missing or Atypicals_ from the top level dropdown and choose a file name to export.

![](resources/12_exporting_outlets.jpg)

- Missing will export the missing entries from the database based on the outlet file on which the outlet analysis was performed. Columns:
- _id_outlet, id_product, id_brand, id_sku, period_type, cluster_: As above.
- _lm_purch, purch_: The corresponding last month purchases and the current purshases respectively.
- Atypicals will export the atypical entries from the database based on the outlet file on which the outlet analysis was performed. Columns:
- _id_product, id_brand, id_sku, period_type, cluster, sku_name, lm_purch, purch_: As above.
- _stars_:
- \*\*\* → D > _perc99_diff_,
- \*\* → D > _perc95_diff_ and <= _perc99_diff_,
- \* → D > _perc90_diff_ and <= _perc95_diff_,
- _proposed_purch_1, proposed_purch_2_: Two proposed values for purchases produced by the analysis based on Newton’s method.

## Deleting Content

Select _Delete → Clusters or SKUs_ from the top level dropdown.

![](resources/13_deleting_content.jpg)

- Choosing Clusters will delete the Clusters database table contents which will have to be repopulated by a new import before performing SKU or Outlet analysis.
- Choosing SKUs will delete all the other database tables’ contents which will have to be repopulated by a new import before performing SKU or Outlet analysis.

# Code

The application is made with **_Python_**. GUI is made with _PySimpleGUI_, executable file with _PyInstaller_ and code is optimized with _LineProfiler_. Other packages used are _sqlite3_, _math_, _numpy_, _pandas_, _subprocess_, _base64_, _tempfile_, _webbrowser_

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/akotronis/qualitycontrol

Awesome Lists containing this project

README