Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/zenklinov/hotelling-t-square
This repository contains the implementation of a multivariate control chart with dimension reduction techniques, namely Factor Analysis of Mixed Data (FAMD) and Autoencoder. The control chart is designed for detecting network intrusions using network data traffic.
https://github.com/zenklinov/hotelling-t-square
autoencoder famd intrusion-detection-system multivariate-control-chart
Last synced: 2 days ago
JSON representation
This repository contains the implementation of a multivariate control chart with dimension reduction techniques, namely Factor Analysis of Mixed Data (FAMD) and Autoencoder. The control chart is designed for detecting network intrusions using network data traffic.
- Host: GitHub
- URL: https://github.com/zenklinov/hotelling-t-square
- Owner: zenklinov
- License: apache-2.0
- Created: 2024-04-03T02:16:51.000Z (8 months ago)
- Default Branch: main
- Last Pushed: 2024-04-05T07:29:30.000Z (7 months ago)
- Last Synced: 2024-04-05T08:32:45.333Z (7 months ago)
- Topics: autoencoder, famd, intrusion-detection-system, multivariate-control-chart
- Language: Jupyter Notebook
- Homepage:
- Size: 11.5 MB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# Multivariate Control Chart Hotelling T-Square with Dimension Reduction using FAMD and Autoencoder
This repository contains the implementation of a multivariate control chart with dimension reduction techniques, namely Factor Analysis of Mixed Data (FAMD) and Autoencoder. The control chart is designed for detecting network intrusions using network data traffic.
## Problem Statement
Conventional multivariate control diagrams for detecting network intrusions face several challenges:
1. **False Alarms:** Occur due to network data traffic distributions not aligning with existing theories.
2. **Masking Effects:** Limitations in detecting outlier intrusions.
3. **Various Data Types:** Handling both categorical and continuous data types.## Solution Proposed
To address these challenges, this project proposes a multivariate control chart with the following techniques:
1. **Factor Analysis of Mixed Data (FAMD):** A technique for reducing data that contains both quantitative and qualitative variables.
2. **Autoencoder:** A technique for dimensional reduction specifically designed for quantitative variables, aimed at enhancing the performance of multivariate control charts.## Implementation
The proposed multivariate control chart is based on the conventional T2 control chart. We compare the performance of conventional T2 with T2 integrated with FAMD and Autoencoder for dimension reduction.
Using data simulation from UNSW-NB 15 dataset, the results demonstrate the following accuracies:
- Conventional T2: 64%
- T2 with FAMD: 74%
- T2 with Autoencoder: 76%## Repository Contents
- `Hotelling-T-Square-with-FAMD-Autoencoder.ipynb`: Jupyter notebook containing the implementation of the multivariate control chart with FAMD and Autoencoder.
- `UNSW DATA.zip`: Contains the dataset files `NUSW-NB15_features.csv`, `UNSW_NB15_testing-set.csv`, and `UNSW_NB15_training-set.csv`.
- `inferensi article`: Link to the published paper detailing the methodology and results.## Usage
1. Clone the repository:
git clone https://github.com/zenklinov/Hotelling-T-Square.git2. Extract the dataset files from `UNSW DATA.zip`.
3. Open and run `Hotelling-T-Square-with-FAMD-Autoencoder.ipynb` in a Jupyter notebook environment.
## References
- [Link to Published Paper](https://iptek.its.ac.id/index.php/inferensi/article/view/18751)
For more detailed information, refer to the published paper linked above.
## Contributors
- Kevin Agung Fernanda Rifki
- Niam Rosyadi
- Novri Suhermi
- Amanatullah Pandu ZenklinovFeel free to contribute by submitting issues or pull requests.
## License
This project is licensed under the [MIT License](LICENSE).