https://github.com/sofwerx/safehouse-algorithm

Host: GitHub
URL: https://github.com/sofwerx/safehouse-algorithm
Owner: sofwerx
Created: 2018-05-17T13:38:29.000Z (about 7 years ago)
Default Branch: master
Last Pushed: 2018-05-18T15:24:25.000Z (about 7 years ago)
Last Synced: 2025-01-20T05:40:31.091Z (4 months ago)
Language: Python
Size: 458 KB
Stars: 0
Watchers: 3
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md

# General

## Safehouse Algorithm
The safehouse algorithm was used to identify whether an adversary was conducting attack behavior against the safehouse. The data sources used from Elasticsearch were:

1. "ifttt"
2. "persondetect"
3. "webcam-pcap"
4. "safehouse-ap-devices"
5. "gammarf"

The data was converted into a sequence of events and modeled to establish a baseline behavior. If a later sequence of events deviated from that baseline, the algorithm flagged the behavior as an attack.
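As a rough illustration of the baseline idea, the sketch below counts which event-to-event transitions occur in known-good sequences and scores a new sequence by the fraction of transitions it has never seen. This is a simplified stand-in using transition counting, not the repository's model. The event labels (borrowed from the data source names), the function names, and the `THRESHOLD` value are all assumptions made for the example.

```python
from collections import Counter


def bigrams(events):
    """Return consecutive (previous, next) event pairs from a sequence."""
    return list(zip(events, events[1:]))


def fit_baseline(sequences):
    """Count how often each event transition occurs in baseline sequences."""
    counts = Counter()
    for seq in sequences:
        counts.update(bigrams(seq))
    return counts


def deviation_score(baseline, events):
    """Fraction of transitions in `events` that never occur in the baseline."""
    pairs = bigrams(events)
    if not pairs:
        return 0.0
    unseen = sum(1 for pair in pairs if pair not in baseline)
    return unseen / len(pairs)


# Hypothetical event labels; real events would carry more detail.
baseline_sequences = [
    ["persondetect", "ifttt", "safehouse-ap-devices"],
    ["persondetect", "ifttt", "webcam-pcap"],
]
baseline = fit_baseline(baseline_sequences)

normal = ["persondetect", "ifttt", "webcam-pcap"]
suspicious = ["gammarf", "safehouse-ap-devices", "gammarf"]

THRESHOLD = 0.5  # assumed cut-off, not taken from the repository
normal_score = deviation_score(baseline, normal)
suspicious_score = deviation_score(baseline, suspicious)
is_attack = suspicious_score > THRESHOLD
```

Here `normal` contains only transitions seen in the baseline, while every transition in `suspicious` is new, so only the latter crosses the threshold.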

The model chosen is the Apriori algorithm. It was chosen for its robustness to outlier events and its capability to model a sequence of events. The algorithm was first introduced to assist in the feature reduction process. The next stage in this project is to validate multiple machine learning models.
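For readers unfamiliar with Apriori, here is a minimal, standard-library sketch of frequent itemset mining. It assumes events have already been grouped into transactions (for example, the set of event types seen in one time window). That grouping, the sample `windows`, and the `min_support` value are assumptions for illustration and are not taken from the repository.

```python
from itertools import combinations


def apriori(transactions, min_support):
    """Return {frozenset(itemset): support} for all frequent itemsets."""
    transactions = [frozenset(t) for t in transactions]
    n = len(transactions)

    def support(itemset):
        # Fraction of transactions that contain every item in `itemset`.
        return sum(1 for t in transactions if itemset <= t) / n

    # Level 1: frequent single items.
    items = {item for t in transactions for item in t}
    current = {frozenset([i]) for i in items}
    current = {c for c in current if support(c) >= min_support}

    frequent = {}
    k = 1
    while current:
        for itemset in current:
            frequent[itemset] = support(itemset)
        k += 1
        # Join step: union pairs of frequent (k-1)-itemsets into k-itemsets.
        candidates = {a | b for a in current for b in current if len(a | b) == k}
        # Prune step: every (k-1)-subset of a candidate must be frequent.
        candidates = {
            c for c in candidates
            if all(frozenset(s) in current for s in combinations(c, k - 1))
        }
        current = {c for c in candidates if support(c) >= min_support}
    return frequent


# Hypothetical windows of event types observed together.
windows = [
    {"persondetect", "ifttt"},
    {"persondetect", "ifttt", "webcam-pcap"},
    {"persondetect", "webcam-pcap"},
    {"gammarf"},
]
frequent = apriori(windows, min_support=0.5)
```

Itemsets that fall below `min_support`, such as the lone `gammarf` window, drop out of the result, which is one way rare events can be kept from dominating a baseline. Note that classic Apriori treats each transaction as an unordered set, so any ordering information has to be encoded in the items themselves.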

### Note:
This code and algorithm were used as a proof of concept. Please do not deploy this code into a production environment. The model can serve as a baseline for the model selection and validation process.

# Code to Run

### Steps:

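Run the following commands in order. The host side of the `-v` mount in the `docker run` command (the path before the colon) is the author's local checkout; substitute the path to your own clone.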

```
git clone https://github.com/sofwerx/safehouse-data-transformations.git
```

```
cd Docker
```

```
docker build -t safehouse .
```

```
docker run -ti --rm -e TZ=America/New_York -v /home/david/Documents/safehouse-algorithm:/home/david/Documents/safehouse-algorithm safehouse bash
```

```
cd /home/david/Documents/safehouse-algorithm/prod
```

```
python3 codetorun.py
```

# Safehouse Data

### Access Data from Github
If you would like to train a new model, please clone this repo: https://github.com/sofwerx/safehouse-data.git

The data in this repo is in JSON format. Each hit in the JSON files is an observation. I trained this model by treating each observation as an action and modeling a sequence of actions.
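To make "each hit is an observation" concrete, the sketch below walks the hits in a document shaped like a standard Elasticsearch search response (`hits.hits[]._source`). Whether the files in the data repo use exactly this layout is an assumption, and the field names in `sample` are invented for the example.

```python
import json

# Hypothetical sample in the shape of an Elasticsearch search response.
sample = """
{"hits": {"hits": [
  {"_index": "persondetect",
   "_source": {"timestamp": "2018-05-17T13:38:31Z", "count": 1}},
  {"_index": "persondetect",
   "_source": {"timestamp": "2018-05-17T13:39:02Z", "count": 2}}
]}}
"""


def iter_observations(document):
    """Yield (index name, source fields) for every hit in the document."""
    for hit in document.get("hits", {}).get("hits", []):
        yield hit.get("_index"), hit.get("_source", {})


observations = list(iter_observations(json.loads(sample)))
```

Each element of `observations` is one action in the sequence that the model is trained on.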

To retrain the model provided in the Safehouse Algorithm repo with the Safehouse Data repo:

1. Select the data sources listed in General.
2. Standardize the time format across the JSON files.
3. Concatenate all features for each observation, keeping time as a separate feature.
4. Append the data sources together and sort the combined data by the standardized time feature.
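The steps above can be sketched with the standard library as follows. This is one possible reading of the procedure, not the repository's code. The per-source time field names, the sample records, and the `source|key=value` event encoding are all hypothetical.

```python
from datetime import datetime, timezone


def to_utc(value):
    """Parse an ISO 8601 string or an epoch-milliseconds number into UTC."""
    if isinstance(value, (int, float)):
        return datetime.fromtimestamp(value / 1000, tz=timezone.utc)
    parsed = datetime.fromisoformat(value.replace("Z", "+00:00"))
    if parsed.tzinfo is None:
        # Assumption: timestamps without an offset are already UTC.
        parsed = parsed.replace(tzinfo=timezone.utc)
    return parsed.astimezone(timezone.utc)


def to_event(source, record, time_field):
    """Join all non-time features into one label; keep time separately."""
    features = [f"{k}={record[k]}" for k in sorted(record) if k != time_field]
    return {
        "time": to_utc(record[time_field]),
        "event": "|".join([source] + features),
    }


# Step 1: selected sources, each with its (hypothetical) time field and records.
raw = {
    "persondetect": ("timestamp", [{"timestamp": "2018-05-17T14:00:00Z", "count": 1}]),
    "ifttt": ("time", [{"time": 1526564309000, "action": "door_open"}]),
}

# Steps 2 and 3: standardize time and concatenate the remaining features.
events = [
    to_event(source, record, time_field)
    for source, (time_field, records) in raw.items()
    for record in records
]

# Step 4: append all sources and sort by the standardized time.
events.sort(key=lambda e: e["time"])
sequence = [e["event"] for e in events]
```

The resulting `sequence` is a single time-ordered list of event labels drawn from all sources, which is the form a sequence model would consume.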
