https://github.com/sr-lab/gspider

Guess success probability slider, for plotting the evolution of password guessing attacks.
https://github.com/sr-lab/gspider

data-structure dependent-types password-guessers probabilistic-models risk-assessment

Last synced: 2 days ago
JSON representation

Guess success probability slider, for plotting the evolution of password guessing attacks.

Host: GitHub
URL: https://github.com/sr-lab/gspider
Owner: sr-lab
License: mit
Created: 2019-05-04T09:49:20.000Z (over 6 years ago)
Default Branch: master
Last Pushed: 2023-12-30T21:48:38.000Z (almost 2 years ago)
Last Synced: 2025-03-01T00:29:30.671Z (7 months ago)
Topics: data-structure, dependent-types, password-guessers, probabilistic-models, risk-assessment
Language: Idris
Homepage: https://sr-lab.github.io/gspider/
Size: 401 KB
Stars: 2
Watchers: 4
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

          # GSPIDER

Guess success probability slider, for plotting the evolution of password guessing attacks.

![Logo](assets/logo-text-h.svg)

## Overview

GSPIDER (**g**uess **s**uccess **p**robability sl**ider**) is a utility, written in the dependently-typed programming language [Idris](https://www.idris-lang.org/) that plots the evolution of a password guessing attack against a password dataset. At the moment, it's a proof-of-concept, but it's still usable for small-scale models.

## Building

You'll need [Idris](https://www.idris-lang.org/download/) installed to build the project. From the root of the repo:

```bash

cd ./src

idris Main.idr -p contrib -o gspider.exe

```

## Usage

Call the program like this, from the root of the repo:

```bash

./src/gspider.exe ./systems/.sys ./dists/.freqs ./attacks/.att > ./results.log

```

Here's an overview of what those options mean:

| Position | Name         | Description                                                                                                                                                                 |

|----------|--------------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------|

| 1        | System       | This specifies the supported character set of the system you're modelling. Two sample system files come with the software, which you can find in `/systems`.                |

| 2        | Distribution | This specifies the distribution of passwords on the system you're modelling. Four sample distribution files come with the software, which you can find in `/distributions`. |

| 3        | Attack       | This specifies the password guessing attack you're modelling. A sample attack comes with this software, which you can find in `/attacks`.                                   |

As a quick example, from the root of the repo, run the following:

```bash

./src/gspider.exe ./systems/ascii.sys ./dists/faithwriters.freqs ./attacks/top10k.att > ./results.log

```

This will leave you with a file called `results.log` in the repo root, that will contain the guess success probability of the attack after each guess (or at every *frame*). The file will look something like this:

```

Frame is initial.

0.005458852610979503

0.007003810897105778

0.007621794211556288

0.007827788649706457

0.008342774745081882

0.008342774745081882

...

0.2126892573900505

0.2126892573900505

0.2126892573900505

0.2126892573900505

0.2126892573900505

0.2126892573900505

Frame is terminal.

```

Plotting this data as a line graph makes for some interesting visualisations! The following graph was generated by running the output of the `/attacks/top10k.att` attack on each distribution in `/distributions` through a plotting script based on [Matplotlib](https://matplotlib.org/).

![Graph](docs/svg/paf-dataset-graph.svg)

## Dependent Types

Dependent types are employed for type-safe reasoning across systems in the GSPIDER model:

### Restricted Character-Set String

At the core of the probabilistic attack frame type is the restricted character-set string, which is a string type restricted to containing some specific set of characters. It's encoded as below.

```idris

||| Returns true if the given list of characters `str` contains only characters specified in `chars`.

|||

||| @chars the list of permitted characters

||| @str the string to check

madeOf' : (chars : List Char) -> (str : List Char) -> Bool

madeOf' chars [] = True

madeOf' chars (x :: xs) = elem x chars && madeOf' chars xs

||| Returns true if the given string `str` contains only characters specified in `chars`.

|||

||| @chars the list of permitted characters

||| @str the string to check

export

madeOf : (chars : List Char) -> (str : String) -> Bool

madeOf chars str = madeOf' chars (unpack str)

||| Strings that are restricted to only a specific set of characters.

|||

||| @allowed the list of characters allowed in the string

public export

data RestrictedCharString : (allowed : List Char) -> Type where

  ||| Constructs a restricted character set string with the specified value.

  |||

  ||| @val the value of the string

  MkRestrictedCharString : (val : String) ->

                           {auto prf : So (madeOf allowed val)} ->

                           RestrictedCharString allowed

```

### Distributions

A distribution is just a function that maps restricted character-set strings to floating-point values. With `RestrictedCharString` defined, we can go ahead and define the `Distribution` dependent type as below.

```idris

||| Represents a password probability distribution for a system.

|||

||| @s the system

public export

Distribution : (s : System) -> Type

Distribution s = (RestrictedCharString s) -> Double

```

Probability distributions themselves are calculated from real-world password frequency distributions using the [Idris probability package](https://github.com/fieldstrength/probability) which draws heavily on the work of Erwig and Kollmansberger in [_Probabilisitic Functional Programming in Haskell_](https://web.engr.oregonstate.edu/~erwig/pfp/).

### Probabilistic Attack Frames

Probabilistic attack frames are a new datatype, used by GSPIDER, to model guessing attack evolution in a type-safe way. They make use of restricted character-set strings to ensure that both the password distribution and guessing attack relate to passwords containing the same specific subset of characters. It wouldn't make sense, for example, to attempt to input the password `hunter2` on an ATM, which only supports numeric passwords. This is one of the problems that dependently-typed PAFs address (see below).

```idris

||| Represents a probabilistic attack frame.

|||

||| @ n the number of pending guesses at this frame

||| @ m the number of made guesses at this frame

public export

data AttackFrame : (s : System) -> (n : Nat) -> (m : Nat) -> Type where

  -- Included for completeness.

  Empty : (d : Distribution s) ->

          AttackFrame s Z Z

  Initial : (p : Vect (S n) (RestrictedCharString s)) ->

            (d : Distribution s) ->

            AttackFrame s (S n) Z

  Ongoing : (p : Vect (S n) (RestrictedCharString s)) ->

            (g : Vect (S m) (RestrictedCharString s)) ->

            (d : Distribution s) ->

            (q : Double) ->

            AttackFrame s (S n) (S m)

  Terminal : (g : Vect (S m) (RestrictedCharString s)) ->

             (d : Distribution s) ->

             (q : Double) ->

             AttackFrame s Z (S m)

```

## Computing Lockout Policies

This utility comes with a file `/scripts/lockout.py` which allows you to compute a _lockout policy_ for a system based on the output yielded by GSPIDER. A lockout policy is just the minimum number of guesses we can allow a user to make while keeping the probability of a guessing attack being successful against a randomly-chosen account on our system below a certain acceptable threshold. Try it out like this (must be from the `/scripts` directory):

```bash

python lockout.py ../systems/ascii.sys ../dists/elitehacker.freqs ../attacks/top10k.att 0.05

```

You'll get some nice friendly output that looks like this:

```

A maximum of 14 guesses can be made by this attack in order for guess success probability to remain below 0.05.

```

## Limitations

GSPIDER is still very much in the proof-of-concept stage. With this in mind, there are a few limitations:

* Frequency file/attack size are limited to a few thousand entries each. I this this might be stack space related, but more digging is required.

## Acknowledgements

I would like to thank the following people for making this project possible:

* [Daniel Miessler](https://github.com/danielmiessler) and all the contributors and maintainers of [SecLists](https://github.com/danielmiessler/SecLists) which contains password datasets used to create the example distribution files in this repository.

* [Cliff Harvey](https://github.com/fieldstrength) and all the contributors and maintainers of [the probability library](https://github.com/fieldstrength/probability) which this project makes use of.

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/sr-lab/gspider

Awesome Lists containing this project

README