An open API service indexing awesome lists of open source software.

https://github.com/philippus/osita

🧸 Tiny optimal string alignment distance library
https://github.com/philippus/osita

azerty damerau-levenshtein-distance optimal-string-alignment-distance qwerty qwerty-based-char-distance spelling

Last synced: 3 months ago
JSON representation

🧸 Tiny optimal string alignment distance library

Awesome Lists containing this project

README

          

# osita

[![build](https://github.com/Philippus/osita/workflows/build/badge.svg)](https://github.com/Philippus/osita/actions/workflows/build.yml?query=workflow%3Abuild+branch%3Amain)
[![codecov](https://codecov.io/gh/Philippus/osita/branch/main/graph/badge.svg)](https://codecov.io/gh/Philippus/osita)
![Current Version](https://img.shields.io/badge/version-0.1.0-brightgreen.svg?style=flat "0.1.0")
[![Scala Steward badge](https://img.shields.io/badge/Scala_Steward-helping-blue.svg?style=flat&logo=)](https://scala-steward.org)
[![license](https://img.shields.io/badge/license-MPL%202.0-blue.svg?style=flat "MPL 2.0")](LICENSE)

Osita is an implementation of the [Optimal String Alignment distance](https://en.wikipedia.org/wiki/Damerau%E2%80%93Levenshtein_distance#Optimal_string_alignment_distance)
algorithm. It implements the standard version of the algorithm and an extension of it where the substitution cost has
been replaced by a function which calculates the keyboard distance between characters using the Euclidean distance
between keys on a QWERTY or AZERTY-keyboard.
You can also supply your own substitution cost function.

## Installation
Osita is published for Scala 2.12, 2.13 and 3. To start using it add the following to your `build.sbt`:

```
libraryDependencies += "nl.gn0s1s" %% "osita" % "0.1.0"
```

## Example usage

```scala
import nl.gn0s1s.osita.Osita._

osa("abcde", "abcde") // val res0: Double = 0.0
osa("abcde", "abcd") // val res1: Double = 1.0
osaWithSubstitutionCost("abc", "agc")(qwertySubstitutionCost) // val res2: Double = 1.118033988749895

```

## Resources
- [Optimal String Alignment distance](https://en.wikipedia.org/wiki/Damerau%E2%80%93Levenshtein_distance#Optimal_string_alignment_distance)
- [Euclidean Distance](https://en.wikipedia.org/wiki/Euclidean_distance)
- Distances between keys on a QWERTY keyboard on Code Golf - https://codegolf.stackexchange.com/questions/233618/distances-between-keys-on-a-qwerty-keyboard
- Keyboard distance in Perl - https://metacpan.org/release/KRBURTON/String-KeyboardDistance-1.01/source/README

## License
The code is available under the [Mozilla Public License, version 2.0](LICENSE).