An open API service indexing awesome lists of open source software.

https://github.com/winkjs/wink-distance

Distance/Similarity functions for Bag of Words, Strings, Vectors and more.
https://github.com/winkjs/wink-distance

bag-of-words chebyshev cosine distance hamming jaccard jaro manhattan sets similarity soundex string taxicab tversky vectors wink

Last synced: 5 months ago
JSON representation

Distance/Similarity functions for Bag of Words, Strings, Vectors and more.

Awesome Lists containing this project

README

          

# wink-distance

Distance/Similarity functions for Bag of Words, Strings, Vectors and more.

### [![Build Status](https://api.travis-ci.com/winkjs/wink-distance.svg?branch=master)](https://app.travis-ci.com/github/winkjs/wink-distance) [![Coverage Status](https://coveralls.io/repos/github/winkjs/wink-distance/badge.svg?branch=master)](https://coveralls.io/github/winkjs/wink-distance?branch=master) [![Gitter](https://img.shields.io/gitter/room/nwjs/nw.js.svg)](https://gitter.im/winkjs/Lobby)

[](http://wink.org.in/)

Compute distances or similarities needed for NLP, de-duplication and clustering using **`wink-distance`**. Some of the methods are listed below:

1. Cosine similarity for Bag of Words,
1. Jaccard & Tversky for Sets,
1. Jaro, Jaro-Winkler, and Levenshtien for string,
1. Chebyshev and Taxicab for vectors.

> Combine `wink-distance` with [WinkNLP](https://github.com/winkjs/wink-nlp) to build awesome NLP apps.

### Installation

Use [npm](https://www.npmjs.com/package/wink-distance) to install:

npm install wink-distance --save

### Documentation
Check out the [distance/similarity API documentation](https://winkjs.org/wink-distance) to learn more.

### Need Help?

If you spot a bug and the same has not yet been reported, raise a new [issue](https://github.com/winkjs/wink-distance/issues) or consider fixing it and sending a pull request.

### About wink
[Wink](http://winkjs.org/) is a family of open source packages for **Statistical Analysis**, **Natural Language Processing** and **Machine Learning** in NodeJS. The code is **thoroughly documented** for easy human comprehension and has a **test coverage of ~100%** for reliability to build production grade solutions.

### Copyright & License

**wink-distance** is copyright 2017-23 [GRAYPE Systems Private Limited](http://graype.in/).

It is licensed under the terms of the MIT License.